Search Results for author: Li Su

Found 25 papers, 15 papers with code

Weakly-Supervised Crowd Counting Learns from Sorting rather than Locations

no code implementations • ECCV 2020 • Yifan Yang, Guorong Li, Zhe Wu, Li Su, Qingming Huang, Nicu Sebe

We propose a soft-label sorting network along with the counting network, which sorts the given images by their crowd numbers.

Crowd Counting

Paper
Add Code

Adapting pretrained speech model for Mandarin lyrics transcription and alignment

1 code implementation • 21 Nov 2023 • Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

With the use of data augmentation and source separation model, results show that the proposed method achieves a character error rate of less than 18% on a Mandarin polyphonic dataset for lyrics transcription, and a mean absolute error of 0. 071 seconds for lyrics alignment.

Automatic Lyrics Transcription Data Augmentation

Paper
Code

Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Mapping and Cross Frequency Coupling

no code implementations • 29 Oct 2023 • Xiong Xiong, Li Su, Jinguo Huang, Guixia Kang

Objective: Motor Imagery (MI) serves as a crucial experimental paradigm within the realm of Brain Computer Interfaces (BCIs), aiming to decoding motor intentions from electroencephalogram (EEG) signals.

EEG Motor Imagery

Paper
Add Code

Popularity, face and voice: Predicting and interpreting livestreamers' retail performance using machine learning techniques

no code implementations • 29 Oct 2023 • Xiong Xiong, Fan Yang, Li Su

Livestreaming commerce, a hybrid of e-commerce and self-media, has expanded the broad spectrum of traditional sales performance determinants.

Explainable artificial intelligence Feature Importance

Paper
Add Code

Self-supervised Cross-view Representation Reconstruction for Change Captioning

1 code implementation • ICCV 2023 • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang

Change captioning aims to describe the difference between a pair of similar images.

Caption Generation Hallucination

Paper
Code

A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription

no code implementations • 12 Apr 2023 • Sangeon Yong, Li Su, Juhan Nam

Note-level automatic music transcription is one of the most representative music information retrieval (MIR) tasks and has been studied for various instruments to understand music.

Information Retrieval Music Information Retrieval +2

Paper
Add Code

Neighborhood Contrastive Transformer for Change Captioning

1 code implementation • 6 Mar 2023 • Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang

Change captioning is to describe the semantic change between a pair of similar images in natural language.

Image Captioning

Paper
Code

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

no code implementations • 29 Nov 2022 • Haochuan Cui, Junjie Sheng, Bo Jin, Yiqiu Hu, Li Su, Lei Zhu, Wenli Zhou, Xiangfeng Wang

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences.

Cloud Computing Scheduling

Paper
Add Code

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

1 code implementation • 23 Nov 2021 • Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Weigang Zhang, Qingming Huang

Based on TDC, we propose the temporal dynamic concept modeling network (TDCMN) to learn an accurate and complete concept representation for efficient untrimmed video analysis.

Image Categorization

Paper
Code

Self-Regulated Learning for Egocentric Video Activity Anticipation

1 code implementation • 23 Nov 2021 • Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Qingming Huang, Qi Tian

Future activity anticipation is a challenging problem in egocentric vision.

Multi-Task Learning

Paper
Code

Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience

1 code implementation • 25 Oct 2021 • Wei-Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su

In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way.

Music Generation Music Style Transfer +1

Paper
Code

ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data

no code implementations • 11 Jul 2021 • Kin Wai Cheuk, Dorien Herremans, Li Su

Most of the current supervised automatic music transcription (AMT) models lack the ability to generalize.

Continual Learning Music Transcription

Paper
Add Code

Omnizart: A General Toolbox for Automatic Music Transcription

1 code implementation • 1 Jun 2021 • Yu-Te Wu, Yin-Jyun Luo, Tsung-Ping Chen, I-Chieh Wei, Jui-Yang Hsu, Yi-Chin Chuang, Li Su

We present and release Omnizart, a new Python library that provides a streamlined solution to automatic music transcription (AMT).

Chord Recognition Information Retrieval +3

1,556

Paper
Code

SE-Harris and eSUSAN: Asynchronous Event-Based Corner Detection Using Megapixel Resolution CeleX-V Camera

no code implementations • 2 May 2021 • Jinjian Li, Chuandong Guo, Li Su, Xiangyu Wang, Quan Hu

The proposed eSUSAN extracts the univalue segment assimilating nucleus from the circle kernel based on the similarity across timestamps and distinguishes corner events by the number of pixels in the nucleus area.

Paper
Add Code

Rethinking Graph Neural Architecture Search from Message-passing

1 code implementation • CVPR 2021 • Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang

Inspired by the strong searching capability of neural architecture search (NAS) in CNN, this paper proposes Graph Neural Architecture Search (GNAS) with novel-designed search space.

feature selection Neural Architecture Search

Paper
Code

Crossing You in Style: Cross-modal Style Transfer from Music to Visual Arts

1 code implementation • 17 Sep 2020 • Cheng-Che Lee, Wan-Yi Lin, Yen-Ting Shih, Pei-Yi Patricia Kuo, Li Su

Its major difference from the traditional image style transfer problem is that the style information is provided by music rather than images.

Generative Adversarial Network Style Transfer

Paper
Code

Temporally Guided Music-to-Body-Movement Generation

1 code implementation • 17 Sep 2020 • Hsuan-Kai Kao, Li Su

This paper presents a neural network model to generate virtual violinist's 3-D skeleton movements from music audio.

Paper
Code

A Human-Computer Duet System for Music Performance

no code implementations • 16 Sep 2020 • Yuen-Jen Lin, Hsuan-Kai Kao, Yih-Chih Tseng, Ming Tsai, Li Su

Virtual musicians have become a remarkable phenomenon in the contemporary multimedia arts.

Pose Estimation

Paper
Add Code

Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding

1 code implementation • 5 Sep 2019 • Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, Qingming Huang

Weakly supervised referring expression grounding (REG) aims at localizing the referential entity in an image according to linguistic query, where the mapping between the image region (proposal) and the query is unknown in the training stage.

Object Referring Expression +2

Paper
Code

Cascaded Partial Decoder for Fast and Accurate Salient Object Detection

1 code implementation • CVPR 2019 • Zhe Wu, Li Su, Qingming Huang

In this paper, we propose a novel Cascaded Partial Decoder (CPD) framework for fast and accurate salient object detection.

Ranked #1 on RGB Salient Object Detection on ISTD

Camouflaged Object Segmentation object-detection +2

275

Paper
Code

Multi-layered Cepstrum for Instantaneous Frequency Estimation

1 code implementation • 1 Feb 2019 • Chin-Yun Yu, Li Su

We propose the multi-layered cepstrum (MLC) method to estimate multiple fundamental frequencies (MF0) of a signal under challenging contamination such as high-pass filter noise.

Paper
Code

A Streamlined Encoder/Decoder Architecture for Melody Extraction

1 code implementation • 30 Oct 2018 • Tsung-Han Hsieh, Li Su, Yi-Hsuan Yang

Our experiments on both vocal melody extraction and general melody extraction validate the effectiveness of the proposed model.

Melody Extraction

Paper
Code

Vocal melody extraction using patch-based CNN

3 code implementations • 24 Apr 2018 • Li Su

A patch-based convolutional neural network (CNN) model presented in this paper for vocal melody extraction in polyphonic music is inspired from object detection in image processing.

Sound Audio and Speech Processing

Paper
Code

Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription

no code implementations • 26 Jun 2017 • Li Su

This paper presents a new approach in understanding how deep neural networks (DNNs) work by applying homomorphic signal processing techniques.

Music Transcription Relation

Paper
Add Code

多通道之多重音頻串流方法之研究(Multi-channel Source Clustering of Polyphonic Music) [In Chinese]

no code implementations • ROCLINGIJCLCLP 2016 • Chih Yi Kuan, Li Su, Yu Hao Chin, Jia-Ching Wang

Clustering

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.