Search Results for author: Li Su

Found 25 papers, 15 papers with code

Weakly-Supervised Crowd Counting Learns from Sorting rather than Locations

no code implementations ECCV 2020 Yifan Yang, Guorong Li, Zhe Wu, Li Su, Qingming Huang, Nicu Sebe

We propose a soft-label sorting network along with the counting network, which sorts the given images by their crowd numbers.

Crowd Counting

Adapting pretrained speech model for Mandarin lyrics transcription and alignment

1 code implementation21 Nov 2023 Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

With the use of data augmentation and source separation model, results show that the proposed method achieves a character error rate of less than 18% on a Mandarin polyphonic dataset for lyrics transcription, and a mean absolute error of 0. 071 seconds for lyrics alignment.

Automatic Lyrics Transcription Data Augmentation

Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Mapping and Cross Frequency Coupling

no code implementations29 Oct 2023 Xiong Xiong, Li Su, Jinguo Huang, Guixia Kang

Objective: Motor Imagery (MI) serves as a crucial experimental paradigm within the realm of Brain Computer Interfaces (BCIs), aiming to decoding motor intentions from electroencephalogram (EEG) signals.

EEG Motor Imagery

Popularity, face and voice: Predicting and interpreting livestreamers' retail performance using machine learning techniques

no code implementations29 Oct 2023 Xiong Xiong, Fan Yang, Li Su

Livestreaming commerce, a hybrid of e-commerce and self-media, has expanded the broad spectrum of traditional sales performance determinants.

Explainable artificial intelligence Feature Importance

A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription

no code implementations12 Apr 2023 Sangeon Yong, Li Su, Juhan Nam

Note-level automatic music transcription is one of the most representative music information retrieval (MIR) tasks and has been studied for various instruments to understand music.

Information Retrieval Music Information Retrieval +2

Neighborhood Contrastive Transformer for Change Captioning

1 code implementation6 Mar 2023 Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang

Change captioning is to describe the semantic change between a pair of similar images in natural language.

Image Captioning

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

no code implementations29 Nov 2022 Haochuan Cui, Junjie Sheng, Bo Jin, Yiqiu Hu, Li Su, Lei Zhu, Wenli Zhou, Xiangfeng Wang

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences.

Cloud Computing Scheduling

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

1 code implementation23 Nov 2021 Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Weigang Zhang, Qingming Huang

Based on TDC, we propose the temporal dynamic concept modeling network (TDCMN) to learn an accurate and complete concept representation for efficient untrimmed video analysis.

Image Categorization

Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience

1 code implementation25 Oct 2021 Wei-Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su

In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way.

Music Generation Music Style Transfer +1

Omnizart: A General Toolbox for Automatic Music Transcription

1 code implementation1 Jun 2021 Yu-Te Wu, Yin-Jyun Luo, Tsung-Ping Chen, I-Chieh Wei, Jui-Yang Hsu, Yi-Chin Chuang, Li Su

We present and release Omnizart, a new Python library that provides a streamlined solution to automatic music transcription (AMT).

Chord Recognition Information Retrieval +3

SE-Harris and eSUSAN: Asynchronous Event-Based Corner Detection Using Megapixel Resolution CeleX-V Camera

no code implementations2 May 2021 Jinjian Li, Chuandong Guo, Li Su, Xiangyu Wang, Quan Hu

The proposed eSUSAN extracts the univalue segment assimilating nucleus from the circle kernel based on the similarity across timestamps and distinguishes corner events by the number of pixels in the nucleus area.

Rethinking Graph Neural Architecture Search from Message-passing

1 code implementation CVPR 2021 Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang

Inspired by the strong searching capability of neural architecture search (NAS) in CNN, this paper proposes Graph Neural Architecture Search (GNAS) with novel-designed search space.

feature selection Neural Architecture Search

Crossing You in Style: Cross-modal Style Transfer from Music to Visual Arts

1 code implementation17 Sep 2020 Cheng-Che Lee, Wan-Yi Lin, Yen-Ting Shih, Pei-Yi Patricia Kuo, Li Su

Its major difference from the traditional image style transfer problem is that the style information is provided by music rather than images.

Generative Adversarial Network Style Transfer

Temporally Guided Music-to-Body-Movement Generation

1 code implementation17 Sep 2020 Hsuan-Kai Kao, Li Su

This paper presents a neural network model to generate virtual violinist's 3-D skeleton movements from music audio.

A Human-Computer Duet System for Music Performance

no code implementations16 Sep 2020 Yuen-Jen Lin, Hsuan-Kai Kao, Yih-Chih Tseng, Ming Tsai, Li Su

Virtual musicians have become a remarkable phenomenon in the contemporary multimedia arts.

Pose Estimation

Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding

1 code implementation5 Sep 2019 Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, Qingming Huang

Weakly supervised referring expression grounding (REG) aims at localizing the referential entity in an image according to linguistic query, where the mapping between the image region (proposal) and the query is unknown in the training stage.

Object Referring Expression +2

Multi-layered Cepstrum for Instantaneous Frequency Estimation

1 code implementation1 Feb 2019 Chin-Yun Yu, Li Su

We propose the multi-layered cepstrum (MLC) method to estimate multiple fundamental frequencies (MF0) of a signal under challenging contamination such as high-pass filter noise.

A Streamlined Encoder/Decoder Architecture for Melody Extraction

1 code implementation30 Oct 2018 Tsung-Han Hsieh, Li Su, Yi-Hsuan Yang

Our experiments on both vocal melody extraction and general melody extraction validate the effectiveness of the proposed model.

Melody Extraction

Vocal melody extraction using patch-based CNN

3 code implementations24 Apr 2018 Li Su

A patch-based convolutional neural network (CNN) model presented in this paper for vocal melody extraction in polyphonic music is inspired from object detection in image processing.

Sound Audio and Speech Processing

Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription

no code implementations26 Jun 2017 Li Su

This paper presents a new approach in understanding how deep neural networks (DNNs) work by applying homomorphic signal processing techniques.

Music Transcription Relation

Cannot find the paper you are looking for? You can Submit a new open access paper.