no code implementations • ECCV 2020 • Yifan Yang, Guorong Li, Zhe Wu, Li Su, Qingming Huang, Nicu Sebe
We propose a soft-label sorting network along with the counting network, which sorts the given images by their crowd numbers.
1 code implementation • 21 Nov 2023 • Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang
With the use of data augmentation and a source separation model, results show that the proposed method achieves a character error rate of less than 18% on a Mandarin polyphonic dataset for lyrics transcription, and a mean absolute error of 0.071 seconds for lyrics alignment.
no code implementations • 29 Oct 2023 • Xiong Xiong, Fan Yang, Li Su
Livestreaming commerce, a hybrid of e-commerce and self-media, has expanded the broad spectrum of traditional sales performance determinants.
no code implementations • 29 Oct 2023 • Xiong Xiong, Li Su, Jinguo Huang, Guixia Kang
Objective: Motor Imagery (MI) serves as a crucial experimental paradigm within the realm of Brain Computer Interfaces (BCIs), aiming to decode motor intentions from electroencephalogram (EEG) signals.
1 code implementation • ICCV 2023 • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
Change captioning aims to describe the difference between a pair of similar images.
no code implementations • 12 Apr 2023 • Sangeon Yong, Li Su, Juhan Nam
Note-level automatic music transcription is one of the most representative music information retrieval (MIR) tasks and has been studied for various instruments to understand music.
1 code implementation • 6 Mar 2023 • Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang
Change captioning aims to describe the semantic change between a pair of similar images in natural language.
no code implementations • 29 Nov 2022 • Haochuan Cui, Junjie Sheng, Bo Jin, Yiqiu Hu, Li Su, Lei Zhu, Wenli Zhou, Xiangfeng Wang
With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences.
1 code implementation • 23 Nov 2021 • Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Weigang Zhang, Qingming Huang
Based on TDC, we propose the temporal dynamic concept modeling network (TDCMN) to learn an accurate and complete concept representation for efficient untrimmed video analysis.
1 code implementation • 23 Nov 2021 • Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Qingming Huang, Qi Tian
Future activity anticipation is a challenging problem in egocentric vision.
1 code implementation • 25 Oct 2021 • Wei-Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su
In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way.
no code implementations • 11 Jul 2021 • Kin Wai Cheuk, Dorien Herremans, Li Su
Most of the current supervised automatic music transcription (AMT) models lack the ability to generalize.
1 code implementation • 1 Jun 2021 • Yu-Te Wu, Yin-Jyun Luo, Tsung-Ping Chen, I-Chieh Wei, Jui-Yang Hsu, Yi-Chin Chuang, Li Su
We present and release Omnizart, a new Python library that provides a streamlined solution to automatic music transcription (AMT).
no code implementations • 2 May 2021 • Jinjian Li, Chuandong Guo, Li Su, Xiangyu Wang, Quan Hu
The proposed eSUSAN extracts the univalue segment assimilating nucleus from the circle kernel based on the similarity across timestamps and distinguishes corner events by the number of pixels in the nucleus area.
1 code implementation • CVPR 2021 • Shaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang
Inspired by the strong searching capability of neural architecture search (NAS) in CNN, this paper proposes Graph Neural Architecture Search (GNAS) with a newly designed search space.
1 code implementation • 17 Sep 2020 • Hsuan-Kai Kao, Li Su
This paper presents a neural network model to generate a virtual violinist's 3-D skeleton movements from music audio.
1 code implementation • 17 Sep 2020 • Cheng-Che Lee, Wan-Yi Lin, Yen-Ting Shih, Pei-Yi Patricia Kuo, Li Su
Its major difference from the traditional image style transfer problem is that the style information is provided by music rather than images.
no code implementations • 16 Sep 2020 • Yuen-Jen Lin, Hsuan-Kai Kao, Yih-Chih Tseng, Ming Tsai, Li Su
Virtual musicians have become a remarkable phenomenon in the contemporary multimedia arts.
1 code implementation • 5 Sep 2019 • Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, Qingming Huang
Weakly supervised referring expression grounding (REG) aims at localizing the referential entity in an image according to a linguistic query, where the mapping between the image region (proposal) and the query is unknown in the training stage.
1 code implementation • CVPR 2019 • Zhe Wu, Li Su, Qingming Huang
In this paper, we propose a novel Cascaded Partial Decoder (CPD) framework for fast and accurate salient object detection.
Ranked #1 on RGB Salient Object Detection on ISTD
1 code implementation • 1 Feb 2019 • Chin-Yun Yu, Li Su
We propose the multi-layered cepstrum (MLC) method to estimate multiple fundamental frequencies (MF0) of a signal under challenging contamination such as high-pass filter noise.
1 code implementation • 30 Oct 2018 • Tsung-Han Hsieh, Li Su, Yi-Hsuan Yang
Our experiments on both vocal melody extraction and general melody extraction validate the effectiveness of the proposed model.
3 code implementations • 24 Apr 2018 • Li Su
The patch-based convolutional neural network (CNN) model presented in this paper for vocal melody extraction in polyphonic music is inspired by object detection in image processing.
Sound • Audio and Speech Processing
no code implementations • 26 Jun 2017 • Li Su
This paper presents a new approach to understanding how deep neural networks (DNNs) work by applying homomorphic signal processing techniques.