no code implementations • 28 Nov 2023 • Jian Yu, Yi Yu, Feipeng Da
Large parallax image stitching is a challenging task.
1 code implementation • 23 Nov 2023 • Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li
Single point-supervised object detection is gaining attention due to its cost-effectiveness.
no code implementations • 2 Oct 2023 • Zhe Zhang, Karol Lasocki, Yi Yu, Atsuhiro Takasu
We leverage character-level language models for syllable-level lyrics generation from symbolic melody.
no code implementations • 1 Oct 2023 • Julien Lalanne, Raphael Bournet, Yi Yu
Live commenting on video, a popular feature of live streaming platforms, enables viewers to engage with the content and share their comments, reactions, opinions, or questions with the streamer or other viewers while watching the video or live stream.
1 code implementation • 30 Sep 2023 • Wenjie Yin, Qingyuan Yao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman
To complement it, we introduce JustLMD, a new multimodal dataset of 3D dance motion with music and lyrics.
no code implementations • 25 Jul 2023 • Shengyue Yao, Jingru Yu, Yi Yu, Jia Xu, Xingyuan Dai, Honghai Li, Fei-Yue Wang, Yilun Lin
Furthermore, an operation algorithm is proposed regarding the issue of structural rigidity in DAO.
no code implementations • 25 Jul 2023 • Yi Yu, Wenlian Lu, BoYu Chen
We propose theoretical analyses of a modified natural gradient descent method in the neural network function space based on the eigendecompositions of neural tangent kernel and Fisher information matrix.
1 code implementation • ICCV 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Different from a vanilla diffusion model that has to perform Gaussian denoising, with the injected physics-based exposure model, our restoration process can directly start from a noisy image instead of pure noise.
Ranked #1 on Image Denoising on SID x300
1 code implementation • 21 Jun 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Besides, we propose a novel design of the context model, which can better predict the order masks of encoding/decoding based on both the sRGB image and the masks of already processed features.
no code implementations • 5 Jun 2023 • Zhe Zhang, Yi Yu, Atsuhiro Takasu
Lyrics-to-melody generation is an interesting and challenging topic in the AI music research field.
no code implementations • 27 Apr 2023 • Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling
To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition.
no code implementations • 29 Mar 2023 • Lu Lu, Yi Yu, Zongsheng Zheng, Guangya Zhu, Xiaomin Yang
Two Andrew's sine estimator (ASE)-based robust adaptive filtering algorithms are proposed in this brief.
1 code implementation • 23 Mar 2023 • Dichucheng Li, Mingjin Che, Wenwu Meng, Yulun Wu, Yi Yu, Fan Xia, Wei Li
Instrument playing technique (IPT) is a key element of musical presentation.
Instrument Playing Technique Detection
Multi-Label Classification
1 code implementation • 21 Mar 2023 • Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Yi Yu, Yifang Yin, Rajiv Ratn Shah
To mitigate this, we build a talking face generation framework conditioned on a categorical emotion to generate videos with appropriate expressions, making them more realistic and convincing.
Ranked #1 on Talking Face Generation on CREMA-D
no code implementations • 4 Mar 2023 • Qinghua He, Wanyu Li, Yaping Shi, Yi Yu, Yi Zhang, Wenqian Geng, Zhiyuan Sun, Ruikang K Wang
This study highlights the potential of SpeCamX to improve the prediction of bio-chromophores, and its ability to transform an ordinary smartphone into a powerful medical tool without the need for additional investments or expertise.
no code implementations • CVPR 2023 • Yi Yu, YuFei Wang, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
Extensive experiments show that with our trained trigger injection models and simple modification of encoder parameters (of the compression model), the proposed attack can successfully inject several backdoors with corresponding triggers in a single image compression model.
1 code implementation • CVPR 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex Kot, Bihan Wen
While raw images exhibit advantages over sRGB images (e.g., linearity and fine-grained quantization level), they are not widely used by common users due to the large storage requirements.
no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang
We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way similar to the music creation of humans.
no code implementations • ACM Multimedia Asia 2022 • Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Dhroov Goel, Sakshat Mali, Yi Yu, Yifang Yin, Rajiv Ratn Shah
Lip synchronization and talking face generation have gained a specific interest from the research community with the advent and need of digital communication in different fields.
1 code implementation • CVPR 2023 • Yi Yu, Feipeng Da
With the vigorous development of computer vision, oriented object detection has gradually gained prominence.
7 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei Zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li
The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.
no code implementations • 19 Sep 2022 • Dichucheng Li, Yulun Wu, Qinyu Li, Jiahao Zhao, Yi Yu, Fan Xia, Wei Li
Because each Guzheng playing technique is applied to a note, a dedicated onset detector is trained to divide an audio into several notes and its predictions are fused with frame-wise IPT predictions.
no code implementations • 14 Aug 2022 • YaQin Li, Lingli Li, Yongjin Xu, Yi Yu
In the generative model, one of the reward components, a binding affinity predictor, is based on 1D protein sequence and molecular SMILES.
no code implementations • 4 Aug 2022 • Yi Yu, Hongsen He, Rodrigo C. de Lamare, Badong Chen
In this paper, we propose a general robust subband adaptive filtering (GR-SAF) scheme against impulsive noise by minimizing the mean square deviation under the random-walk model with individual weight uncertainty.
no code implementations • 30 Jun 2022 • Wei Duan, Zhe Zhang, Yi Yu, Keizo Oyama
Generating melody from lyrics is an interesting yet challenging task in the area of artificial intelligence and music.
no code implementations • 25 Jun 2022 • Tao Yu, Rodrigo C. de Lamare, Yi Yu
This paper studies distributed diffusion adaptation over clustered multi-task networks in the presence of impulsive interferences and Byzantine attacks.
1 code implementation • 16 May 2022 • Yi Yu, Karl Borjesson
Transformer models have been developed in molecular science with excellent performance in applications including quantitative structure-activity relationship (QSAR) and virtual screening (VS).
no code implementations • 15 May 2022 • Yi Yu, Zongxin Huang, Hongsen He, Yuriy Zakharov, Rodrigo C. de Lamare
This paper proposes a unified sparsity-aware robust normalized subband adaptive filtering (SA-RNSAF) algorithm for identification of sparse systems under impulsive noise.
1 code implementation • 2 May 2022 • Weixing Wei, Peilin Li, Yi Yu, Wei Li
Sounds, especially music, contain various harmonic components scattered in the frequency dimension.
1 code implementation • 28 Apr 2022 • Guangwei Gao, Zhengxue Wang, Juncheng Li, Wenjie Li, Yi Yu, Tieyong Zeng
Single-image super-resolution (SISR) has achieved significant breakthroughs with the development of deep learning.
1 code implementation • CVPR 2022 • Yi Yu, Wenhan Yang, Yap-Peng Tan, Alex C. Kot
Finally, we examine various types of adversarial attacks that are specific to deraining problems and their effects on both human and machine vision tasks, including 1) rain region attacks, adding perturbations only in the rain regions to make the perturbations in the attacked rain images less visible; 2) object-sensitive attacks, adding perturbations only in regions near the given objects.
no code implementations • 19 Mar 2022 • Lu Lu, Yi Yu, Rodrigo C. de Lamare, Xiaomin Yang
We propose a novel M-estimate conjugate gradient (CG) algorithm, termed Tukey's biweight M-estimate CG (TbMCG), for system identification in impulsive noise environments.
no code implementations • 28 Feb 2022 • Luís Vilaça, Yi Yu, Paula Viana
Audio-visual correlation learning aims to capture essential correspondences and understand natural phenomena between audio and video.
1 code implementation • 13 Feb 2022 • Qiqi He, Xiaoheng Sun, Yi Yu, Wei Li
Chorus detection is a challenging problem in musical signal processing as the chorus often repeats more than once in popular songs, usually with rich instruments and complex rhythm forms.
1 code implementation • 16 Dec 2021 • Guangwei Gao, Wenjie Li, Juncheng Li, Fei Wu, Huimin Lu, Yi Yu
Convolutional neural network-based single-image super-resolution (SISR) has made great progress in recent years.
no code implementations • 5 Dec 2021 • Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li
On the one hand, audio encoder and visual encoder separately encode audio data and visual data into two different latent spaces.
no code implementations • 19 Oct 2021 • Lu Lu, Kai-Li Yin, Rodrigo C. de Lamare, Zongsheng Zheng, Yi Yu, Xiaomin Yang, Badong Chen
Most of the literature focuses on the development of linear active noise control (ANC) techniques.
no code implementations • 1 Oct 2021 • Lu Lu, Kai-Li Yin, Rodrigo C. de Lamare, Zongsheng Zheng, Yi Yu, Xiaomin Yang, Badong Chen
Active noise control (ANC) is an effective way for reducing the noise level in electroacoustic or electromechanical systems.
no code implementations • 7 Sep 2021 • YaQin Li, Yongjin Xu, Yi Yu
Our strategy takes advantage of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method.
1 code implementation • 2 Sep 2021 • Guangwei Gao, Guoan Xu, Juncheng Li, Yi Yu, Huimin Lu, Jian Yang
Specifically, FBSNet employs a symmetrical encoder-decoder structure with two branches, semantic information branch and spatial detail branch.
no code implementations • 14 Aug 2021 • Gang Guo, Yi Yu, Rodrigo C. de Lamare, Zongsheng Zheng, Lu Lu, Qiangming Cai
In addition, an adaptive approach for the choice of the thresholding parameter in the proximal step is also proposed based on the minimization of the mean square deviation.
1 code implementation • 6 Aug 2021 • Xuejiao Tang, Wenbin Zhang, Yi Yu, Kea Turner, Tyler Derr, Mengyu Wang, Eirini Ntoutsi
While image understanding at the recognition level has achieved remarkable advancements, reliable visual scene understanding requires comprehensive image understanding not only at the recognition level but also at the cognition level, which calls for exploiting multi-source information as well as learning different levels of understanding and extensive commonsense knowledge.
no code implementations • ACL 2021 • Yi Yu, Adam Jatowt, Antoine Doucet, Kazunari Sugiyama, Masatoshi Yoshikawa
In this paper, we address a novel task, Multiple TimeLine Summarization (MTLS), which extends the flexibility and versatility of Time-Line Summarization (TLS).
no code implementations • 30 Jul 2021 • Xiaotian Yu, Hanling Yi, Yi Yu, Ling Xing, Shiliang Zhang, Xiaoyu Wang
There has been a recent surge of research interest in attacking the problem of social relation inference based on images.
1 code implementation • Conference 2021 • Xingcai Wu, Yucheng Xie, Jiaqi Zeng, Zhenguo Yang, Yi Yu, Qing Li, Wenyin Liu
In this paper, we propose an adversarial learning framework with mask reconstruction (ALMR) for image inpainting with textual guidance, which consists of a two-stage generator and dual discriminators.
1 code implementation • NeurIPS 2021 • Oscar Hernan Madrid Padilla, Yi Yu, Alessandro Rinaldo
We study piece-wise constant signals corrupted by additive Gaussian noise over a $d$-dimensional lattice.
no code implementations • 31 Mar 2021 • Yi Yu, Feipeng Da, Ziyu Zhang
Without fine-tuning on the test set, the Rank-1 Recognition Rate (RR1) is achieved as follows: 98.85% on the FRGC v2.0 dataset and 99.33% on the Bosphorus dataset, which demonstrates the effectiveness and potential of our method.
1 code implementation • 26 Mar 2021 • Guangwei Gao, Hao Shao, Fei Wu, Meng Yang, Yi Yu
This paper pays close attention to the cross-modality visible-infrared person re-identification (VI Re-ID) task, which aims to match pedestrian samples between visible and infrared modes.
Cross-Modality Person Re-identification
Knowledge Distillation
no code implementations • 25 Mar 2021 • Guangwei Gao, Yi Yu, Jian Yang, Guo-Jun Qi, Meng Yang
(i) To learn more robust and discriminative features, we desire to adaptively fuse the contextual features from different layers.
no code implementations • 24 Mar 2021 • Zhengxue Wang, Guangwei Gao, Juncheng Li, Yi Yu, Huimin Lu
Recently, the single image super-resolution (SISR) approaches with deep and complex convolutional neural network structures have achieved promising performance.
no code implementations • 24 Mar 2021 • Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue
In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones.
no code implementations • 1 Feb 2021 • Anne Gael Manegueu, Alexandra Carpentier, Yi Yu
On top of the switching bandit problem (Case a), we are interested in three concrete examples: (b) the means of the arms are local polynomials, (c) the means of the arms are locally smooth, and (d) the gaps of the arms have a bounded number of inflexion points and where the highest arm mean cannot vary too much in a short range.
no code implementations • 14 Jan 2021 • Yi Yu, Oscar Hernan Madrid Padilla, Daren Wang, Alessandro Rinaldo
The goal is to detect the change point as quickly as possible, if it exists, subject to a constraint on the number or probability of false alarms.
no code implementations • 1 Dec 2020 • Donghuo Zeng, Yi Yu, Keizo Oyama
This work presents a music dataset named MusicTM-Dataset, which is used to improve the representation learning ability of different types of cross-modal retrieval (CMR).
1 code implementation • 1 Dec 2020 • Daren Wang, Zifeng Zhao, Yi Yu, Rebecca Willett
We derive finite sample theoretical guarantees and show that the excess prediction risk of our estimator is minimax optimal.
Statistics Theory • Methodology
1 code implementation • 25 Nov 2020 • Hemant Yadav, Atul Anshuman Singh, Rachit Mittal, Sunayana Sitaram, Yi Yu, Rajiv Ratn Shah
Training a robust system, e.g., Speech to Text (STT), requires large datasets.
no code implementations • 12 Nov 2020 • Gurunath Reddy Madhumani, Yi Yu, Florian Harscoët, Simon Canales, Suhua Tang
In this paper, we propose a technique to address the most challenging aspect of the algorithmic songwriting process, which enables the human community to discover original lyrics and melodies suitable for the generated lyrics.
no code implementations • 18 Sep 2020 • Yi Yu, Abhishek Srivastava, Rajiv Ratn Shah
Conditional sequence generation aims to instruct the generation procedure by conditioning the model with additional context information, which is a self-supervised learning issue (a form of unsupervised learning with supervision information from data itself).
no code implementations • 18 Sep 2020 • Yi Yu, Tao Yang, Hongyang Chen, Rodrigo C. de Lamare, Yingsong Li
In this paper, we propose and analyze the sparsity-aware sign subband adaptive filtering with individual weighting factors (S-IWF-SSAF) algorithm, and consider its application in acoustic echo cancellation (AEC).
1 code implementation • 4 Aug 2020 • Dikshant Sagar, Jatin Garg, Prarthana Kansal, Sejal Bhalla, Rajiv Ratn Shah, Yi Yu
The rise in the fashion industry and its effect on social influencing have made outfit compatibility a need.
Ranked #1 on Preference Mapping on IQOON3000
no code implementations • 29 Jul 2020 • Donghuo Zeng, Yi Yu, Keizo Oyama
In this paper, we propose an unsupervised generative adversarial alignment representation (UGAAR) model to learn deep discriminative representations shared across three major musical modalities: sheet music, lyrics, and audio, where a deep neural network based architecture on three branches is jointly trained.
1 code implementation • 22 May 2020 • Hemant Yadav, Sreyan Ghosh, Yi Yu, Rajiv Ratn Shah
Named entity recognition (NER) from text has been a widely studied problem and usually extracts semantic information from text.
Automatic Speech Recognition (ASR)
1 code implementation • 15 May 2020 • Shagun Uppal, Anish Madan, Sarthak Bhagat, Yi Yu, Rajiv Ratn Shah
In this paper, we try to exploit the different visual cues and concepts in an image to generate questions using a variational autoencoder (VAE) without ground-truth answers.
no code implementations • 8 Apr 2020 • Yifu Sun, Xulong Zhang, Yi Yu, Xi Chen, Wei Li
Singing voice detection (SVD), to recognize vocal parts in the song, is an essential task in music information retrieval (MIR).
1 code implementation • 26 Nov 2019 • Osaid Rehman Nasir, Shailesh Kumar Jha, Manraj Singh Grover, Yi Yu, Ajit Kumar, Rajiv Ratn Shah
We then model the highly multi-modal problem of text to face generation as learning the conditional distribution of faces (conditioned on text) in the same latent space.
2 code implementations • 15 Aug 2019 • Yi Yu, Abhishek Srivastava, Simon Canales
Melody generation from lyrics has been a challenging research issue in the field of artificial intelligence and music, which makes it possible to learn and discover latent relationships between interesting lyrics and accompanying melody.
no code implementations • 10 Aug 2019 • Haoting Liang, Donghuo Zeng, Yi Yu, Keizo Oyama
Many online music services have emerged in recent years, making effective music recommendation systems desirable.
2 code implementations • 10 Aug 2019 • Donghuo Zeng, Yi Yu, Keizo Oyama
In particular, two significant contributions are made: i) a better representation can be generated by constructing a deep triplet neural network with triplet loss for optimal projections, maximizing correlation in the shared subspace.
no code implementations • 10 Aug 2019 • Donghuo Zeng, Yi Yu, Keizo Oyama
ii) We propose an end-to-end deep model for cross-modal audio-visual learning where S-DCCA is trained to learn the semantic correlation between audio and visual modalities.
no code implementations • 10 Aug 2019 • Peipei Wang, Lin Li, Yi Yu, Guandong Xu
To tackle the issue of preference aggregation for group recommendation, we propose a novel attentive aggregation representation learning method based on sociological theory for group recommendation, namely SIAGR (short for "Social Influence-based Attentive Group Recommendation"), which takes attention mechanisms and the popular method (BERT) as the aggregation representation for group profile modeling.
1 code implementation • 12 May 2019 • Junjun Jiang, Yi Yu, Zheng Wang, Suhua Tang, Ruimin Hu, Jiayi Ma
In this paper, we present a simple but effective single image SR method based on ensemble learning, which can achieve better performance than any of the SR methods being ensembled (the so-called component super-resolvers).
1 code implementation • 24 Sep 2018 • Sein Minn, Yi Yu, Michel C. Desmarais, Feida Zhu, Jill Jenn Vie
In Intelligent Tutoring Systems (ITS), tracing the student's knowledge state during learning has been studied for several decades in order to provide more supportive learning instructions.
2 code implementations • 3 Sep 2018 • Junjun Jiang, Yi Yu, Suhua Tang, Jiayi Ma, Akiko Aizawa, Kiyoharu Aizawa
To this end, this study incorporates the contextual information of image patch and proposes a powerful and efficient context-patch based face hallucination approach, namely Thresholding Locality-constrained Representation and Reproducing learning (TLcR-RL).
1 code implementation • 28 Jun 2018 • Junjun Jiang, Yi Yu, Jinhui Hu, Suhua Tang, Jiayi Ma
Most current face hallucination methods, whether shallow learning-based or deep learning-based, try to learn a relationship model between Low-Resolution (LR) and High-Resolution (HR) spaces with the help of a training set.
no code implementations • 8 May 2018 • Yi Yu, Suhua Tang, Kiyoharu Aizawa, Akiko Aizawa
Given a photo as input, this model performs (i) exact venue search (find the venue where the photo was taken), and (ii) group venue search (find relevant venues with the same category as that of the photo), by the cross-modal correlation between the input photo and textual description of venues.
no code implementations • 14 Dec 2017 • Francisco Raposo, David Martins de Matos, Ricardo Ribeiro, Suhua Tang, Yi Yu
Modeling of music audio semantics has been previously tackled through learning of mappings from audio data to high-level tags or latent unsupervised spaces.
no code implementations • 4 Dec 2014 • Diego Franco Saldana, Yi Yu, Yang Feng
Stochastic blockmodels and variants thereof are among the most widely used approaches to community detection for social networks and relational data.
no code implementations • 2 Nov 2012 • Yi Yu, Yang Feng
In high-dimensional data analysis, penalized likelihood estimators are shown to provide superior results in both variable selection and parameter estimation.