1 code implementation • 31 Jan 2025 • Fan Wang, Feiyu Jiang, Zifeng Zhao, Yi Yu
We propose a novel Transfer Learning for Dynamic Pricing (TLDP) algorithm that can effectively leverage pre-collected data from a source domain to enhance pricing decisions in the target domain.
2 code implementations • 23 Jan 2025 • Peiyuan Zhang, Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Yue Zhou, Xiaosong Jia, Xudong Lu, Jingdong Chen, Xiang Li, Junchi Yan, Yansheng Li
Based on the views, a scale augmentation module and an angle acquisition module are constructed.
2 code implementations • 22 Jan 2025 • DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu, Shirong Ma, Peiyi Wang, Xiao Bi, Xiaokang Zhang, Xingkai Yu, Yu Wu, Z. F. Wu, Zhibin Gou, Zhihong Shao, Zhuoshu Li, Ziyi Gao, Aixin Liu, Bing Xue, Bingxuan Wang, Bochao Wu, Bei Feng, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Qu, Hui Li, JianZhong Guo, Jiashi Li, Jiawei Wang, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, J. L. Cai, Jiaqi Ni, Jian Liang, Jin Chen, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Liang Zhao, Litong Wang, Liyue Zhang, Lei Xu, Leyi Xia, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Meng Li, Miaojun Wang, Mingming Li, Ning Tian, Panpan Huang, Peng Zhang, Qiancheng Wang, Qinyu Chen, Qiushi Du, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, R. J. Chen, R. L. Jin, Ruyi Chen, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shiyu Wang, Shuiping Yu, Shunfeng Zhou, Shuting Pan, S. S. Li, Shuang Zhou, Shaoqing Wu, Shengfeng Ye, Tao Yun, Tian Pei, Tianyu Sun, T. Wang, Wangding Zeng, Wanjia Zhao, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, W. L. Xiao, Wei An, Xiaodong Liu, Xiaohan Wang, Xiaokang Chen, Xiaotao Nie, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, X. Q. Li, Xiangyue Jin, Xiaojin Shen, Xiaosha Chen, Xiaowen Sun, Xiaoxiang Wang, Xinnan Song, Xinyi Zhou, Xianzu Wang, Xinxia Shan, Y. K. Li, Y. Q. Wang, Y. X. Wei, Yang Zhang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Wang, Yi Yu, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yuan Ou, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yunfan Xiong, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yaohui Li, Yi Zheng, Yuchen Zhu, Yunxian Ma, Ying Tang, Yukun Zha, Yuting Yan, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhicheng Ma, Zhigang Yan, Zhiyu Wu, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Zizheng Pan, Zhen Huang, Zhipeng Xu, Zhongyu Zhang, Zhen Zhang
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Ranked #1 on
Mathematical Reasoning
on AIME24
1 code implementation • 16 Jan 2025 • Qingyun Li, Yushi Chen, Xinya Shu, Dong Chen, Xin He, Yi Yu, Xue Yang
In this paper, we present a simple baseline for applying MLMs to aerial detection for the first time, named LMMRotate.
1 code implementation • 27 Dec 2024 • DeepSeek-AI, Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Han Bao, Hanwei Xu, Haocheng Wang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, JianZhong Guo, Jiaqi Ni, Jiashi Li, Jiawei Wang, Jin Chen, Jingchang Chen, Jingyang Yuan, Junjie Qiu, Junlong Li, Junxiao Song, Kai Dong, Kai Hu, Kaige Gao, Kang Guan, Kexin Huang, Kuai Yu, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Litong Wang, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qiancheng Wang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruisong Zhang, Ruizhe Pan, Runji Wang, Runxin Xu, Ruoyu Zhang, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Shuting Pan, T. Wang, Tao Yun, Tian Pei, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wanjia Zhao, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wenqin Yu, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaokang Zhang, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Cheng, Xin Liu, Xin Xie, Xingchao Liu, Xingkai Yu, Xinnan Song, Xinxia Shan, Xinyi Zhou, Xinyu Yang, Xinyuan Li, Xuecheng Su, Xuheng Lin, Y. K. Li, Y. Q. Wang, Y. X. Wei, Y. X. Zhu, Yang Zhang, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Yu, Yi Zheng, Yichao Zhang, Yifan Shi, Yiliang Xiong, Ying He, Ying Tang, Yishi Piao, Yisong Wang, Yixuan Tan, Yiyang Ma, Yiyuan Liu, Yongqiang Guo, Yu Wu, Yuan Ou, Yuchen Zhu, Yuduan Wang, Yue Gong, Yuheng Zou, Yujia He, Yukun Zha, Yunfan Xiong, Yunxian Ma, Yuting Yan, Yuxiang Luo, Yuxiang You, Yuxuan Liu, Yuyang Zhou, Z. F. Wu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhean Xu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhengyan Zhang, Zhewen Hao, Zhibin Gou, Zhicheng Ma, Zhigang Yan, Zhihong Shao, Zhipeng Xu, Zhiyu Wu, Zhongyu Zhang, Zhuoshu Li, Zihui Gu, Zijia Zhu, Zijun Liu, Zilin Li, Ziwei Xie, Ziyang Song, Ziyi Gao, Zizheng Pan
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
1 code implementation • 10 Dec 2024 • Yi Yu, Song Xia, Xun Lin, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex Kot
To address these challenges, we shift our focus to another significant threat and present a novel poisoning-based backdoor attack against NR-IQA (BAIQA), allowing the attacker to manipulate the IQA model's output to any desired target value by simply adjusting a scaling coefficient $\alpha$ for the trigger.
no code implementations • 2 Dec 2024 • Yi Yu, YuFei Wang, Wenhan Yang, Lanqing Guo, Shijian Lu, Ling-Yu Duan, Yap-Peng Tan, Alex C. Kot
To improve training efficiency, we propose a dynamic loss function that balances loss terms with fewer hyper-parameters, optimizing attack objectives effectively.
no code implementations • 24 Nov 2024 • Luis Vilaca, Yi Yu, Paula Vinan
Audio-visual correlation learning aims to capture and understand natural phenomena between audio and visual data.
1 code implementation • 4 Nov 2024 • Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Shaofeng Zhang, Yi Yu, Wenxian Yu, Junchi Yan
In this paper, we put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD), which can detect objects beyond training categories without costly collecting new labeled data.
1 code implementation • 26 Oct 2024 • Song Xia, Wenhan Yang, Yi Yu, Xun Lin, Henghui Ding, Lingyu Duan, Xudong Jiang
To enhance the effectiveness of the adversarial attack towards models fine-tuned on unknown datasets, we propose a universal meta-initialization (UMI) algorithm to extract the intrinsic vulnerability inherent in the foundation model, which is then utilized as the prior knowledge to guide the generation of adversarial perturbations.
1 code implementation • 10 Oct 2024 • Botao Ren, Xue Yang, Yi Yu, Junwei Luo, Zhidong Deng
Single point supervised oriented object detection has gained attention and made initial progress within the community.
no code implementations • 3 Oct 2024 • Yangyang Qiu, Guoan Xu, Guangwei Gao, Zhenhua Guo, Yi Yu, Chia-Wen Lin
Recently, the integration of the local modeling capabilities of Convolutional Neural Networks (CNNs) with the global dependency strengths of Transformers has created a sensation in the semantic segmentation community.
no code implementations • 29 Aug 2024 • Chong Wang, Mengyao Li, Junjun He, Zhongruo Wang, Erfan Darzi, Zan Chen, Jin Ye, Tianbin Li, Yanzhou Su, Jing Ke, Kaili Qu, Shuxin Li, Yi Yu, Pietro Liò, Tianyun Wang, Yu Guang Wang, Yiqing Shen
To address these challenges, we also identify future research directions of LLM in biomedicine including federated learning methods to preserve data privacy and integrating explainable AI methodologies to enhance the transparency of LLMs.
no code implementations • 16 Aug 2024 • Qichen Zheng, Yi Yu, Siyuan Yang, Jun Liu, Kwok-Yan Lam, Alex Kot
To investigate the vulnerabilities of SAR in the physical world, we introduce the Physical Skeleton Backdoor Attacks (PSBA), the first exploration of physical backdoor attacks against SAR.
no code implementations • 15 Aug 2024 • Yi Yu, Qichen Zheng, Siyuan Yang, Wenhan Yang, Jun Liu, Shijian Lu, Yap-Peng Tan, Kwok-Yan Lam, Alex Kot
We verify that when training a classifier on a mixed dataset containing both UEs and clean data, the model tends to quickly adapt to the UEs compared to the clean data.
no code implementations • 15 Jul 2024 • Xuhong Wang, Haoyu Jiang, Yi Yu, Jingru Yu, Yilun Lin, Ping Yi, Yingchun Wang, Yu Qiao, Li Li, Fei-Yue Wang
Large Language Models (LLMs) are increasingly integrated into diverse industries, posing substantial security risks due to unauthorized replication and misuse.
1 code implementation • 11 Jul 2024 • Laniqng Guo, Chong Wang, YuFei Wang, Yi Yu, Siyu Huang, Wenhan Yang, Alex C. Kot, Bihan Wen
In this paper, we are the first to provide a comprehensive survey to cover various aspects ranging from technical details to applications.
no code implementations • 25 Jun 2024 • Ruohan Meng, Chenyu Yi, Yi Yu, Siyuan Yang, Bingquan Shen, Alex C. Kot
To further boost the robustness of unlearnable examples, we design a Semantic Images Generation module that produces hidden semantic images.
3 code implementations • 13 Jun 2024 • Yansheng Li, LinLin Wang, Tingzhu Wang, Xue Yang, Junwei Luo, Qi Wang, Youming Deng, Wenbin Wang, Xian Sun, Haifeng Li, Bo Dang, Yongjun Zhang, Yi Yu, Junchi Yan
This paper constructs a large-scale dataset for SGG in large-size VHR SAI with image sizes ranging from 512 x 768 to 27, 860 x 31, 096 pixels, named STAR (Scene graph generaTion in lArge-size satellite imageRy), encompassing over 210K objects and over 400K triplets.
no code implementations • 4 Jun 2024 • Zifeng Zhao, Feiyu Jiang, Yi Yu
In particular, we propose a stochastic gradient descent based ETC algorithm that achieves an optimal regret upper bound of order $d\sqrt{T}/\epsilon$, up to a logarithmic factor, where $\epsilon>0$ is the privacy parameter.
1 code implementation • 2 May 2024 • Yi Yu, YuFei Wang, Song Xia, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
Based on this network, a two-stage purification approach is naturally developed.
no code implementations • 21 Apr 2024 • Donghuo Zeng, Yanan Wang, Kazushi Ikeda, Yi Yu
However, the model training fails to fully explore the space due to the scarcity of training data points, resulting in an incomplete representation of the overall positive and negative distributions.
1 code implementation • 15 Apr 2024 • Song Xia, Yi Yu, Xudong Jiang, Henghui Ding
The proposed Dual Randomized Smoothing (DRS) down-samples the input image into two sub-images and smooths the two sub-images in lower dimensions.
1 code implementation • 12 Apr 2024 • Chenqi Kong, Anwei Luo, Peijun Bao, Yi Yu, Haoliang Li, Zengwei Zheng, Shiqi Wang, Alex C. Kot
Deepfakes have recently raised significant trust issues and security concerns among the public.
no code implementations • 8 Apr 2024 • Yi Yu, Brendan P. Malone, Luigi J. Renzullo
The cross-cluster validation underscored the capability of the upscaling approach to map the spatial variability of SM within areas that were not covered by in-situ sites, with correlation performance ranging between 0. 6 and 0. 8.
no code implementations • 21 Mar 2024 • Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot
This is particularly true for medical image segmentation (MIS) datasets, where the processes of collection and fine-grained annotation are time-intensive and laborious.
no code implementations • 17 Mar 2024 • Mengchu Li, Ye Tian, Yang Feng, Yi Yu
By investigating the minimax rates and identifying the costs of privacy for these problems, we show that federated differential privacy is an intermediate privacy model between the well-established local and central models of differential privacy.
1 code implementation • CVPR 2024 • Chong Wang, Lanqing Guo, YuFei Wang, Hao Cheng, Yi Yu, Bihan Wen
Starting from decomposing the original maximum-a-posteriori problem of accelerated MRI, we present a rigorous derivation of the proposed PDAC framework, which could be further unfolded into an end-to-end trainable network.
no code implementations • 15 Mar 2024 • Chong Wang, Yi Yu, Lanqing Guo, Bihan Wen
This is primarily due to the unique characteristic of spatially varying illumination within shadow images.
no code implementations • 14 Mar 2024 • Wenjie Yin, Xuejiao Zhao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman
First, we propose LM2D, a novel probabilistic architecture that incorporates a multimodal diffusion model with consistency distillation, designed to create dance conditioned on both music and lyrics in one diffusion generation step.
1 code implementation • 13 Dec 2023 • Xin You, Ming Ding, Minghui Zhang, Hanxiao Zhang, Yi Yu, Jie Yang, Yun Gu
Precise boundary segmentation of volumetric images is a critical task for image-guided diagnosis and computer-assisted intervention, especially for boundary confusion in clinical practice.
no code implementations • 12 Dec 2023 • Wenjie Yin, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman
Current training of motion style transfer systems relies on consistency losses across style domains to preserve contents, hindering its scalable application to a large number of domains and private data.
1 code implementation • CVPR 2024 • Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li
Single point-supervised object detection is gaining attention due to its cost-effectiveness.
2 code implementations • CVPR 2024 • Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan
To our best knowledge, Point2RBox is the first end-to-end solution for point-supervised OOD.
no code implementations • 2 Oct 2023 • Zhe Zhang, Karol Lasocki, Yi Yu, Atsuhiro Takasu
The generation of lyrics tightly connected to accompanying melodies involves establishing a mapping between musical notes and syllables of lyrics.
1 code implementation • MICCAI 2023 • Xin You, Ming Ding, Minghui Zhang, Yangqian Wu, Yi Yu, Yun Gu, Jie Yang
In this paper, we have modeled relative relations between the LA and LAA via deep segmentation networks for the first time, and introduce a new LA & LAA CT dataset.
no code implementations • 1 Oct 2023 • Julien Lalanne, Raphael Bournet, Yi Yu
Live commenting on video, a popular feature of live streaming platforms, enables viewers to engage with the content and share their comments, reactions, opinions, or questions with the streamer or other viewers while watching the video or live stream.
1 code implementation • 30 Sep 2023 • Wenjie Yin, Qingyuan Yao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman
To complement it, we introduce JustLMD, a new multimodal dataset of 3D dance motion with music and lyrics.
no code implementations • 25 Jul 2023 • Yi Yu, Wenlian Lu, BoYu Chen
We propose theoretical analyses of a modified natural gradient descent method in the neural network function space based on the eigendecompositions of neural tangent kernel and Fisher information matrix.
no code implementations • 25 Jul 2023 • Shengyue Yao, Jingru Yu, Yi Yu, Jia Xu, Xingyuan Dai, Honghai Li, Fei-Yue Wang, Yilun Lin
Furthermore, an operation algorithm is proposed regarding the issue of structural rigidity in DAO.
1 code implementation • ICCV 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Different from a vanilla diffusion model that has to perform Gaussian denoising, with the injected physics-based exposure model, our restoration process can directly start from a noisy image instead of pure noise.
Ranked #1 on
Image Denoising
on Image Denoising on SID x300
1 code implementation • 21 Jun 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Besides, we propose a novel design of the context model, which can better predict the order masks of encoding/decoding based on both the sRGB image and the masks of already processed features.
no code implementations • 5 Jun 2023 • Zhe Zhang, Yi Yu, Atsuhiro Takasu
Lyrics-to-melody generation is an interesting and challenging topic in AI music research field.
no code implementations • 27 Apr 2023 • Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling
To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition.
no code implementations • 29 Mar 2023 • Lu Lu, Yi Yu, Zongsheng Zheng, Guangya Zhu, Xiaomin Yang
Two Andrew's sine estimator (ASE)-based robust adaptive filtering algorithms are proposed in this brief.
1 code implementation • 23 Mar 2023 • Dichucheng Li, Mingjin Che, Wenwu Meng, Yulun Wu, Yi Yu, Fan Xia, Wei Li
Instrument playing technique (IPT) is a key element of musical presentation.
Instrument Playing Technique Detection
Multi-Label Classification
+1
1 code implementation • 21 Mar 2023 • Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Yi Yu, Yifang Yin, Rajiv Ratn Shah
To mitigate this, we build a talking face generation framework conditioned on a categorical emotion to generate videos with appropriate expressions, making them more realistic and convincing.
Ranked #1 on
Talking Face Generation
on CREMA-D
no code implementations • 4 Mar 2023 • Qinghua He, Wanyu Li, Yaping Shi, Yi Yu, Yi Zhang, Wenqian Geng, Zhiyuan Sun, Ruikang K Wang
This study highlights the potential of SpeCamX to improve the prediction of bio-chromophores, and its ability to transform an ordinary smartphone into a powerful medical tool without the need for additional investments or expertise.
no code implementations • CVPR 2023 • Yi Yu, YuFei Wang, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
Extensive experiments show that with our trained trigger injection models and simple modification of encoder parameters (of the compression model), the proposed attack can successfully inject several backdoors with corresponding triggers in a single image compression model.
1 code implementation • CVPR 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex Kot, Bihan Wen
While raw images exhibit advantages over sRGB images (e. g., linearity and fine-grained quantization level), they are not widely used by common users due to the large storage requirements.
no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang
We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way similar to the music creation of humans.
no code implementations • ACM Multimedia Asia 2022 • Sahil Goyal, Shagun Uppal, Sarthak Bhagat, Dhroov Goel, Sakshat Mali, Yi Yu, Yifang Yin, Rajiv Ratn Shah
Lip synchronization and talking face generation have gained a specific interest from the research community with the advent and need of digital communication in different fields.
1 code implementation • CVPR 2023 • Yi Yu, Feipeng Da
With the vigorous development of computer vision, oriented object detection has gradually been featured.
7 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li
The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.
no code implementations • 19 Sep 2022 • Dichucheng Li, Yulun Wu, Qinyu Li, Jiahao Zhao, Yi Yu, Fan Xia, Wei Li
Because each Guzheng playing technique is applied to a note, a dedicated onset detector is trained to divide an audio into several notes and its predictions are fused with frame-wise IPT predictions.
no code implementations • 14 Aug 2022 • YaQin Li, Lingli Li, Yongjin Xu, Yi Yu
In the generative model, one of the reward components, a binding affinity predictor, is based on 1D protein sequence and molecular SMILES.
no code implementations • 4 Aug 2022 • Yi Yu, Hongsen He, Rodrigo C. de Lamare, Badong Chen
In this paper, we propose a general robust subband adaptive filtering (GR-SAF) scheme against impulsive noise by minimizing the mean square deviation under the random-walk model with individual weight uncertainty.
no code implementations • 30 Jun 2022 • Wei Duan, Zhe Zhang, Yi Yu, Keizo Oyama
Generating melody from lyrics is an interesting yet challenging task in the area of artificial intelligence and music.
no code implementations • 25 Jun 2022 • Tao Yu, Rodrigo C. de Lamare, Yi Yu
This paper studies distributed diffusion adaptation over clustered multi-task networks in the presence of impulsive interferences and Byzantine attacks.
1 code implementation • 16 May 2022 • Yi Yu, Karl Borjesson
Transformer models have been developed in molecular science with excellent performance in applications including quantitative structure-activity relationship (QSAR) and virtual screening (VS).
no code implementations • 15 May 2022 • Yi Yu, Zongxin Huang, Hongsen He, Yuriy Zakharov, Rodrigo C. de Lamare
This paper proposes a unified sparsity-aware robust normalized subband adaptive filtering (SA-RNSAF) algorithm for identification of sparse systems under impulsive noise.
1 code implementation • 2 May 2022 • Weixing Wei, Peilin Li, Yi Yu, Wei Li
Sounds, especially music, contain various harmonic components scattered in the frequency dimension.
1 code implementation • 28 Apr 2022 • Guangwei Gao, Zhengxue Wang, Juncheng Li, Wenjie Li, Yi Yu, Tieyong Zeng
Single-image super-resolution (SISR) has achieved significant breakthroughs with the development of deep learning.
1 code implementation • CVPR 2022 • Yi Yu, Wenhan Yang, Yap-Peng Tan, Alex C. Kot
Finally, we examine various types of adversarial attacks that are specific to deraining problems and their effects on both human and machine vision tasks, including 1) rain region attacks, adding perturbations only in the rain regions to make the perturbations in the attacked rain images less visible; 2) object-sensitive attacks, adding perturbations only in regions near the given objects.
no code implementations • 19 Mar 2022 • Lu Lu, Yi Yu, Rodrigo C. de Lamare, Xiaomin Yang
We propose a novel M-estimate conjugate gradient (CG) algorithm, termed Tukey's biweight M-estimate CG (TbMCG), for system identification in impulsive noise environments.
no code implementations • 28 Feb 2022 • Luís Vilaça, Yi Yu, Paula Viana
Audio-visual correlation learning aims to capture essential correspondences and understand natural phenomena between audio and video.
1 code implementation • 13 Feb 2022 • Qiqi He, Xiaoheng Sun, Yi Yu, Wei Li
Chorus detection is a challenging problem in musical signal processing as the chorus often repeats more than once in popular songs, usually with rich instruments and complex rhythm forms.
1 code implementation • 16 Dec 2021 • Guangwei Gao, Wenjie Li, Juncheng Li, Fei Wu, Huimin Lu, Yi Yu
Convolutional neural networks based single-image super-resolution (SISR) has made great progress in recent years.
no code implementations • 5 Dec 2021 • Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li
On the one hand, audio encoder and visual encoder separately encode audio data and visual data into two different latent spaces.
no code implementations • 19 Oct 2021 • Lu Lu, Kai-Li Yin, Rodrigo C. de Lamare, Zongsheng Zheng, Yi Yu, Xiaomin Yang, Badong Chen
Most of the literature focuses on the development of the linear active noise control (ANC) techniques.
no code implementations • 1 Oct 2021 • Lu Lu, Kai-Li Yin, Rodrigo C. de Lamare, Zongsheng Zheng, Yi Yu, Xiaomin Yang, Badong Chen
Active noise control (ANC) is an effective way for reducing the noise level in electroacoustic or electromechanical systems.
no code implementations • 7 Sep 2021 • YaQin Li, Yongjin Xu, Yi Yu
Our strategy takes advantages of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method.
1 code implementation • 2 Sep 2021 • Guangwei Gao, Guoan Xu, Juncheng Li, Yi Yu, Huimin Lu, Jian Yang
Specifically, FBSNet employs a symmetrical encoder-decoder structure with two branches, semantic information branch and spatial detail branch.
no code implementations • 14 Aug 2021 • Gang Guo, Yi Yu, Rodrigo C. de Lamare, Zongsheng Zheng, Lu Lu, Qiangming Cai
In addition, an adaptive approach for the choice of the thresholding parameter in the proximal step is also proposed based on the minimization of the mean square deviation.
1 code implementation • 6 Aug 2021 • Xuejiao Tang, Wenbin Zhang, Yi Yu, Kea Turner, Tyler Derr, Mengyu Wang, Eirini Ntoutsi
While image understanding on recognition-level has achieved remarkable advancements, reliable visual scene understanding requires comprehensive image understanding on recognition-level but also cognition-level, which calls for exploiting the multi-source information as well as learning different levels of understanding and extensive commonsense knowledge.
no code implementations • ACL 2021 • Yi Yu, Adam Jatowt, Antoine Doucet, Kazunari Sugiyama, Masatoshi Yoshikawa
In this paper, we address a novel task, Multiple TimeLine Summarization (MTLS), which extends the flexibility and versatility of Time-Line Summarization (TLS).
no code implementations • 30 Jul 2021 • Xiaotian Yu, Hanling Yi, Yi Yu, Ling Xing, Shiliang Zhang, Xiaoyu Wang
There has been a recent surge of research interest in attacking the problem of social relation inference based on images.
1 code implementation • Conference 2021 • Xingcai Wu, Yucheng Xie, Jiaqi Zeng, Zhenguo Yang, Yi Yu, Qing Li, and Wenyin Liu
In this paper, we propose an adversarial learning framework with mask reconstruction (ALMR) for image inpainting with textual guidance, which consists of a two-stage generator and dual discriminators.
1 code implementation • NeurIPS 2021 • Oscar Hernan Madrid Padilla, Yi Yu, Alessandro Rinaldo
We study piece-wise constant signals corrupted by additive Gaussian noise over a $d$-dimensional lattice.
no code implementations • 31 Mar 2021 • Yi Yu, Feipeng Da, Ziyu Zhang
Without fine-tuning on the test set, the Rank-1 Recognition Rate (RR1) is achieved as follows: 98. 85% on FRGC v2. 0 dataset and 99. 33% on Bosphorus dataset, which proves the effectiveness and the potentiality of our method.
1 code implementation • 26 Mar 2021 • Guangwei Gao, Hao Shao, Fei Wu, Meng Yang, Yi Yu
This paper pays close attention to the cross-modality visible-infrared person re-identification (VI Re-ID) task, which aims to match pedestrian samples between visible and infrared modes.
Cross-Modality Person Re-identification
Knowledge Distillation
+2
no code implementations • 25 Mar 2021 • Guangwei Gao, Yi Yu, Jian Yang, Guo-Jun Qi, Meng Yang
(i) To learn more robust and discriminative features, we desire to adaptively fuse the contextual features from different layers.
no code implementations • 24 Mar 2021 • Zhengxue Wang, Guangwei Gao, Juncheng Li, Yi Yu, Huimin Lu
Recently, the single image super-resolution (SISR) approaches with deep and complex convolutional neural network structures have achieved promising performance.
no code implementations • 24 Mar 2021 • Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue
In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones.
no code implementations • 1 Feb 2021 • Anne Gael Manegueu, Alexandra Carpentier, Yi Yu
On top of the switching bandit problem (\textbf{Case a}), we are interested in three concrete examples: (\textbf{b}) the means of the arms are local polynomials, (\textbf{c}) the means of the arms are locally smooth, and (\textbf{d}) the gaps of the arms have a bounded number of inflexion points and where the highest arm mean cannot vary too much in a short range.
no code implementations • 14 Jan 2021 • Yi Yu, Oscar Hernan Madrid Padilla, Daren Wang, Alessandro Rinaldo
The goal is to detect the change point as quickly as possible, if it exists, subject to a constraint on the number or probability of false alarms.
no code implementations • 1 Dec 2020 • Donghuo Zeng, Yi Yu, Keizo Oyama
This work present a music dataset named MusicTM-Dataset, which is utilized in improving the representation learning ability of different types of cross-modal retrieval (CMR).
1 code implementation • 1 Dec 2020 • Daren Wang, Zifeng Zhao, Yi Yu, Rebecca Willett
We derive finite sample theoretical guarantees and show that the excess prediction risk of our estimator is minimax optimal.
Statistics Theory Methodology Statistics Theory
1 code implementation • 25 Nov 2020 • Hemant Yadav, Atul Anshuman Singh, Rachit Mittal, Sunayana Sitaram, Yi Yu, Rajiv Ratn Shah
Training a robust system, e. g., Speech to Text (STT), requires large datasets.
no code implementations • 12 Nov 2020 • Gurunath Reddy Madhumani, Yi Yu, Florian Harscoët, Simon Canales, Suhua Tang
In this paper, we propose a technique to address the most challenging aspect of algorithmic songwriting process, which enables the human community to discover original lyrics, and melodies suitable for the generated lyrics.
no code implementations • 18 Sep 2020 • Yi Yu, Abhishek Srivastava, Rajiv Ratn Shah
Conditional sequence generation aims to instruct the generation procedure by conditioning the model with additional context information, which is a self-supervised learning issue (a form of unsupervised learning with supervision information from data itself).
no code implementations • 18 Sep 2020 • Yi Yu, Tao Yang, Hongyang Chen, Rodrigo C. de Lamare, Yingsong Li
In this paper, we propose and analyze the sparsity-aware sign subband adaptive filtering with individual weighting factors (S-IWF-SSAF) algorithm, and consider its application in acoustic echo cancellation (AEC).
1 code implementation • 4 Aug 2020 • Dikshant Sagar, Jatin Garg, Prarthana Kansal, Sejal Bhalla, Rajiv Ratn Shah, Yi Yu
The rise in the fashion industry and its effect on social influencing have made outfit compatibility a need.
Ranked #1 on
Preference Mapping
on IQOON3000
no code implementations • 29 Jul 2020 • Donghuo Zeng, Yi Yu, Keizo Oyama
In this paper, we propose an unsupervised generative adversarial alignment representation (UGAAR) model to learn deep discriminative representations shared across three major musical modalities: sheet music, lyrics, and audio, where a deep neural network based architecture on three branches is jointly trained.
1 code implementation • 22 May 2020 • Hemant Yadav, Sreyan Ghosh, Yi Yu, Rajiv Ratn Shah
Named entity recognition (NER) from text has been a widely studied problem and usually extracts semantic information from text.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+4
1 code implementation • 15 May 2020 • Shagun Uppal, Anish Madan, Sarthak Bhagat, Yi Yu, Rajiv Ratn Shah
In this paper, we try to exploit the different visual cues and concepts in an image to generate questions using a variational autoencoder (VAE) without ground-truth answers.
no code implementations • 8 Apr 2020 • Yifu Sun, xulong Zhang, Yi Yu, Xi Chen, Wei Li
Singing voice detection (SVD), to recognize vocal parts in the song, is an essential task in music information retrieval (MIR).
1 code implementation • 26 Nov 2019 • Osaid Rehman Nasir, Shailesh Kumar Jha, Manraj Singh Grover, Yi Yu, Ajit Kumar, Rajiv Ratn Shah
We then model the highly multi-modal problem of text to face generation as learning the conditional distribution of faces (conditioned on text) in same latent space.
2 code implementations • 15 Aug 2019 • Yi Yu, Abhishek Srivastava, Simon Canales
Melody generation from lyrics has been a challenging research issue in the field of artificial intelligence and music, which enables to learn and discover latent relationship between interesting lyrics and accompanying melody.
no code implementations • 10 Aug 2019 • Peipei Wang, Lin Li, Yi Yu, Guandong Xu
To tackle the issue of preference aggregation for group recommendation, we propose a novel attentive aggregation representation learning method based on sociological theory for group recommendation, namely SIAGR (short for "Social Influence-based Attentive Group Recommendation"), which takes attention mechanisms and the popular method (BERT) as the aggregation representation for group profile modeling.
no code implementations • 10 Aug 2019 • Donghuo Zeng, Yi Yu, Keizo Oyama
ii) We propose an end-to-end deep model for cross-modal audio-visual learning where S-DCCA is trained to learn the semantic correlation between audio and visual modalities.
2 code implementations • 10 Aug 2019 • Donghuo Zeng, Yi Yu, Keizo Oyama
In particular, two significant contributions are made: i) a better representation by constructing deep triplet neural network with triplet loss for optimal projections can be generated to maximize correlation in the shared subspace.
no code implementations • 10 Aug 2019 • Haoting Liang, Donghuo Zeng, Yi Yu, Keizo Oyama
Since many online music services emerged in recent years so that effective music recommendation systems are desirable.
1 code implementation • 12 May 2019 • Junjun Jiang, Yi Yu, Zheng Wang, Suhua Tang, Ruimin Hu, Jiayi Ma
In this paper, we present a simple but effective single image SR method based on ensemble learning, which can produce a better performance than that could be obtained from any of SR methods to be ensembled (or called component super-resolvers).
1 code implementation • 24 Sep 2018 • Sein Minn, Yi Yu, Michel C. Desmarais, Feida Zhu, Jill Jenn Vie
In Intelligent Tutoring System (ITS), tracing the student's knowledge state during learning has been studied for several decades in order to provide more supportive learning instructions.
2 code implementations • 3 Sep 2018 • Junjun Jiang, Yi Yu, Suhua Tang, Jiayi Ma, Akiko Aizawa, Kiyoharu Aizawa
To this end, this study incorporates the contextual information of image patch and proposes a powerful and efficient context-patch based face hallucination approach, namely Thresholding Locality-constrained Representation and Reproducing learning (TLcR-RL).
1 code implementation • 28 Jun 2018 • Junjun Jiang, Yi Yu, Jinhui Hu, Suhua Tang, Jiayi Ma
Most of the current face hallucination methods, whether they are shallow learning-based or deep learning-based, all try to learn a relationship model between Low-Resolution (LR) and High-Resolution (HR) spaces with the help of a training set.
no code implementations • 8 May 2018 • Yi Yu, Suhua Tang, Kiyoharu Aizawa, Akiko Aizawa
Given a photo as input, this model performs (i) exact venue search (find the venue where the photo was taken), and (ii) group venue search (find relevant venues with the same category as that of the photo), by the cross-modal correlation between the input photo and textual description of venues.
no code implementations • 14 Dec 2017 • Francisco Raposo, David Martins de Matos, Ricardo Ribeiro, Suhua Tang, Yi Yu
Modeling of music audio semantics has been previously tackled through learning of mappings from audio data to high-level tags or latent unsupervised spaces.
no code implementations • 4 Dec 2014 • Diego Franco Saldana, Yi Yu, Yang Feng
Stochastic blockmodels and variants thereof are among the most widely used approaches to community detection for social networks and relational data.
no code implementations • 2 Nov 2012 • Yi Yu, Yang Feng
In high-dimensional data analysis, penalized likelihood estimators are shown to provide superior results in both variable selection and parameter estimation.