no code implementations • ECCV 2020 • Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet
We formalize concepts around geometric occlusion in 2D images (i.e., ignoring semantics), and propose a novel unified formulation of both occlusion boundaries and occlusion orientations via a pixel-pair occlusion relation.
no code implementations • 30 Mar 2025 • Xingyu Lyu, Ning Wang, Yang Xiao, Shixiong Li, Tao Li, Danjue Chen, Yimin Chen
FLBuff is inspired by our insight that non-iids can be modeled as omni-directional expansion in representation space while backdoor attacks as uni-directional.
no code implementations • 30 Mar 2025 • Xingyu Lyu, Ning Wang, Yang Xiao, Shixiong Li, Tao Li, Danjue Chen, Yimin Chen
Our comprehensive evaluations show that GeminiGuard consistently outperforms SOTA defenses under various settings.
no code implementations • 9 Mar 2025 • Yang Xiao, Wang Lu, Jie Ji, Ruimeng Ye, Gen Li, Xiaolong Ma, Bo Hui
We believe our approach paves the way for a more precise understanding of brain signals in the future.
no code implementations • 8 Mar 2025 • Kai Yang, Zijian Bai, Yang Xiao, Xinyu Li, Xiaohan Shi
3D reconstruction garners increasing attention alongside the advancement of high-level image applications, where dense stereo matching (DSM) serves as a pivotal technique.
no code implementations • 20 Feb 2025 • Weipeng Huang, Qin Li, Yang Xiao, Cheng Qiao, Tie Cai, Junwei Liao, Neil J. Hurley, Guangyuan Piao
Our model posits that label noise arises from a stochastic shift in the latent variable, providing a more robust and beneficial means for noisy learning.
3 code implementations • 5 Feb 2025 • Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, PengFei Liu
While conventional wisdom suggests that sophisticated reasoning tasks demand extensive training data (>100,000 examples), we demonstrate that complex mathematical reasoning abilities can be effectively elicited with surprisingly few examples.
no code implementations • 25 Jan 2025 • Bohan Liu, Yang Xiao, Ruimeng Ye, Zinan Ling, Xiaolong Ma, Bo Hui
In this paper, we experimentally demonstrate that, while directly applying DBA to decentralized FL, the attack success rate depends on the distribution of attackers in the network architecture.
1 code implementation • 15 Nov 2024 • Yang Xiao, Rohan Kumar Das
Transformers and their variants have achieved great success in speech processing.
Ranked #1 on Audio Deepfake Detection on ASVspoof 2021
1 code implementation • 2 Nov 2024 • Han Yin, Yang Xiao, Jisheng Bai, Rohan Kumar Das
Sound Event Detection (SED) is challenging in noisy environments where overlapping sounds obscure target events.
Ranked #1 on Sound Event Detection on WildDESED (using extra training data)
1 code implementation • 16 Oct 2024 • Ruimeng Ye, Yang Xiao, Bo Hui
We remark that existing works investigate the phenomenon of weak-to-strong generation in an analogous setup (i.e., binary classification), rather than practical alignment-relevant tasks (e.g., safety).
1 code implementation • 20 Sep 2024 • Han Yin, Jisheng Bai, Yang Xiao, Hui Wang, Siqi Zheng, Yafeng Chen, Rohan Kumar Das, Chong Deng, Jianfeng Chen
To address this issue, we propose the text-queried SED (TQ-SED) framework.
no code implementations • 12 Sep 2024 • Tianyi Peng, Yang Xiao
Spoken keyword spotting (KWS) is crucial for identifying keywords within audio inputs and is widely used in applications like Apple Siri and Google Home, particularly on edge devices.
no code implementations • 8 Sep 2024 • Yang Xiao, Rohan Kumar Das
We consider the Mamba-based model to analyze spatial features from speech signals by fusing both time and frequency features, and we develop an SSL system called TF-Mamba.
no code implementations • 15 Jul 2024 • Yubin Hu, Xiaoyang Guo, Yang Xiao, Jingwei Huang, Yong-Jin Liu
Although it achieves fast training speed, there is still a lot of room for improvement in its rendering speed due to the per-point MLP executions for implicit multi-level feature aggregation, especially for real-time applications.
no code implementations • 12 Jul 2024 • Ning Wang, Shanghao Shi, Yang Xiao, Yimin Chen, Y. Thomas Hou, Wenjing Lou
Based on the intuition that clustering and subsequent backdoor detection can drastically benefit from knowing client data distributions, we propose a novel data distribution inference mechanism.
no code implementations • 4 Jul 2024 • Yang Xiao, Rohan Kumar Das
Sound source localization (SSL) is essential for many speech-processing applications.
no code implementations • 4 Jul 2024 • Yang Xiao, Han Yin, Jisheng Bai, Rohan Kumar Das
This work explores domain generalization (DG) for sound event detection (SED), advancing adaptability towards real-world scenarios.
1 code implementation • 4 Jul 2024 • Yang Xiao, Rohan Kumar Das
This work aims to advance sound event detection (SED) research by presenting a new large language model (LLM)-powered dataset namely wild domestic environment sound event detection (WildDESED).
Ranked #3 on Sound Event Detection on WildDESED
no code implementations • 4 Jul 2024 • Yang Xiao, Rohan Kumar Das
This work explores class-incremental learning (CIL) for sound event detection (SED), advancing adaptability towards real-world scenarios.
no code implementations • 29 Jun 2024 • Yang Xiao, Han Yin, Jisheng Bai, Rohan Kumar Das
Our proposed method shows superior macro-average pAUC and polyphonic SED score performance on the DCASE 2024 Challenge Task 4 validation dataset and public evaluation dataset.
no code implementations • 26 Jun 2024 • Yuanxi Lin, Tonglin Zhou, Yang Xiao
These findings highlight the effectiveness of our model advancements in improving speech command recognition for aviation safety and efficiency in noisy, high-stakes environments.
no code implementations • 18 Jun 2024 • Jiashuo Wang, Yang Xiao, Yanran Li, Changhe Song, Chunpu Xu, Chenhao Tan, Wenjie Li
To this end, we adopt LLMs to simulate clients and propose ClientCAST, a client-centered approach to assessing LLM therapists by client simulation.
1 code implementation • 18 Jun 2024 • Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, PengFei Liu
We delve into the models' cognitive reasoning abilities, their performance across different modalities, and their outcomes in process-level evaluations, which are vital for tasks requiring complex reasoning with lengthy solutions.
1 code implementation • 29 Apr 2024 • Yang Xiao
This study not only advances the application of AI technology in the field of psychology but also provides a new psychological theoretical understanding of the information processing of AI.
no code implementations • 25 Mar 2024 • Chengxuan Li, Di Huang, Zeyu Lu, Yang Xiao, Qingqi Pei, Lei Bai
Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications.
no code implementations • 15 Mar 2024 • Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou
Most existing one-shot skeleton-based action recognition focuses on raw low-level information (e.g., joint location), and may suffer from local information loss and low generalization ability.
no code implementations • 7 Mar 2024 • Bohan Liu, Zijie Zhang, Peixiong He, Zhensen Wang, Yang Xiao, Ruimeng Ye, Yang Zhou, Wei-Shinn Ku, Bo Hui
The Lottery Ticket Hypothesis (LTH) states that a dense neural network model contains a highly sparse subnetwork (i.e., winning tickets) that can achieve even better performance than the original model when trained in isolation.
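To make the idea of a sparse subnetwork in the entry above concrete, here is a minimal sketch of one-shot magnitude pruning, a common way to pick a candidate "ticket". It is not taken from the listed paper: the function name `magnitude_mask`, the `sparsity` parameter, and the random stand-in weights are all illustrative assumptions.

```python
import numpy as np

def magnitude_mask(weights, sparsity):
    """Boolean mask that keeps the largest-magnitude weights.

    `sparsity` is the fraction of weights to remove (hypothetical parameter).
    """
    flat = np.abs(weights).ravel()
    num_pruned = int(flat.size * sparsity)
    if num_pruned == 0:
        return np.ones(weights.shape, dtype=bool)
    # Threshold is the num_pruned-th smallest magnitude; everything at or below it is removed
    threshold = np.partition(flat, num_pruned - 1)[num_pruned - 1]
    return np.abs(weights) > threshold

rng = np.random.default_rng(0)
initial = rng.normal(size=(4, 4))            # weights at initialization
trained = initial + rng.normal(size=(4, 4))  # stand-in for weights after training
mask = magnitude_mask(trained, sparsity=0.75)
ticket = initial * mask                      # sparse subnetwork reset to its initial values
print(int(mask.sum()), "of", mask.size, "weights kept")
```

In the usual procedure the masked network `ticket` would then be retrained in isolation and compared against the dense model; that training loop is omitted here.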
no code implementations • 5 Feb 2024 • Yang Xiao, Rohan Kumar Das
To address this issue, we introduce a novel framework referred to as dual knowledge distillation for developing efficient SED systems in this work.
Ranked #2 on Sound Event Detection on DESED (using extra training data)
1 code implementation • 28 Dec 2023 • Yang Xiao, Yi Cheng, Jinlan Fu, Jiashuo Wang, Wenjie Li, PengFei Liu
In recent years, AI has demonstrated remarkable capabilities in simulating human behaviors, particularly those implemented with large language models (LLMs).
no code implementations • CVPR 2024 • Yingda Yin, Yuzheng Liu, Yang Xiao, Daniel Cohen-Or, Jingwei Huang, Baoquan Chen
Advancements in 3D instance segmentation have traditionally been tethered to the availability of annotated datasets, limiting their application to a narrow spectrum of object categories.
1 code implementation • 10 Nov 2023 • Shanghao Shi, Ning Wang, Yang Xiao, Chaoyu Zhang, Yi Shi, Y. Thomas Hou, Wenjing Lou
The first step is to reconstruct the latent space representations (LSRs) from the aggregated model updates using a closed-form inversion mechanism, leveraging specially crafted linear layers.
1 code implementation • 27 Oct 2023 • Yiran Guan, Zhuoguang Chen, Wenzheng Zeng, Zhiguo Cao, Yang Xiao
In this letter, we propose a new method, Multi-Clue Gaze (MCGaze), to facilitate video gaze estimation by capturing the spatial-temporal interaction context among head, face, and eye in an end-to-end learning way, which has not yet been well studied.
Ranked #1 on Gaze Estimation on Gaze360
no code implementations • 8 Oct 2023 • Hongyu Zhao, Gongming Wei, Yang Xiao, Xianglei Xing
The low frame rates and severe image shake caused by wave turbulence in ship datasets often result in minimal, or even zero, Intersection over Union (IoU) between the predicted and detected bounding boxes.
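As a small illustration of the IoU measure mentioned in the entry above, the sketch below computes it for two axis-aligned boxes. The `(x1, y1, x2, y2)` box layout and the example coordinates are assumptions made for illustration, not details from the listed paper.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped at zero when the boxes are disjoint
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0

# A box that shifts by more than its own width between frames no longer overlaps
predicted = (0, 0, 10, 10)
detected = (12, 0, 22, 10)
print(iou(predicted, detected))  # 0.0
```

A tracker that associates boxes purely by IoU has nothing to match on in this situation, which is the failure case the entry describes.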
1 code implementation • IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2023 • Wenzheng Zeng, Yang Xiao, Guilei Hu, Zhiguo Cao, Sicheng Wei, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan
The experiments verify that our proposal is significantly superior to state-of-the-art methods while running in real time.
Ranked #1 on Eyeblink detection on HUST-LEBW
1 code implementation • CVPR 2023 • Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou
3D interacting hand pose estimation from a single RGB image is a challenging task, due to serious self-occlusion and inter-occlusion between hands, confusingly similar appearance patterns between the two hands, ill-posed joint position mapping from 2D to 3D, etc. To address these, we propose to extend A2J, the state-of-the-art depth-based 3D single hand pose estimation method, to the RGB domain under the interacting hand condition.
Ranked #7 on Hand Pose Estimation on NYU Hands
1 code implementation • CVPR 2023 • Wenzheng Zeng, Yang Xiao, Sicheng Wei, Jinfang Gan, Xintao Zhang, Zhiguo Cao, Zhiwen Fang, Joey Tianyi Zhou
Experiments on MPEblink verify the essential challenges of real-time multi-person eyeblink detection in the wild for untrimmed video.
Ranked #1 on MPEblink
1 code implementation • 19 Sep 2022 • Georgy Ponimatkin, Nermin Samet, Yang Xiao, Yuming Du, Renaud Marlet, Vincent Lepetit
We propose a simple, yet powerful approach for unsupervised object segmentation in videos.
Ranked #1 on Unsupervised Video Object Segmentation on SegTrack v2 (Jaccard (Mean) metric)
1 code implementation • 15 Sep 2022 • Van Nguyen Nguyen, Yuming Du, Yang Xiao, Michael Ramamonjisoa, Vincent Lepetit
Our results on challenging datasets are on par with previous works that require much more information (training images of the target objects, 3D models, and/or depth data).
no code implementations • 23 Aug 2022 • Boshen Zhang, Yuxi Li, Yuanpeng Tu, Jinlong Peng, Yabiao Wang, Cunlin Wu, Yang Xiao, Cairong Zhao
Specifically, for the clean set, we deliberately design a memory-based modulation scheme to dynamically adjust the contribution of each sample in terms of its historical credibility sequence during training, thus alleviating the effect from noisy samples incorrectly grouped into the clean set.
1 code implementation • 15 Jul 2022 • Yang Xiao, Xubo Liu, James King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang
Experimental results on the DCASE 2019 Task 1 and ESC-50 dataset show that our proposed method outperforms baseline continual learning methods on classification accuracy and computational efficiency, indicating our method can efficiently and incrementally learn new classes without the catastrophic forgetting problem for on-device environmental sound classification.
no code implementations • NAACL 2022 • Yang Xiao, Jinlan Fu, See-Kiong Ng, PengFei Liu
In this paper, we ask the research question of whether all the datasets in the benchmark are necessary.
1 code implementation • 3 May 2022 • Md Hasan Shahriar, Yang Xiao, Pablo Moriano, Wenjing Lou, Y. Thomas Hou
As ordinary injection attacks disrupt the typical timing properties of the CAN data stream, rule-based intrusion detection systems (IDS) can easily detect them.
2 code implementations • CVPR 2022 • Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit
It relies on a small set of training objects to learn local object representations, which allow us to locally match the input image to a set of "templates", rendered images of the CAD models for the new objects.
1 code implementation • 30 Mar 2022 • Yang Xiao, Nana Hou, Eng Siong Chng
Catastrophic forgetting is a thorny challenge when updating keyword spotting (KWS) models after deployment.
no code implementations • ACL 2022 • Yang Xiao, Jinlan Fu, Weizhe Yuan, Vijay Viswanathan, Zhoumianze Liu, Yixin Liu, Graham Neubig, PengFei Liu
Despite data's crucial role in machine learning, most existing tools and research tend to focus on systems on top of existing data rather than how to interpret and manipulate data.
no code implementations • NeurIPS 2021 • Xi Shen, Yang Xiao, Shell Hu, Othman Sbai, Mathieu Aubry
In the problems of image retrieval and few-shot classification, the mainstream approaches focus on learning a better feature representation.
2 code implementations • 22 Oct 2021 • Yuming Du, Wen Guo, Yang Xiao, Vincent Lepetit
In this report, we introduce our (pretty straightforward) two-step "detect-then-match" video instance segmentation method.
1 code implementation • 19 Oct 2021 • Yuming Du, Wen Guo, Yang Xiao, Vincent Lepetit
We describe the two-stage instance segmentation framework we use to compete in the challenge.
1 code implementation • NAACL 2022 • Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren
We study the robustness of machine reading comprehension (MRC) models to entity renaming -- do models make more wrong predictions when the same questions are asked about an entity whose name has been changed?
no code implementations • 29 Sep 2021 • Boshen Zhang, Yuxi Li, Yuanpeng Tu, Yabiao Wang, Yang Xiao, Cai Rong Zhao, Chengjie Wang
For the clean set, we deliberately design a memory-based modulation scheme to dynamically adjust the contribution of each sample in terms of its historical credibility sequence during training, thus alleviating the effect of potential hard noisy samples in the clean set.
1 code implementation • 12 May 2021 • Yang Xiao, Yuming Du, Renaud Marlet
We experimented on Pascal3D+, ObjectNet3D and Pix3D in a cross-dataset fashion, with both seen and unseen classes.
no code implementations • ICCV 2021 • Yuming Du, Yang Xiao, Vincent Lepetit
Through extensive experiments, we show that our method can generate a high-quality training set which significantly boosts the performance of segmenting objects of unseen classes.
1 code implementation • ACL 2021 • PengFei Liu, Jinlan Fu, Yang Xiao, Weizhe Yuan, Shuaicheng Chang, Junqi Dai, Yixin Liu, Zihuiwen Ye, Zi-Yi Dou, Graham Neubig
In this paper, we present a new conceptualization and implementation of NLP evaluation: the ExplainaBoard, which in addition to inheriting the functionality of the standard leaderboard, also allows researchers to (i) diagnose strengths and weaknesses of a single system (e.g., what is the best-performing system bad at?)
1 code implementation • 17 Nov 2020 • Sicheng Zhao, Yang Xiao, Jiang Guo, Xiangyu Yue, Jufeng Yang, Ravi Krishna, Pengfei Xu, Kurt Keutzer
C-CycleGAN transfers source samples at instance-level to an intermediate domain that is closer to the target domain with sentiment semantics preserved and without losing discriminative features.
7 code implementations • 11 Oct 2020 • Xiang An, Xuhan Zhu, Yang Xiao, Lan Wu, Ming Zhang, Yuan Gao, Bin Qin, Debing Zhang, Ying Fu
The experiment demonstrates no loss of accuracy when training with only 10% randomly sampled classes for the softmax-based loss functions, compared with training with full classes using state-of-the-art models on mainstream benchmarks.
Ranked #2 on Face Identification on MegaFace
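The class-sampling idea in the entry above can be sketched roughly as follows: compute the softmax cross-entropy over only a random subset of class weight vectors. This is a simplified illustration, not the authors' implementation. The function name `sampled_softmax_loss`, its parameters, and the choice to always keep the classes present in the batch are assumptions of this sketch.

```python
import numpy as np

def sampled_softmax_loss(features, weights, labels, sample_rate, rng):
    """Cross-entropy over a random subset of classes.

    features: (B, D) embeddings, weights: (C, D) class weights, labels: (B,) class ids.
    Classes present in the batch are always kept (an assumption of this sketch).
    """
    num_classes = weights.shape[0]
    num_sampled = max(int(num_classes * sample_rate), 1)
    positives = np.unique(labels)
    # Fill the remaining budget with randomly chosen classes absent from the batch
    negatives = np.setdiff1d(np.arange(num_classes), positives)
    num_neg = min(max(num_sampled - positives.size, 0), negatives.size)
    chosen_neg = rng.choice(negatives, size=num_neg, replace=False)
    kept = np.concatenate([positives, chosen_neg])
    # Map original class ids to their positions inside the sampled subset
    remap = {int(c): i for i, c in enumerate(kept)}
    local_labels = np.array([remap[int(c)] for c in labels])
    logits = features @ weights[kept].T
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    loss = -log_probs[np.arange(len(labels)), local_labels].mean()
    return loss, kept

rng = np.random.default_rng(0)
features = rng.normal(size=(4, 8))
weights = rng.normal(size=(1000, 8))
labels = np.array([3, 17, 17, 999])
loss, kept = sampled_softmax_loss(features, weights, labels, sample_rate=0.1, rng=rng)
print(kept.size)  # 100 of the 1000 classes take part in this step
```

Only the rows `weights[kept]` enter the matrix product, so the cost of the final layer scales with the sampling rate rather than with the full number of classes.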
no code implementations • 21 Aug 2020 • Zhang Li, Jiehua Zhang, Tao Tan, Xichao Teng, Xiaoliang Sun, Yang Li, Lihong Liu, Yang Xiao, Byungjae Lee, Yilong Li, Qianni Zhang, Shujiao Sun, Yushan Zheng, Junyu Yan, Ni Li, Yiyu Hong, Junsu Ko, Hyun Jung, Yanling Liu, Yu-cheng Chen, Ching-Wei Wang, Vladimir Yurovskiy, Pavel Maevskikh, Vahid Khanagha, Yi Jiang, Xiangjun Feng, Zhihong Liu, Daiqiang Li, Peter J. Schüffler, Qifeng Yu, Hui Chen, Yuling Tang, Geert Litjens
All methods were based on deep learning and categorized into two groups: multi-model methods and single-model methods.
2 code implementations • ECCV 2020 • Yang Xiao, Vincent Lepetit, Renaud Marlet
In this paper, we tackle the problems of few-shot object detection and few-shot viewpoint estimation.
Ranked #16 on Few-Shot Object Detection on MS-COCO (30-shot)
1 code implementation • 23 Jul 2020 • Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet
The former provides a way to generate large-scale accurate occlusion datasets while, based on the latter, we propose a novel method for task-independent pixel-level occlusion relationship estimation from single images.
1 code implementation • 11 Jul 2020 • Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu
Embedding RMML into the proposed ECML mechanism, our metric learning paradigm (EC-RMML) can run in the one-pass learning manner.
2 code implementations • CVPR 2020 • Haozhe Qi, Chen Feng, Zhiguo Cao, Feng Zhao, Yang Xiao
Specifically, we first sample seeds from the point clouds in template and search area respectively.
1 code implementation • CVPR 2020 • Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan
Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly.
2 code implementations • ICLR 2020 • Shell Xu Hu, Pablo G. Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas Damianou
The evidence lower bound of the marginal log-likelihood of empirical Bayes decomposes as a sum of local KL divergences between the variational posterior and the true posterior on the query set of each task.
Ranked #13 on Few-Shot Image Classification on CIFAR-FS 5-way (1-shot)
no code implementations • ECCV 2020 • Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yun-hui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim
To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set.
no code implementations • 19 Mar 2020 • Jiawei Wu, Jianxue Li, Yang Xiao, Jun Liu
Routing is one of the key functions for stable operation of network infrastructure.
2 code implementations • ICCV 2019 • Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan
For the task of 3D hand and body pose estimation from depth images, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with end-to-end learning ability is proposed.
Ranked #1 on Hand Pose Estimation on K2HPD
2 code implementations • 12 Jun 2019 • Yang Xiao, Xuchong Qiu, Pierre-Alain Langlois, Mathieu Aubry, Renaud Marlet
Most deep pose estimation methods need to be trained for specific object instances or categories.
1 code implementation • 30 Apr 2019 • Chen Zhao, Jiaqi Yang, Yang Xiao, Zhiguo Cao
Correspondence selection aiming at seeking correct feature correspondences from raw feature matches is pivotal for a number of feature-matching-based tasks.
no code implementations • 21 Feb 2019 • Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan
Effective and real-time eyeblink detection has a wide range of applications, such as deception detection, driver fatigue detection, face anti-spoofing, etc.
Ranked #2 on Eyeblink detection on HUST-LEBW
1 code implementation • 29 Jul 2018 • Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou
Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc.
1 code implementation • ACL 2018 • Hang Yang, Yubo Chen, Kang Liu, Yang Xiao, Jun Zhao
We present an event extraction framework to detect event mentions and extract events from the document-level financial news.
1 code implementation • 29 Jun 2018 • Yang Xiao, Jun Chen, Yancheng Wang, Zhiguo Cao, Joey Tianyi Zhou, Xiang Bai
To better exploit three-dimensional (3D) characteristics, multi-view dynamic images are proposed.
no code implementations • CVPR 2018 • Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo
In this paper we study the problem of monocular relative depth perception in the wild.
no code implementations • 6 Apr 2018 • Jiaqi Yang, Ke Xian, Yang Xiao, Zhiguo Cao
This paper presents a thorough evaluation of several widely-used 3D correspondence grouping algorithms, motivated by their significance in vision tasks relying on correct feature correspondences.
no code implementations • 7 Jul 2017 • Hao Lu, Zhiguo Cao, Yang Xiao, Bohan Zhuang, Chunhua Shen
To our knowledge, this is the first time that a plant-related counting problem is considered using computer vision technologies in an unconstrained field-based environment.
no code implementations • COLING 2016 • Yang Xiao, Yu-An Wang, Hangyu Mao, Zhen Xiao
Accurate prediction of user attributes from social media is valuable for both social science analysis and consumer targeting.