Search Results for author: Yang Xiao

Found 54 papers, 29 papers with code

Partial FC: Training 10 Million Identities on a Single Machine

7 code implementations 11 Oct 2020 Xiang An, Xuhan Zhu, Yang Xiao, Lan Wu, Ming Zhang, Yuan Gao, Bin Qin, Debing Zhang, Ying Fu

The experiment demonstrates no loss of accuracy when training with only 10% randomly sampled classes for the softmax-based loss functions, compared with training with full classes using state-of-the-art models on mainstream benchmarks.

Face Identification Face Recognition +2
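
The class-sampling idea described in the Partial FC abstract can be illustrated with a minimal sketch. It assumes that classes present in the current batch are always kept and that the remaining classes are drawn uniformly at random. The function names `sample_classes` and `sampled_softmax_loss` are hypothetical, and this is not the authors' implementation.

```python
import math
import random


def sample_classes(num_classes, positive_classes, rate=0.1, rng=None):
    """Pick a subset of class ids: all positives plus random negatives.

    Assumption: positives (classes seen in the batch) are always retained.
    """
    rng = rng or random.Random(0)
    positives = sorted(set(positive_classes))
    positive_set = set(positives)
    # Target subset size is rate * num_classes, but never fewer than the positives.
    num_sampled = max(len(positives), int(num_classes * rate))
    negatives = [c for c in range(num_classes) if c not in positive_set]
    sampled_negatives = rng.sample(negatives, num_sampled - len(positives))
    return positives + sampled_negatives


def sampled_softmax_loss(logits, label, sampled):
    """Cross-entropy for one sample, with the softmax restricted to `sampled`."""
    z = [logits[c] for c in sampled]
    m = max(z)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(v - m) for v in z))
    return log_sum - logits[label]


# Example: 100 classes, batch contains classes 3 and 7, 10% sampling rate.
subset = sample_classes(100, [3, 7], rate=0.1)
loss = sampled_softmax_loss([0.0] * 100, 3, subset)
```

The softmax normalizer is computed over 10 classes instead of 100 here, which is where the memory and compute savings in such a scheme would come from.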

ExplainaBoard: An Explainable Leaderboard for NLP

1 code implementation ACL 2021 PengFei Liu, Jinlan Fu, Yang Xiao, Weizhe Yuan, Shuaicheng Chang, Junqi Dai, Yixin Liu, Zihuiwen Ye, Zi-Yi Dou, Graham Neubig

In this paper, we present a new conceptualization and implementation of NLP evaluation: the ExplainaBoard, which in addition to inheriting the functionality of the standard leaderboard, also allows researchers to (i) diagnose strengths and weaknesses of a single system (e.g., what is the best-performing system bad at?)

Machine Translation

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image

2 code implementations ICCV 2019 Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan

For the 3D hand and body pose estimation task in depth images, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with end-to-end learning ability is proposed.

3D Pose Estimation Depth Estimation +1

Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

2 code implementations ICLR 2020 Shell Xu Hu, Pablo G. Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas Damianou

The evidence lower bound of the marginal log-likelihood of empirical Bayes decomposes as a sum of local KL divergences between the variational posterior and the true posterior on the query set of each task.

Few-Shot Image Classification Meta-Learning +3

Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions

2 code implementations CVPR 2022 Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit

It relies on a small set of training objects to learn local object representations, which allow us to locally match the input image to a set of "templates", rendered images of the CAD models for the new objects.

6D Pose Estimation 6D Pose Estimation using RGB +1

UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution

2 code implementations 22 Oct 2021 Yuming Du, Wen Guo, Yang Xiao, Vincent Lepetit

In this report, we introduce our (pretty straightforward) two-step "detect-then-match" video instance segmentation method.

Instance Segmentation Optical Flow Estimation +3

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image

1 code implementation CVPR 2023 Changlong Jiang, Yang Xiao, Cunlin Wu, Mingyang Zhang, Jinghong Zheng, Zhiguo Cao, Joey Tianyi Zhou

3D interacting hand pose estimation from a single RGB image is a challenging task, due to serious self-occlusion and inter-occlusion between hands, confusingly similar appearance patterns between the two hands, ill-posed joint position mapping from 2D to 3D, etc. To address these, we propose to extend A2J, the state-of-the-art depth-based 3D single hand pose estimation method, to the RGB domain under the interacting hand condition.

3D Interacting Hand Pose Estimation Hand Pose Estimation +1

PIZZA: A Powerful Image-only Zero-Shot Zero-CAD Approach to 6 DoF Tracking

1 code implementation 15 Sep 2022 Van Nguyen Nguyen, Yuming Du, Yang Xiao, Michael Ramamonjisoa, Vincent Lepetit

Our results on challenging datasets are on par with previous works that require much more information (training images of the target objects, 3D models, and/or depth data).

Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application

1 code implementation 23 Jul 2020 Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet

The former provides a way to generate large-scale accurate occlusion datasets while, based on the latter, we propose a novel method for task-independent pixel-level occlusion relationship estimation from single images.

Monocular Depth Estimation Occlusion Estimation

End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

1 code implementation 27 Oct 2023 Yiran Guan, Zhuoguang Chen, Wenzheng Zeng, Zhiguo Cao, Yang Xiao

In this letter, we propose a new method, Multi-Clue Gaze (MCGaze), to facilitate video gaze estimation by capturing the spatial-temporal interaction context among head, face, and eye in an end-to-end learning manner, which has not yet been well studied.

Gaze Estimation

Towards Good Practices on Building Effective CNN Baseline Model for Person Re-identification

1 code implementation 29 Jul 2018 Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou

Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc.

Open-Ended Question Answering Person Re-Identification

How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation

1 code implementation 28 Dec 2023 Yang Xiao, Yi Cheng, Jinlan Fu, Jiashuo Wang, Wenjie Li, PengFei Liu

Human behavior simulation of AI agents necessitates the agents to possess a quality of believability, which is crucial as it facilitates users in establishing trust toward the agents and streamlines the fulfillment of the agents' goal.

Language Modelling Large Language Model

Continual Learning For On-Device Environmental Sound Classification

1 code implementation 15 Jul 2022 Yang Xiao, Xubo Liu, James King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang

Experimental results on the DCASE 2019 Task 1 and ESC-50 dataset show that our proposed method outperforms baseline continual learning methods on classification accuracy and computational efficiency, indicating our method can efficiently and incrementally learn new classes without the catastrophic forgetting problem for on-device environmental sound classification.

Classification Computational Efficiency +3

On the Robustness of Reading Comprehension Models to Entity Renaming

1 code implementation NAACL 2022 Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren

We study the robustness of machine reading comprehension (MRC) models to entity renaming -- do models make more wrong predictions when the same questions are asked about an entity whose name has been changed?

Continual Pretraining Machine Reading Comprehension

Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting

1 code implementation 30 Mar 2022 Yang Xiao, Nana Hou, Eng Siong Chng

Catastrophic forgetting is a thorny challenge when updating keyword spotting (KWS) models after deployment.

Data Augmentation Incremental Learning +3

CANShield: Deep Learning-Based Intrusion Detection Framework for Controller Area Networks at the Signal-Level

1 code implementation 3 May 2022 Md Hasan Shahriar, Yang Xiao, Pablo Moriano, Wenjing Lou, Y. Thomas Hou

As ordinary injection attacks disrupt the typical timing properties of the CAN data stream, rule-based intrusion detection systems (IDS) can easily detect them.

Intrusion Detection Time Series +1

Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources

1 code implementation 17 Nov 2020 Sicheng Zhao, Yang Xiao, Jiang Guo, Xiangyu Yue, Jufeng Yang, Ravi Krishna, Pengfei Xu, Kurt Keutzer

C-CycleGAN transfers source samples at instance-level to an intermediate domain that is closer to the target domain with sentiment semantics preserved and without losing discriminative features.

Domain Adaptation Generative Adversarial Network +2

ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification

1 code implementation 11 Jul 2020 Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu

Embedding RMML into the proposed ECML mechanism, our metric learning paradigm (EC-RMML) can run in the one-pass learning manner.

Face Verification Fine-Grained Visual Recognition +1

Comparative evaluation of 2D feature correspondence selection algorithms

1 code implementation 30 Apr 2019 Chen Zhao, Jiaqi Yang, Yang Xiao, Zhiguo Cao

Correspondence selection aiming at seeking correct feature correspondences from raw feature matches is pivotal for a number of feature-matching-based tasks.

Performance Evaluation of 3D Correspondence Grouping Algorithms

no code implementations 6 Apr 2018 Jiaqi Yang, Ke Xian, Yang Xiao, Zhiguo Cao

This paper presents a thorough evaluation of several widely-used 3D correspondence grouping algorithms, motivated by their significance in vision tasks relying on correct feature correspondences.

3D Object Recognition Point Cloud Registration +1

TasselNet: Counting maize tassels in the wild via local counts regression network

no code implementations 7 Jul 2017 Hao Lu, Zhiguo Cao, Yang Xiao, Bohan Zhuang, Chunhua Shen

To our knowledge, this is the first time that a plant-related counting problem is considered using computer vision technologies under unconstrained field-based environment.

Plant Phenotyping regression

Predicting Restaurant Consumption Level through Social Media Footprints

no code implementations COLING 2016 Yang Xiao, Yu-An Wang, Hangyu Mao, Zhen Xiao

Accurate prediction of user attributes from social media is valuable for both social science analysis and consumer targeting.

Towards Real-time Eyeblink Detection in The Wild: Dataset, Theory and Practices

no code implementations 21 Feb 2019 Guilei Hu, Yang Xiao, Zhiguo Cao, Lubin Meng, Zhiwen Fang, Joey Tianyi Zhou, Junsong Yuan

Effective and real-time eyeblink detection has a wide range of applications, such as deception detection, driver fatigue detection, face anti-spoofing, etc.

Attribute Deception Detection +1

Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application

no code implementations ECCV 2020 Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet

We formalize concepts around geometric occlusion in 2D images (i.e., ignoring semantics), and propose a novel unified formulation of both occlusion boundaries and occlusion orientations via a pixel-pair occlusion relation.

Monocular Depth Estimation

Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos

no code implementations ICCV 2021 Yuming Du, Yang Xiao, Vincent Lepetit

Through extensive experiments, we show that our method can generate a high-quality training set which significantly boosts the performance of segmenting objects of unseen classes.

Object Open-World Instance Segmentation +3

Robust Learning with Adaptive Sample Credibility Modeling

no code implementations 29 Sep 2021 Boshen Zhang, Yuxi Li, Yuanpeng Tu, Yabiao Wang, Yang Xiao, Cai Rong Zhao, Chengjie Wang

For the clean set, we deliberately design a memory-based modulation scheme to dynamically adjust the contribution of each sample in terms of its historical credibility sequence during training, thereby alleviating the effect of potential hard noisy samples in the clean set.

Denoising

Re-ranking for image retrieval and transductive few-shot classification

no code implementations NeurIPS 2021 Xi Shen, Yang Xiao, Shell Hu, Othman Sbai, Mathieu Aubry

In the problems of image retrieval and few-shot classification, the mainstream approaches focus on learning a better feature representation.

Classification Few-Shot Learning +3

DataLab: A Platform for Data Analysis and Intervention

no code implementations ACL 2022 Yang Xiao, Jinlan Fu, Weizhe Yuan, Vijay Viswanathan, Zhoumianze Liu, Yixin Liu, Graham Neubig, PengFei Liu

Despite data's crucial role in machine learning, most existing tools and research tend to focus on systems on top of existing data rather than how to interpret and manipulate data.

Learning from Noisy Labels with Coarse-to-Fine Sample Credibility Modeling

no code implementations 23 Aug 2022 Boshen Zhang, Yuxi Li, Yuanpeng Tu, Jinlong Peng, Yabiao Wang, Cunlin Wu, Yang Xiao, Cairong Zhao

Specifically, for the clean set, we deliberately design a memory-based modulation scheme to dynamically adjust the contribution of each sample in terms of its historical credibility sequence during training, thus alleviating the effect from noisy samples incorrectly grouped into the clean set.

Denoising Image Classification

Multi-Ship Tracking by Robust Similarity metric

no code implementations 8 Oct 2023 Hongyu Zhao, Gongming Wei, Yang Xiao, Xianglei Xing

The low frame rates and severe image shake caused by wave turbulence in ship datasets often result in minimal, or even zero, Intersection over Union (IoU) between the predicted and detected bounding boxes.

Multi-Object Tracking Object
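
The IoU quantity mentioned in the multi-ship tracking abstract can be computed as in the following minimal sketch. It assumes axis-aligned boxes given as `(x1, y1, x2, y2)` corner coordinates; the function name `iou` and the box format are illustrative choices, not taken from the paper.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes (x1, y1, x2, y2), with x1 < x2 and y1 < y2."""
    # Corners of the intersection rectangle (may be empty).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero when the boxes do not overlap along an axis.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# A predicted box and a detection that has shifted by half its width and height.
overlap = iou((0, 0, 2, 2), (1, 1, 3, 3))
```

When a predicted box and the new detection do not overlap at all, this function returns 0, so an association rule based on IoU alone has no signal to match them.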

Scale-MIA: A Scalable Model Inversion Attack against Secure Federated Learning via Latent Space Reconstruction

no code implementations 10 Nov 2023 Shanghao Shi, Ning Wang, Yang Xiao, Chaoyu Zhang, Yi Shi, Y. Thomas Hou, Wenjing Lou

Unlike existing approaches treating models as black boxes, Scale-MIA recognizes the importance of the intricate architecture and inner workings of machine learning models.

Federated Learning

SAI3D: Segment Any Instance in 3D Scenes

no code implementations 17 Dec 2023 Yingda Yin, Yuzheng Liu, Yang Xiao, Daniel Cohen-Or, Jingwei Huang, Baoquan Chen

Advancements in 3D instance segmentation have traditionally been tethered to the availability of annotated datasets, limiting their application to a narrow spectrum of object categories.

3D Instance Segmentation Scene Parsing +2

Dual Knowledge Distillation for Efficient Sound Event Detection

no code implementations 5 Feb 2024 Yang Xiao, Rohan Kumar Das

To address this issue, we introduce a novel framework referred to as dual knowledge distillation for developing efficient SED systems in this work.

Ranked #2 on Sound Event Detection on DESED (using extra training data)

Event Detection Knowledge Distillation +1

A Survey of Lottery Ticket Hypothesis

no code implementations 7 Mar 2024 Bohan Liu, Zijie Zhang, Peixiong He, Zhensen Wang, Yang Xiao, Ruimeng Ye, Yang Zhou, Wei-Shinn Ku, Bo Hui

The Lottery Ticket Hypothesis (LTH) states that a dense neural network model contains a highly sparse subnetwork (i.e., winning tickets) that can achieve even better performance than the original model when trained in isolation.
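
One common way to look for such a sparse subnetwork is magnitude pruning followed by resetting the surviving weights to their initial values. The sketch below assumes that procedure and a flat list of weights; the helper names `magnitude_mask` and `winning_ticket` are hypothetical, and the survey covers many other variants.

```python
def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that drops the `sparsity` fraction of smallest-|w| entries."""
    n_prune = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute weight.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    mask = [1] * len(weights)
    for i in order[:n_prune]:
        mask[i] = 0
    return mask


def winning_ticket(initial_weights, trained_weights, sparsity):
    """Build a candidate ticket: mask from trained weights, values from initialization."""
    mask = magnitude_mask(trained_weights, sparsity)
    ticket = [w0 * m for w0, m in zip(initial_weights, mask)]
    return ticket, mask


# Toy example with four weights and 50% sparsity.
init = [0.5, -0.2, 0.1, 0.9]
trained = [0.05, -1.2, 0.7, -0.01]
ticket, mask = winning_ticket(init, trained, sparsity=0.5)
```

The candidate ticket would then be retrained in isolation, with the mask held fixed, and compared against the dense model.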

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

no code implementations 15 Mar 2024 Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou

Most existing one-shot skeleton-based action recognition focuses on raw low-level information (e.g., joint location), and may suffer from local information loss and low generalization ability.

Skeleton Based Action Recognition

A Survey on Long Video Generation: Challenges, Methods, and Prospects

no code implementations 25 Mar 2024 Chengxuan Li, Di Huang, Zeyu Lu, Yang Xiao, Qingqi Pei, Lei Bai

Video generation is a rapidly advancing research area, garnering significant attention due to its broad range of applications.

Video Generation
