Search Results for author: Xiaodong Yang

Found 51 papers, 25 papers with code

UFO²: A Unified Framework towards Omni-supervised Object Detection

1 code implementation • ECCV 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz

Existing work on object detection often relies on a single form of annotation: the model is trained using either accurate yet costly bounding boxes or cheaper but less expressive image-level tags.

Object object-detection +1

358

Paper
Code

Fundamental limits of community detection from multi-view data: multi-layer, dynamic and partially labeled block models

no code implementations • 16 Jan 2024 • Xiaodong Yang, Buyu Lin, Subhabrata Sen

Multi-view data arises frequently in modern network analysis e. g. relations of multiple types among individuals in social network analysis, longitudinal measurements of interactions among observational units, annotated networks with noisy partial labeling of vertices etc.

Community Detection Stochastic Block Model

Paper
Add Code

Benign Oscillation of Stochastic Gradient Descent with Large Learning Rates

no code implementations • 26 Oct 2023 • Miao Lu, Beining Wu, Xiaodong Yang, Difan Zou

In view of this finding, we call such a phenomenon "benign oscillation".

Paper
Add Code

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation

1 code implementation • ICCV 2023 • Zeyu Wang, Dingwen Li, Chenxu Luo, Cihang Xie, Xiaodong Yang

In this work, we propose to boost the representation learning of a multi-camera BEV based student detector by training it to imitate the features of a well-trained LiDAR based teacher detector.

3D Object Detection Autonomous Driving +4

Paper
Code

GEDepth: Ground Embedding for Monocular Depth Estimation

1 code implementation • ICCV 2023 • Xiaodong Yang, Zhuang Ma, Zhiyu Ji, Zhe Ren

Monocular depth estimation is an ill-posed problem as the same 2D image can be projected from infinite 3D scenes.

Ranked #2 on Monocular Depth Estimation on DDAD

Depth Prediction Monocular Depth Estimation

Paper
Code

Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving

1 code implementation • 12 Jun 2023 • Wei-Xin Li, Xiaodong Yang

Evaluating the performance of perception modules in autonomous driving is one of the most critical tasks in developing the complex intelligent system.

Autonomous Driving Reinforcement Learning (RL)

Paper
Code

PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds

1 code implementation • CVPR 2023 • Jinyu Li, Chenxu Luo, Xiaodong Yang

In order to deal with the sparse and unstructured raw point clouds, LiDAR based 3D object detection research mostly focuses on designing dedicated local point aggregators for fine-grained geometrical modeling.

Ranked #1 on 3D Object Detection on waymo vehicle

3D Object Detection Object +1

156

Paper
Code

ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals

no code implementations • CVPR 2023 • Xishun Wang, Tong Su, Fang Da, Xiaodong Yang

To cope with these difficulties, this paper proposes a novel agent-centric model with anchor-informed proposals for efficient multimodal motion prediction.

Motion Forecasting motion prediction

Paper
Add Code

SportsTrack: An Innovative Method for Tracking Athletes in Sports Scenes

1 code implementation • 14 Nov 2022 • Jie Wang, Yuzhou Peng, Xiaodong Yang, Ting Wang, YanMing Zhang

The SportsMOT dataset aims to solve multiple object tracking of athletes in different sports scenes such as basketball or soccer.

Multiple Object Tracking

Paper
Code

QML for Argoverse 2 Motion Forecasting Challenge

no code implementations • 13 Jul 2022 • Tong Su, Xishun Wang, Xiaodong Yang

To safely navigate in various complex traffic scenarios, autonomous driving systems are generally equipped with a motion forecasting module to provide vital information for the downstream planning module.

Motion Forecasting Navigate

Paper
Add Code

Normalized/Clipped SGD with Perturbation for Differentially Private Non-Convex Optimization

no code implementations • 27 Jun 2022 • Xiaodong Yang, Huishuai Zhang, Wei Chen, Tie-Yan Liu

By ensuring differential privacy in the learning algorithms, one can rigorously mitigate the risk of large models memorizing sensitive training data.

Paper
Add Code

TL-GAN: Improving Traffic Light Recognition via Data Synthesis for Autonomous Driving

no code implementations • 28 Mar 2022 • Danfeng Wang, Xin Ma, Xiaodong Yang

Traffic light recognition, as a critical component of the perception module of self-driving vehicles, plays a vital role in the intelligent transportation systems.

Autonomous Driving Image Generation

Paper
Add Code

Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond

2 code implementations • 28 Feb 2022 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Napthade, Tom Gedeon

This article aims to use graphic engines to simulate a large number of training data that have free annotations and possibly strongly resemble to real-world data.

Attribute Data Augmentation +2

153

Paper
Code

Learning Intrinsic Images for Clothing

no code implementations • 16 Nov 2021 • Kuo Jiang, Zian Wang, Xiaodong Yang

A more interpretable edge-aware metric and an annotation scheme is designed for the testing set, which allows diagnostic evaluation for intrinsic models.

Intrinsic Image Decomposition

Paper
Add Code

Exploring Simple 3D Multi-Object Tracking for Autonomous Driving

3 code implementations • ICCV 2021 • Chenxu Luo, Xiaodong Yang, Alan Yuille

3D multi-object tracking in LiDAR point clouds is a key ingredient for self-driving vehicles.

3D Multi-Object Tracking Autonomous Driving +4

163

Paper
Code

Neural Network Repair with Reachability Analysis

no code implementations • 9 Aug 2021 • Xiaodong Yang, Tom Yamaguchi, Hoang-Dung Tran, Bardh Hoxha, Taylor T Johnson, Danil Prokhorov

Formally verifying the safety and robustness of well-trained DNNs and learning-enabled systems under attacks, model uncertainties, and sensing errors is essential for safe autonomy.

Collision Avoidance

Paper
Add Code

Reachability Analysis of Convolutional Neural Networks

no code implementations • 22 Jun 2021 • Xiaodong Yang, Tomoya Yamaguchi, Hoang-Dung Tran, Bardh Hoxha, Taylor T Johnson, Danil Prokhorov

Besides the computation of reachable sets, our approach is also capable of backtracking to the input domain given an output reachable set.

Paper
Add Code

The 5th AI City Challenge

1 code implementation • 25 Apr 2021 • Milind Naphade, Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Xiaodong Yang, Yue Yao, Liang Zheng, Pranamesh Chakraborty, Christian E. Lopez, Anuj Sharma, Qi Feng, Vitaly Ablavsky, Stan Sclaroff

Track 3 addressed city-scale multi-target multi-camera vehicle tracking.

Anomaly Detection Computational Efficiency +2

Paper
Code

Self-Supervised Pillar Motion Learning for Autonomous Driving

1 code implementation • CVPR 2021 • Chenxu Luo, Xiaodong Yang, Alan Yuille

Autonomous driving can benefit from motion behavior comprehension when interacting with diverse traffic participants in highly dynamic environments.

Autonomous Driving Motion Estimation

117

Paper
Code

Robust Dynamical Decoupling for the Manipulation of a Spin Network via a Single Spin

no code implementations • 11 Jan 2021 • Xiaodong Yang, Yunrui Ge, Bo Zhang, Jun Li

High-fidelity control of quantum systems is crucial for quantum information processing, but is often limited by perturbations from the environment and imperfections in the applied control fields.

Quantum Physics

Paper
Add Code

Network-Constrained Transactive Control for Multi- Microgrids-based Distribution Networks with SOPs

no code implementations • 6 Nov 2020 • Xiaodong Yang, Zehao Song, Jinyu Wen, Chongbo Xu, Qiuwei Wu, Youbing Zhang, Menglin Zhang, Shijie Cheng

Different from most transactive control studies only focusing on economic aspect, this paper develops a novel network-constrained transactive control (NTC) framework that can address both economic and secure issues for a multi-microgrids-based distribution network considering uncertainties.

Optimization and Control

Paper
Add Code

UFO$^2$: A Unified Framework towards Omni-supervised Object Detection

no code implementations • 21 Oct 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz

Existing work on object detection often relies on a single form of annotation: the model is trained using either accurate yet costly bounding boxes or cheaper but less expressive image-level tags.

object-detection Object Detection

Paper
Add Code

Hierarchical Contrastive Motion Learning for Video Action Recognition

no code implementations • 20 Jul 2020 • Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz

Thus, the motion features at higher levels are trained to gradually capture semantic dynamics and evolve more discriminative for action recognition.

Action Recognition Contrastive Learning +2

Paper
Add Code

Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification

1 code implementation • ECCV 2020 • Yang Zou, Xiaodong Yang, Zhiding Yu, B. V. K. Vijaya Kumar, Jan Kautz

To this end, we propose a joint learning framework that disentangles id-related/unrelated features and enforces adaptation to work on the id-related feature space exclusively.

Ranked #6 on Unsupervised Domain Adaptation on Market to MSMT

Person Re-Identification Unsupervised Domain Adaptation

Paper
Code

Contrastive Learning for Weakly Supervised Phrase Grounding

1 code implementation • ECCV 2020 • Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem

Given pairs of images and captions, we maximize compatibility of the attention-weighted regions and the words in the corresponding caption, compared to non-corresponding pairs of images and captions.

Contrastive Learning Language Modelling +1

Paper
Code

PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data

8 code implementations • ICCV 2019 • Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang

In comparison with person re-identification (ReID), which has been widely studied in the research community, vehicle ReID has received less attention.

Attribute Multi-Task Learning +3

334

Paper
Code

The 4th AI City Challenge

no code implementations • 30 Apr 2020 • Milind Naphade, Shuo Wang, David Anastasiu, Zheng Tang, Ming-Ching Chang, Xiaodong Yang, Liang Zheng, Anuj Sharma, Rama Chellappa, Pranamesh Chakraborty

Track 3 addressed city-scale multi-target multi-camera vehicle tracking.

Anomaly Detection Computational Efficiency +1

Paper
Add Code

Reachable Set Estimation for Neural Network Control Systems: A Simulation-Guided Approach

no code implementations • 26 Apr 2020 • Weiming Xiang, Hoang-Dung Tran, Xiaodong Yang, Taylor T. Johnson

Then, in combination with reachability methods developed for various dynamical system classes modeled by ordinary differential equations, a recursive algorithm is developed for over-approximating the reachable set of the closed-loop system.

Paper
Add Code

NNV: The Neural Network Verification Tool for Deep Neural Networks and Learning-Enabled Cyber-Physical Systems

no code implementations • 12 Apr 2020 • Hoang-Dung Tran, Xiaodong Yang, Diego Manzanas Lopez, Patrick Musau, Luan Viet Nguyen, Weiming Xiang, Stanley Bak, Taylor T. Johnson

For learning-enabled CPS, such as closed-loop control systems incorporating neural networks, NNV provides exact and over-approximate reachability analysis schemes for linear plant models and FFNN controllers with piecewise-linear activation functions, such as ReLUs.

Paper
Add Code

Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

2 code implementations • CVPR 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Yong Jae Lee, Alexander G. Schwing, Jan Kautz

Weakly supervised learning has emerged as a compelling tool for object detection by reducing the need for strong supervision during training.

Ranked #1 on Weakly Supervised Object Detection on COCO test-dev

Object object-detection +3

358

Paper
Code

Reachability Analysis for Feed-Forward Neural Networks using Face Lattices

1 code implementation • 2 Mar 2020 • Xiaodong Yang, Hoang-Dung Tran, Weiming Xiang, Taylor Johnson

To address this challenge, we propose a parallelizable technique to compute exact reachable sets of a neural network to an input set.

Paper
Code

Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning

no code implementations • 29 Jan 2020 • Shanhui Sun, Jing Hu, Mingqing Yao, Jinrong Hu, Xiaodong Yang, Qi Song, Xi Wu

To this end, these two components are tackled in an end-to-end manner via reinforcement learning in this work.

Image Registration Medical Image Registration +2

Paper
Add Code

FOCUS: Dealing with Label Quality Disparity in Federated Learning

1 code implementation • 29 Jan 2020 • Yiqiang Chen, Xiaodong Yang, Xin Qin, Han Yu, Biao Chen, Zhiqi Shen

It maintains a small set of benchmark samples on the FL server and quantifies the credibility of the client local data without directly observing them by computing the mutual cross-entropy between performance of the FL model on the local datasets and that of the client local FL model on the benchmark dataset.

Federated Learning Privacy Preserving

Paper
Code

Simulating Content Consistent Vehicle Datasets with Attribute Descent

2 code implementations • ECCV 2020 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Naphade, Tom Gedeon

Between synthetic and real data, there is a two-level domain gap, i. e., content level and appearance level.

Attribute Data Augmentation +4

211

Paper
Code

Dancing to Music

2 code implementations • NeurIPS 2019 • Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz

In the analysis phase, we decompose a dance into a series of basic dance units, through which the model learns how to move.

Ranked #3 on Motion Synthesis on BRACE

Motion Synthesis Pose Estimation

521

Paper
Code

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell

1 code implementation • ICCV 2019 • Huizi Mao, Xiaodong Yang, William J. Dally

Average precision (AP) is a widely used metric to evaluate detection accuracy of image and video object detectors.

Object object-detection +1

Paper
Code

Rethinking Full Connectivity in Recurrent Neural Networks

no code implementations • 29 May 2019 • Matthijs Van Keirsbilck, Alexander Keller, Xiaodong Yang

We study structurally sparse RNNs, showing that they are well suited for acceleration on parallel hardware, with a greatly reduced cost of the recurrent operations as well as orders of magnitude less recurrent weights.

Action Recognition Language Modelling +3

Paper
Add Code

STEP: Spatio-Temporal Progressive Learning for Video Action Detection

1 code implementation • CVPR 2019 • Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry Davis, Jan Kautz

In this paper, we propose Spatio-TEmporal Progressive (STEP) action detector---a progressive learning framework for spatio-temporal action detection in videos.

Ranked #7 on Action Detection on UCF101-24

Action Detection Action Recognition

244

Paper
Code

Joint Discriminative and Generative Learning for Person Re-identification

12 code implementations • CVPR 2019 • Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz

To this end, we propose a joint learning framework that couples re-id learning and data generation end-to-end.

Ranked #1 on Person Re-Identification on UAV-Human

Image-to-Image Translation Unsupervised Domain Adaptation +1

3,948

Paper
Code

CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification

no code implementations • CVPR 2019 • Zheng Tang, Milind Naphade, Ming-Yu Liu, Xiaodong Yang, Stan Birchfield, Shuo Wang, Ratnesh Kumar, David Anastasiu, Jenq-Neng Hwang

Urban traffic optimization using traffic cameras as sensors is driving the need to advance state-of-the-art multi-target multi-camera (MTMC) tracking.

object-detection Object Detection +1

Paper
Add Code

Discovering Spatio-Temporal Action Tubes

no code implementations • 29 Nov 2018 • Yuancheng Ye, Xiaodong Yang, YingLi Tian

In this paper, we address the challenging problem of spatial and temporal action detection in videos.

Action Detection

Paper
Add Code

Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction

no code implementations • 28 Nov 2018 • Longlong Jing, Xiaodong Yang, Jingen Liu, YingLi Tian

The success of deep neural networks generally requires a vast amount of training data to be labeled, which is expensive and unfeasible in scale, especially for video collections.

Ranked #42 on Self-Supervised Action Recognition on HMDB51

Self-Supervised Action Recognition Temporal Action Localization +1

Paper
Add Code

Verification for Machine Learning, Autonomy, and Neural Networks Survey

1 code implementation • 3 Oct 2018 • Weiming Xiang, Patrick Musau, Ayana A. Wild, Diego Manzanas Lopez, Nathaniel Hamilton, Xiaodong Yang, Joel Rosenfeld, Taylor T. Johnson

This survey presents an overview of verification techniques for autonomous systems, with a focus on safety-critical autonomous cyber-physical systems (CPS) and subcomponents thereof.

BIG-bench Machine Learning General Classification

Paper
Code

Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation

2 code implementations • 14 Sep 2018 • Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz

We investigate two crucial and closely related aspects of CNNs for optical flow estimation: models and training.

Ranked #7 on Optical Flow Estimation on KITTI 2012

Optical Flow Estimation

1,585

Paper
Code

Making Convolutional Networks Recurrent for Visual Sequence Learning

no code implementations • CVPR 2018 • Xiaodong Yang, Pavlo Molchanov, Jan Kautz

Recurrent neural networks (RNNs) have emerged as a powerful model for a broad range of machine learning problems that involve sequential data.

Action Recognition Face Alignment +6

Paper
Add Code

Budget-Aware Activity Detection with A Recurrent Policy Network

no code implementations • 30 Nov 2017 • Behrooz Mahasseni, Xiaodong Yang, Pavlo Molchanov, Jan Kautz

In this paper, we address the challenging problem of efficient temporal activity detection in untrimmed long videos.

Action Detection Activity Detection

Paper
Add Code

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

21 code implementations • CVPR 2018 • Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz

It then uses the warped features and features of the first image to construct a cost volume, which is processed by a CNN to estimate the optical flow.

Ranked #3 on Dense Pixel Correspondence Estimation on HPatches

Dense Pixel Correspondence Estimation Optical Flow Estimation

1,585

Paper
Code

MoCoGAN: Decomposing Motion and Content for Video Generation

5 code implementations • CVPR 2018 • Sergey Tulyakov, Ming-Yu Liu, Xiaodong Yang, Jan Kautz

The proposed framework generates a video by mapping a sequence of random vectors to a sequence of video frames.

Ranked #4 on Video Generation on UCF-101 16 frames, Unconditional, Single GPU

Generative Adversarial Network Video Generation

560

Paper
Code

Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network

no code implementations • CVPR 2017 • Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz

We are inspired by the fact that the computation performed in an RNN bears resemblance to Bayesian filters, which have been used for tracking in many previous methods for facial analysis from videos.

Ranked #1 on Head Pose Estimation on BIWI (MAE (trained with BIWI data) metric, using extra training data)

Face Alignment Feature Engineering +3

Paper
Add Code

Online Detection and Classification of Dynamic Hand Gestures With Recurrent 3D Convolutional Neural Network

no code implementations • CVPR 2016 • Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz

Automatic detection and classification of dynamic hand gestures in real-world systems intended for human computer interaction is challenging as: 1) there is a large diversity in how people perform gestures, making detection and classification difficult; 2) the system must work online in order to avoid noticeable lag between performing a gesture and its classification; in fact, a negative lag (classification before the gesture is finished) is desirable, as feedback to the user can then be truly instantaneous.

Classification General Classification +1

Paper
Add Code

Super Normal Vector for Activity Recognition Using Depth Sequences

no code implementations • CVPR 2014 • Xiaodong Yang, YingLi Tian

This paper presents a new framework for human activity recognition from video sequences captured by a depth camera.

General Classification Human Activity Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.