1 code implementation • ECCV 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz
Existing work on object detection often relies on a single form of annotation: the model is trained using either accurate yet costly bounding boxes or cheaper but less expressive image-level tags.
no code implementations • 21 Mar 2023 • Xishun Wang, Tong Su, Fang Da, Xiaodong Yang
To cope with these difficulties, this paper proposes a novel agent-centric model with anchor-informed proposals for efficient multimodal motion prediction.
1 code implementation • 14 Nov 2022 • Jie Wang, Yuzhou Peng, Xiaodong Yang, Ting Wang, YanMing Zhang
The SportsMOT dataset aims to solve multiple object tracking of athletes in different sports scenes such as basketball or soccer.
no code implementations • 13 Jul 2022 • Tong Su, Xishun Wang, Xiaodong Yang
To safely navigate in various complex traffic scenarios, autonomous driving systems are generally equipped with a motion forecasting module to provide vital information for the downstream planning module.
no code implementations • 27 Jun 2022 • Xiaodong Yang, Huishuai Zhang, Wei Chen, Tie-Yan Liu
By ensuring differential privacy in the learning algorithms, one can rigorously mitigate the risk of large models memorizing sensitive training data.
no code implementations • 28 Mar 2022 • Danfeng Wang, Xin Ma, Xiaodong Yang
Traffic light recognition, as a critical component of the perception module of self-driving vehicles, plays a vital role in the intelligent transportation systems.
no code implementations • 28 Feb 2022 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Napthade, Tom Gedeon
This article aims to use graphic engines to simulate a large number of training data that have free annotations and possibly strongly resemble to real-world data.
no code implementations • 16 Nov 2021 • Kuo Jiang, Zian Wang, Xiaodong Yang
A more interpretable edge-aware metric and an annotation scheme is designed for the testing set, which allows diagnostic evaluation for intrinsic models.
2 code implementations • ICCV 2021 • Chenxu Luo, Xiaodong Yang, Alan Yuille
3D multi-object tracking in LiDAR point clouds is a key ingredient for self-driving vehicles.
no code implementations • 9 Aug 2021 • Xiaodong Yang, Tom Yamaguchi, Hoang-Dung Tran, Bardh Hoxha, Taylor T Johnson, Danil Prokhorov
Formally verifying the safety and robustness of well-trained DNNs and learning-enabled systems under attacks, model uncertainties, and sensing errors is essential for safe autonomy.
no code implementations • 22 Jun 2021 • Xiaodong Yang, Tomoya Yamaguchi, Hoang-Dung Tran, Bardh Hoxha, Taylor T Johnson, Danil Prokhorov
Besides the computation of reachable sets, our approach is also capable of backtracking to the input domain given an output reachable set.
1 code implementation • 25 Apr 2021 • Milind Naphade, Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Xiaodong Yang, Yue Yao, Liang Zheng, Pranamesh Chakraborty, Christian E. Lopez, Anuj Sharma, Qi Feng, Vitaly Ablavsky, Stan Sclaroff
Track 3 addressed city-scale multi-target multi-camera vehicle tracking.
1 code implementation • CVPR 2021 • Chenxu Luo, Xiaodong Yang, Alan Yuille
Autonomous driving can benefit from motion behavior comprehension when interacting with diverse traffic participants in highly dynamic environments.
no code implementations • 11 Jan 2021 • Xiaodong Yang, Yunrui Ge, Bo Zhang, Jun Li
High-fidelity control of quantum systems is crucial for quantum information processing, but is often limited by perturbations from the environment and imperfections in the applied control fields.
Quantum Physics
no code implementations • 6 Nov 2020 • Xiaodong Yang, Zehao Song, Jinyu Wen, Chongbo Xu, Qiuwei Wu, Youbing Zhang, Menglin Zhang, Shijie Cheng
Different from most transactive control studies only focusing on economic aspect, this paper develops a novel network-constrained transactive control (NTC) framework that can address both economic and secure issues for a multi-microgrids-based distribution network considering uncertainties.
Optimization and Control
no code implementations • 21 Oct 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz
Existing work on object detection often relies on a single form of annotation: the model is trained using either accurate yet costly bounding boxes or cheaper but less expressive image-level tags.
no code implementations • 20 Jul 2020 • Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz
Thus, the motion features at higher levels are trained to gradually capture semantic dynamics and evolve more discriminative for action recognition.
1 code implementation • ECCV 2020 • Yang Zou, Xiaodong Yang, Zhiding Yu, B. V. K. Vijaya Kumar, Jan Kautz
To this end, we propose a joint learning framework that disentangles id-related/unrelated features and enforces adaptation to work on the id-related feature space exclusively.
Ranked #6 on
Unsupervised Domain Adaptation
on Market to MSMT
1 code implementation • ECCV 2020 • Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem
Given pairs of images and captions, we maximize compatibility of the attention-weighted regions and the words in the corresponding caption, compared to non-corresponding pairs of images and captions.
3 code implementations • ICCV 2019 • Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang
In comparison with person re-identification (ReID), which has been widely studied in the research community, vehicle ReID has received less attention.
no code implementations • 30 Apr 2020 • Milind Naphade, Shuo Wang, David Anastasiu, Zheng Tang, Ming-Ching Chang, Xiaodong Yang, Liang Zheng, Anuj Sharma, Rama Chellappa, Pranamesh Chakraborty
Track 3 addressed city-scale multi-target multi-camera vehicle tracking.
no code implementations • 26 Apr 2020 • Weiming Xiang, Hoang-Dung Tran, Xiaodong Yang, Taylor T. Johnson
Then, in combination with reachability methods developed for various dynamical system classes modeled by ordinary differential equations, a recursive algorithm is developed for over-approximating the reachable set of the closed-loop system.
1 code implementation • 12 Apr 2020 • Hoang-Dung Tran, Xiaodong Yang, Diego Manzanas Lopez, Patrick Musau, Luan Viet Nguyen, Weiming Xiang, Stanley Bak, Taylor T. Johnson
For learning-enabled CPS, such as closed-loop control systems incorporating neural networks, NNV provides exact and over-approximate reachability analysis schemes for linear plant models and FFNN controllers with piecewise-linear activation functions, such as ReLUs.
2 code implementations • CVPR 2020 • Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Yong Jae Lee, Alexander G. Schwing, Jan Kautz
Weakly supervised learning has emerged as a compelling tool for object detection by reducing the need for strong supervision during training.
Ranked #1 on
Weakly Supervised Object Detection
on COCO test-dev
1 code implementation • 2 Mar 2020 • Xiaodong Yang, Hoang-Dung Tran, Weiming Xiang, Taylor Johnson
To address this challenge, we propose a parallelizable technique to compute exact reachable sets of a neural network to an input set.
1 code implementation • 29 Jan 2020 • Yiqiang Chen, Xiaodong Yang, Xin Qin, Han Yu, Biao Chen, Zhiqi Shen
It maintains a small set of benchmark samples on the FL server and quantifies the credibility of the client local data without directly observing them by computing the mutual cross-entropy between performance of the FL model on the local datasets and that of the client local FL model on the benchmark dataset.
no code implementations • 29 Jan 2020 • Shanhui Sun, Jing Hu, Mingqing Yao, Jinrong Hu, Xiaodong Yang, Qi Song, Xi Wu
To this end, these two components are tackled in an end-to-end manner via reinforcement learning in this work.
2 code implementations • ECCV 2020 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Naphade, Tom Gedeon
Between synthetic and real data, there is a two-level domain gap, i. e., content level and appearance level.
1 code implementation • NeurIPS 2019 • Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
In the analysis phase, we decompose a dance into a series of basic dance units, through which the model learns how to move.
Ranked #3 on
Motion Synthesis
on BRACE
1 code implementation • ICCV 2019 • Huizi Mao, Xiaodong Yang, William J. Dally
Average precision (AP) is a widely used metric to evaluate detection accuracy of image and video object detectors.
no code implementations • 29 May 2019 • Matthijs Van Keirsbilck, Alexander Keller, Xiaodong Yang
We study structurally sparse RNNs, showing that they are well suited for acceleration on parallel hardware, with a greatly reduced cost of the recurrent operations as well as orders of magnitude less recurrent weights.
1 code implementation • CVPR 2019 • Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry Davis, Jan Kautz
In this paper, we propose Spatio-TEmporal Progressive (STEP) action detector---a progressive learning framework for spatio-temporal action detection in videos.
Ranked #2 on
Action Detection
on UCF101-24
12 code implementations • CVPR 2019 • Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz
To this end, we propose a joint learning framework that couples re-id learning and data generation end-to-end.
Ranked #1 on
Person Re-Identification
on UAV-Human
Image-to-Image Translation
Unsupervised Domain Adaptation
+1
no code implementations • CVPR 2019 • Zheng Tang, Milind Naphade, Ming-Yu Liu, Xiaodong Yang, Stan Birchfield, Shuo Wang, Ratnesh Kumar, David Anastasiu, Jenq-Neng Hwang
Urban traffic optimization using traffic cameras as sensors is driving the need to advance state-of-the-art multi-target multi-camera (MTMC) tracking.
no code implementations • 29 Nov 2018 • Yuancheng Ye, Xiaodong Yang, YingLi Tian
In this paper, we address the challenging problem of spatial and temporal action detection in videos.
no code implementations • 28 Nov 2018 • Longlong Jing, Xiaodong Yang, Jingen Liu, YingLi Tian
The success of deep neural networks generally requires a vast amount of training data to be labeled, which is expensive and unfeasible in scale, especially for video collections.
Ranked #39 on
Self-Supervised Action Recognition
on HMDB51
Self-Supervised Action Recognition
Temporal Action Localization
+1
2 code implementations • 3 Oct 2018 • Weiming Xiang, Patrick Musau, Ayana A. Wild, Diego Manzanas Lopez, Nathaniel Hamilton, Xiaodong Yang, Joel Rosenfeld, Taylor T. Johnson
This survey presents an overview of verification techniques for autonomous systems, with a focus on safety-critical autonomous cyber-physical systems (CPS) and subcomponents thereof.
2 code implementations • 14 Sep 2018 • Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz
We investigate two crucial and closely related aspects of CNNs for optical flow estimation: models and training.
Ranked #4 on
Optical Flow Estimation
on KITTI 2012
no code implementations • CVPR 2018 • Xiaodong Yang, Pavlo Molchanov, Jan Kautz
Recurrent neural networks (RNNs) have emerged as a powerful model for a broad range of machine learning problems that involve sequential data.
no code implementations • 30 Nov 2017 • Behrooz Mahasseni, Xiaodong Yang, Pavlo Molchanov, Jan Kautz
In this paper, we address the challenging problem of efficient temporal activity detection in untrimmed long videos.
16 code implementations • CVPR 2018 • Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz
It then uses the warped features and features of the first image to construct a cost volume, which is processed by a CNN to estimate the optical flow.
Ranked #3 on
Dense Pixel Correspondence Estimation
on HPatches
Dense Pixel Correspondence Estimation
Optical Flow Estimation
5 code implementations • CVPR 2018 • Sergey Tulyakov, Ming-Yu Liu, Xiaodong Yang, Jan Kautz
The proposed framework generates a video by mapping a sequence of random vectors to a sequence of video frames.
no code implementations • CVPR 2017 • Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz
We are inspired by the fact that the computation performed in an RNN bears resemblance to Bayesian filters, which have been used for tracking in many previous methods for facial analysis from videos.
Ranked #1 on
Head Pose Estimation
on BIWI
(MAE (trained with BIWI data) metric, using extra
training data)
no code implementations • CVPR 2016 • Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz
Automatic detection and classification of dynamic hand gestures in real-world systems intended for human computer interaction is challenging as: 1) there is a large diversity in how people perform gestures, making detection and classification difficult; 2) the system must work online in order to avoid noticeable lag between performing a gesture and its classification; in fact, a negative lag (classification before the gesture is finished) is desirable, as feedback to the user can then be truly instantaneous.
no code implementations • CVPR 2014 • Xiaodong Yang, YingLi Tian
This paper presents a new framework for human activity recognition from video sequences captured by a depth camera.