E^2TAD: An Energy-Efficient Tracking-based Action Detector

no code implementations 9 Apr 2022 Xin Hu, Zhenyu Wu, Hao-Yu Miao, Siqi Fan, Taiyu Long, Zhenyu Hu, Pengcheng Pi, Yi Wu, Zhou Ren, Zhangyang Wang, Gang Hua

Video action detection (spatio-temporal action localization) is usually the starting point for human-centric intelligent analysis of videos nowadays.

Fine-Grained Action Detection Frame +3

Self-supervised Pretraining with Classification Labels for Temporal Activity Detection

no code implementations 26 Nov 2021 Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo

However, such pretrained models are not ideal for downstream detection performance due to the disparity between the pretraining and the downstream fine-tuning tasks.

Action Detection Activity Detection +3

Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking

no code implementations CVPR 2021 Yiding Yang, Zhou Ren, Haoxiang Li, Chunluan Zhou, Xinchao Wang, Gang Hua

In this paper, we propose a novel online approach to learning the pose dynamics, which are independent of pose detections in the current frame, and hence may serve as a robust estimation even in challenging scenarios including occlusion.

Frame Multi-Person Pose Estimation +2

Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification

1 code implementation 26 Nov 2019 Ye Yuan, Wuyang Chen, Tianlong Chen, Yang Yang, Zhou Ren, Zhangyang Wang, Gang Hua

Many real-world applications, such as city-scale traffic monitoring and control, require large-scale re-identification.

Streamlined Dense Video Captioning

1 code implementation CVPR 2019 Jonghwan Mun, Linjie Yang, Zhou Ren, Ning Xu, Bohyung Han

Dense video captioning is an extremely challenging task since accurate and coherent description of events in a video requires holistic understanding of video contents as well as contextual reasoning of individual events.

Dense Video Captioning

3D Hand Shape and Pose Estimation from a Single RGB Image

2 code implementations CVPR 2019 Liuhao Ge, Zhou Ren, Yuncheng Li, Zehao Xue, Yingying Wang, Jianfei Cai, Junsong Yuan

This work addresses a novel and challenging problem of estimating the full 3D hand shape and pose from a single RGB image.

3D Hand Pose Estimation

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection

no code implementations 28 Nov 2018 Hongyu Xu, Xutao Lv, Xiaoyu Wang, Zhou Ren, Navaneeth Bodla, Rama Chellappa

The deep regionlets framework consists of a region selection network and a deep regionlet learning module.

Object Detection

Point-to-Point Regression PointNet for 3D Hand Pose Estimation

no code implementations ECCV 2018 Liuhao Ge, Zhou Ren, Junsong Yuan

Convolutional Neural Network (CNN)-based methods for 3D hand pose estimation with depth cameras usually take 2D depth images as input and directly regress the holistic 3D hand pose.

3D Hand Pose Estimation

An Aggressive Genetic Programming Approach for Searching Neural Network Structure Under Computational Constraints

no code implementations 3 Jun 2018 Zhe Li, Xuehan Xiong, Zhou Ren, Ning Zhang, Xiaoyu Wang, Tianbao Yang

In this paper, we study how to design a genetic programming approach for optimizing the structure of a CNN for a given task under limited computational resources yet without imposing strong restrictions on the search space.

Adversarial Attacks and Defences Competition

1 code implementation 31 Mar 2018 Alexey Kurakin, Ian Goodfellow, Samy Bengio, Yinpeng Dong, Fangzhou Liao, Ming Liang, Tianyu Pang, Jun Zhu, Xiaolin Hu, Cihang Xie, Jian-Yu Wang, Zhishuai Zhang, Zhou Ren, Alan Yuille, Sangxia Huang, Yao Zhao, Yuzhe Zhao, Zhonglin Han, Junjiajia Long, Yerkebulan Berdibekov, Takuya Akiba, Seiya Tokui, Motoki Abe

To accelerate research on adversarial examples and robustness of machine learning classifiers, Google Brain organized a NIPS 2017 competition that encouraged researchers to develop new methods to generate adversarial examples as well as to develop new ways to defend against them.

Improving Transferability of Adversarial Examples with Input Diversity

1 code implementation CVPR 2019 Cihang Xie, Zhishuai Zhang, Yuyin Zhou, Song Bai, Jian-Yu Wang, Zhou Ren, Alan Yuille

We hope that our proposed attack strategy can serve as a strong benchmark baseline for evaluating the robustness of networks to adversaries and the effectiveness of different defense methods in the future.

Adversarial Attack Image Classification

Deep Regionlets for Object Detection

no code implementations ECCV 2018 Hongyu Xu, Xutao Lv, Xiaoyu Wang, Zhou Ren, Navaneeth Bodla, Rama Chellappa

The deep regionlets framework consists of a region selection network and a deep regionlet learning module.

Object Detection

Deep Reinforcement Learning-based Image Captioning with Embedding Reward

no code implementations CVPR 2017 Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

The policy network serves as a local guidance by providing the confidence of predicting the next word according to the current state.

Decision Making Image Captioning +1

Multi-Instance Visual-Semantic Embedding

no code implementations 22 Dec 2015 Zhou Ren, Hailin Jin, Zhe Lin, Chen Fang, Alan Yuille

Visual-semantic embedding models have recently been proposed and shown to be effective for image classification and zero-shot learning by mapping images into a continuous semantic label space.

General Classification Image Classification +1

Scene-Domain Active Part Models for Object Representation

no code implementations ICCV 2015 Zhou Ren, Chaohui Wang, Alan L. Yuille

In this paper, we are interested in enhancing the expressivity and robustness of part-based models for object representation, in the common scenario where the training data are based on 2D images.

Viewpoint Estimation

