Search Results for author: Yihui He

Found 16 papers, 7 papers with code

Motion Prediction in Visual Object Tracking

no code implementations1 Jul 2020 Jianren Wang, Yihui He

Although our baseline system is a straightforward combination of standard methods, we obtain state-of-the-art results.

Autonomous Driving motion prediction +4

Epipolar Transformers

1 code implementation CVPR 2020 Yihui He, Rui Yan, Katerina Fragkiadaki, Shoou-I Yu

The intuition is: given a 2D location p in the current view, we would like to first find its corresponding point p' in a neighboring view, and then combine the features at p' with the features at p, thus leading to a 3D-aware feature at p. Inspired by stereo matching, the epipolar transformer leverages epipolar constraints and feature matching to approximate the features at p'.

Ranked #2 on 3D Human Pose Estimation on Human3.6M (using extra training data)

3D Human Pose Estimation 3D Pose Estimation +1

Deep Mixture Density Network for Probabilistic Object Detection

no code implementations24 Nov 2019 Yihui He, Jianren Wang

The covariances help to learn the relationship between the borders, and the mixture components potentially learn different configurations of an occluded part.

Object Detection Object Localization

Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

no code implementations21 Oct 2019 Yihui He, Jianing Qian, Jianren Wang

Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks.

Self-Driving Cars

MoBiNet: A Mobile Binary Network for Image Classification

no code implementations29 Jul 2019 Hai Phan, Dang Huynh, Yihui He, Marios Savvides, Zhiqiang Shen

MobileNet and Binary Neural Networks are two among the most widely used techniques to construct deep learning models for performing a variety of tasks on mobile and embedded platforms. In this paper, we present a simple yet efficient scheme to exploit MobileNet binarization at activation function and model weights.

Binarization Classification +2

Prediction-Tracking-Segmentation

no code implementations5 Apr 2019 Jianren Wang, Yihui He, Xiaobo Wang, Xinjia Yu, Xia Chen

We introduce a prediction driven method for visual tracking and segmentation in videos.

Video Segmentation Video Semantic Segmentation +1

Feature Selective Anchor-Free Module for Single-Shot Object Detection

3 code implementations CVPR 2019 Chenchen Zhu, Yihui He, Marios Savvides

The general concept of the FSAF module is online feature selection applied to the training of multi-level anchor-free branches.

Object Detection

An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition

no code implementations21 Dec 2018 Devesh Walawalkar, Yihui He, Rohit Pillai

In this project, we worked on speech recognition, specifically predicting individual words based on both the video frames and audio.

Lip Reading Speech Recognition

Shift-based Primitives for Efficient Convolutional Neural Networks

no code implementations22 Sep 2018 Huasong Zhong, Xianggen Liu, Yihui He, Yuchun Ma

These three primitives (channel shift, address shift, shortcut shift) can reduce the inference time on GPU while maintains the prediction accuracy.

AMC: AutoML for Model Compression and Acceleration on Mobile Devices

11 code implementations ECCV 2018 Yihui He, Ji Lin, Zhijian Liu, Hanrui Wang, Li-Jia Li, Song Han

Model compression is a critical technique to efficiently deploy neural network models on mobile devices which have limited computation resources and tight power budgets.

Model Compression Neural Architecture Search

Estimated Depth Map Helps Image Classification

no code implementations20 Sep 2017 Yihui He

We build a RGBD dataset based on RGB dataset and do image classification on it.

Classification Depth Estimation +3

Channel Pruning for Accelerating Very Deep Neural Networks

1 code implementation ICCV 2017 Yihui He, Xiangyu Zhang, Jian Sun

In this paper, we introduce a new channel pruning method to accelerate very deep convolutional neural networks. Given a trained CNN model, we propose an iterative two-step algorithm to effectively prune each layer, by a LASSO regression based channel selection and least square reconstruction.

Vehicle Traffic Driven Camera Placement for Better Metropolis Security Surveillance

1 code implementation1 Apr 2017 Yihui He, Xiaobo Ma, Xiapu Luo, Jianfeng Li, Mengchen Zhao, Bo An, Xiaohong Guan

Security surveillance is one of the most important issues in smart cities, especially in an era of terrorism.

Decision Making

Single Image Super-resolution via a Lightweight Residual Convolutional Neural Network

no code implementations23 Mar 2017 Yudong Liang, Ze Yang, Kai Zhang, Yihui He, Jinjun Wang, Nanning Zheng

To tackle with the second problem, a lightweight CNN architecture which has carefully designed width, depth and skip connections was proposed.

Image Super-Resolution SSIM

Cannot find the paper you are looking for? You can Submit a new open access paper.