Search Results for author: Chunhua Shen

Found 370 papers, 160 papers with code

Adversarial Learning with Local Coordinate Coding

no code implementations • ICML 2018 • Jiezhang Cao, Yong Guo, Qingyao Wu, Chunhua Shen, Junzhou Huang, Mingkui Tan

Generative adversarial networks (GANs) aim to generate realistic data from some prior distribution (e. g., Gaussian noises).

Paper
Add Code

Adaptive Importance Learning for Improving Lightweight Image Super-resolution Network

no code implementations • 5 Jun 2018 • Lei Zhang, Peng Wang, Chunhua Shen, Lingqiao Liu, Wei Wei, Yanning Zhang, Anton Van Den Hengel

In this study, we revisit this problem from an orthog- onal view, and propose a novel learning strategy to maxi- mize the pixel-wise fitting capacity of a given lightweight network architecture.

Image Super-Resolution

Paper
Add Code

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

no code implementations • 1 Nov 2017 • Yu Chen, Chunhua Shen, Hao Chen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang

In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity.

Pose Estimation

Paper
Add Code

Monocular Depth Estimation with Augmented Ordinal Depth Relationships

no code implementations • 2 Jun 2018 • Yuanzhouhan Cao, Tianqi Zhao, Ke Xian, Chunhua Shen, Zhiguo Cao, Shugong Xu

In this paper, we propose to improve the performance of metric depth estimation with relative depths collected from stereo movie videos using existing stereo matching algorithm.

Depth Prediction Monocular Depth Estimation +2

Paper
Add Code

Multi-label Learning Based Deep Transfer Neural Network for Facial Attribute Classification

no code implementations • 3 May 2018 • Ni Zhuang, Yan Yan, Si Chen, Hanzi Wang, Chunhua Shen

To address the above problem, we propose a novel deep transfer neural network method based on multi-label learning for facial attribute classification, termed FMTNet, which consists of three sub-networks: the Face detection Network (FNet), the Multi-label learning Network (MNet) and the Transfer learning Network (TNet).

Attribute Classification +6

Paper
Add Code

Salient Object Detection by Lossless Feature Reflection

no code implementations • 19 Feb 2018 • Pingping Zhang, Wei Liu, Huchuan Lu, Chunhua Shen

Inspired by the intrinsic reflection of natural images, in this paper we propose a novel feature learning framework for large-scale salient object detection.

Object object-detection +3

Paper
Add Code

HyperFusion-Net: Densely Reflective Fusion for Salient Object Detection

no code implementations • 14 Apr 2018 • Pingping Zhang, Huchuan Lu, Chunhua Shen

Salient object detection (SOD), which aims to find the most important region of interest and segment the relevant object/item in that area, is an important yet challenging vision task.

object-detection RGB Salient Object Detection +1

Paper
Add Code

VITAL: VIsual Tracking via Adversarial Learning

no code implementations • CVPR 2018 • Yibing Song, Chao Ma, Xiaohe Wu, Lijun Gong, Linchao Bao, WangMeng Zuo, Chunhua Shen, Rynson Lau, Ming-Hsuan Yang

To augment positive samples, we use a generative network to randomly generate masks, which are applied to adaptively dropout input features to capture a variety of appearance changes.

General Classification Visual Tracking

Paper
Add Code

Visual Question Answering with Memory-Augmented Networks

no code implementations • CVPR 2018 • Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid

In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.

Question Answering Visual Question Answering

Paper
Add Code

Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation

no code implementations • 7 Mar 2018 • Tianyi Zhang, Guosheng Lin, Jianfei Cai, Tong Shen, Chunhua Shen, Alex C. Kot

In our work, we focus on the weakly supervised semantic segmentation with image label annotations.

Image Captioning Segmentation +2

Paper
Add Code

Non-rigid Object Tracking via Deep Multi-scale Spatial-temporal Discriminative Saliency Maps

no code implementations • 22 Feb 2018 • Pingping Zhang, Wei Liu, Dong Wang, Yinjie Lei, Hongyu Wang, Chunhua Shen, Huchuan Lu

Extensive experiments demonstrate that the proposed algorithm achieves competitive performance in both saliency detection and visual tracking, especially outperforming other related trackers on the non-rigid object tracking datasets.

Object Object Tracking +2

Paper
Add Code

Agile Amulet: Real-Time Salient Object Detection with Contextual Attention

no code implementations • 20 Feb 2018 • Pingping Zhang, Luyao Wang, Dong Wang, Huchuan Lu, Chunhua Shen

This paper proposes an Agile Aggregating Multi-Level feaTure framework (Agile Amulet) for salient object detection.

object-detection RGB Salient Object Detection +1

Paper
Add Code

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression

no code implementations • 25 Dec 2017 • Guanjun Guo, Hanzi Wang, Chunhua Shen, Yan Yan, Hong-Yuan Mark Liao

The deep CNN model is then designed to extract features from several image cropping datasets, upon which the cropping bounding boxes are predicted by the proposed CCR method.

Image Cropping regression

Paper
Add Code

Real-time Semantic Image Segmentation via Spatial Sparsity

no code implementations • 1 Dec 2017 • Zifeng Wu, Chunhua Shen, Anton Van Den Hengel

We propose an approach to semantic (image) segmentation that reduces the computational costs by a factor of 25 with limited impact on the quality of results.

Image Segmentation Segmentation +1

Paper
Add Code

Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations • 21 Nov 2017 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Despite significant progress in a variety of vision-and-language problems, developing a method capable of asking intelligent, goal-oriented questions about images is proven to be an inscrutable challenge.

Informativeness Question Generation +2

Paper
Add Code

Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning

no code implementations • CVPR 2018 • Qi Wu, Peng Wang, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The Visual Dialogue task requires an agent to engage in a conversation about an image with a human.

Ranked #4 on Visual Dialog on VisDial v0.9 val

Question Answering Visual Dialog +1

Paper
Add Code

Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement

no code implementations • 19 Nov 2017 • Jun-Jie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu

These comments can be a description of the image, or some objects, attributes, scenes in it, which are normally used as the user-provided tags.

Retrieval TAG

Paper
Add Code

Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries

no code implementations • CVPR 2018 • Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

To this end we propose a unified framework, the ParalleL AttentioN (PLAN) network, to discover the object in an image that is being referred to in variable length natural expression descriptions, from short phrases query to long multi-round dialogs.

Object Object Discovery +2

Paper
Add Code

Adversarial Generation of Training Examples: Applications to Moving Vehicle License Plate Recognition

no code implementations • 11 Jul 2017 • Xinlong Wang, Zhipeng Man, Mingyu You, Chunhua Shen

Our experimental results on a few data sets demonstrate the effectiveness of using GAN images: an improvement of 7. 5% over a strong baseline with moderate-sized real data being available.

Image Generation License Plate Recognition

Paper
Add Code

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks

no code implementations • 26 Sep 2017 • Hui Li, Peng Wang, Chunhua Shen

In contrast to existing approaches which take license plate detection and recognition as two separate tasks and settle them step by step, our method jointly solves these two tasks by a single network.

License Plate Detection

Paper
Add Code

Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks

no code implementations • 8 May 2016 • Yuanzhouhan Cao, Zifeng Wu, Chunhua Shen

Then we train fully convolutional deep residual networks to predict the depth label of each pixel.

Depth Estimation General Classification +1

Paper
Add Code

FVQA: Fact-based Visual Question Answering

no code implementations • 17 Jun 2016 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We evaluate several baseline models on the FVQA dataset, and describe a novel model which is capable of reasoning about an image on the basis of supporting facts.

Ranked #2 on Visual Question Answering (VQA) on F-VQA

Common Sense Reasoning Question Answering +1

Paper
Add Code

Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation

no code implementations • 25 May 2017 • Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid

Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Add Code

Beyond Low Rank: A Data-Adaptive Tensor Completion Method

no code implementations • 3 Aug 2017 • Lei Zhang, Wei Wei, Qinfeng Shi, Chunhua Shen, Anton Van Den Hengel, Yanning Zhang

The prior for the non-low-rank structure is established based on a mixture of Gaussians which is shown to be flexible enough, and powerful enough, to inform the completion process for a variety of real tensor data.

Paper
Add Code

Relative Depth Order Estimation Using Multi-scale Densely Connected Convolutional Networks

no code implementations • 25 Jul 2017 • Ruoxi Deng, Tianqi Zhao, Chunhua Shen, Shengjun Liu

We study the problem of estimating the relative depth order of point pairs in a monocular image.

Paper
Add Code

Unsupervised Object Discovery and Co-Localization by Deep Descriptor Transforming

no code implementations • 20 Jul 2017 • Xiu-Shen Wei, Chen-Lin Zhang, Jianxin Wu, Chunhua Shen, Zhi-Hua Zhou

Reusable model design becomes desirable with the rapid expansion of computer vision and machine learning applications.

Ranked #11 on Single-object discovery on COCO_20k

Object object-detection +4

Paper
Add Code

Visually Aligned Word Embeddings for Improving Zero-shot Learning

no code implementations • 18 Jul 2017 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

To overcome this visual-semantic discrepancy, this work proposes an objective function to re-align the distributed word embeddings with visual information by learning a neural network to map it into a new representation called visually aligned word embedding (VAWE).

Paper
Add Code

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks

no code implementations • ICCV 2017 • Hui Li, Peng Wang, Chunhua Shen

In this work, we jointly address the problem of text detection and recognition in natural scene images based on convolutional recurrent neural networks.

Image Cropping Text Detection +1

Paper
Add Code

TasselNet: Counting maize tassels in the wild via local counts regression network

no code implementations • 7 Jul 2017 • Hao Lu, Zhiguo Cao, Yang Xiao, Bohan Zhuang, Chunhua Shen

To our knowledge, this is the first time that a plant-related counting problem is considered using computer vision technologies under unconstrained field-based environment.

Plant Phenotyping regression

Paper
Add Code

Care about you: towards large-scale human-centric visual relationship detection

no code implementations • 28 May 2017 • Bohan Zhuang, Qi Wu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

In addressing this problem we first construct a large-scale human-centric visual relationship detection dataset (HCVRD), which provides many more types of relationship annotation (nearly 10K categories) than the previous released datasets.

Human-Object Interaction Detection Relationship Detection +1

Paper
Add Code

Deep Descriptor Transforming for Image Co-Localization

no code implementations • 8 May 2017 • Xiu-Shen Wei, Chen-Lin Zhang, Yao Li, Chen-Wei Xie, Jianxin Wu, Chunhua Shen, Zhi-Hua Zhou

Reusable model design becomes desirable with the rapid expansion of machine learning applications.

Paper
Add Code

Exploring Context with Deep Structured models for Semantic Segmentation

no code implementations • 10 Mar 2016 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid

We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions.

Image Segmentation Segmentation +1

Paper
Add Code

Towards Context-aware Interaction Recognition

no code implementations • 18 Mar 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

Recognizing how objects interact with each other is a crucial task in visual recognition.

Paper
Add Code

Robust Guided Image Filtering

no code implementations • 28 Mar 2017 • Wei Liu, Xiaogang Chen, Chunhua Shen, Jingyi Yu, Qiang Wu, Jie Yang

In this paper, we propose a general framework for Robust Guided Image Filtering (RGIF), which contains a data term and a smoothness term, to solve the two issues mentioned above.

Paper
Add Code

Structured Learning of Tree Potentials in CRF for Image Segmentation

no code implementations • 26 Mar 2017 • Fayao Liu, Guosheng Lin, Ruizhi Qiao, Chunhua Shen

In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs.

Image Segmentation Semantic Segmentation

Paper
Add Code

Multi-Label Image Classification with Regional Latent Semantic Dependencies

no code implementations • 4 Dec 2016 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu

Recent state-of-the-art approaches to multi-label image classification exploit the label dependencies in an image, at global level, largely improving the labeling capacity.

Classification General Classification +1

Paper
Add Code

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation

no code implementations • 25 Jan 2017 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

Semantic image segmentation is a fundamental task in image understanding.

Image Segmentation Segmentation +1

Paper
Add Code

Deep Learning Features at Scale for Visual Place Recognition

no code implementations • 18 Jan 2017 • Zetao Chen, Adam Jacobson, Niko Sunderhauf, Ben Upcroft, Lingqiao Liu, Chunhua Shen, Ian Reid, Michael Milford

The success of deep learning techniques in the computer vision domain has triggered a range of initial investigations into their utility for visual place recognition, all using generic features from networks that were trained for other types of recognition tasks.

Visual Place Recognition

Paper
Add Code

Compositional Model based Fisher Vector Coding for Image Classification

1 code implementation • 16 Jan 2016 • Lingqiao Liu, Peng Wang, Chunhua Shen, Lei Wang, Anton Van Den Hengel, Chao Wang, Heng Tao Shen

To handle this limitation, in this paper we break the convention which assumes that a local feature is drawn from one of few Gaussian distributions.

Classification General Classification +1

Paper
Code

Cross-convolutional-layer Pooling for Image Recognition

no code implementations • 4 Oct 2015 • Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

Most of these studies adopt activations from a single DCNN layer, usually the fully-connected layer, as the image representation.

General Classification Image Classification

Paper
Add Code

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

no code implementations • 9 Mar 2016 • Qi Wu, Chunhua Shen, Anton Van Den Hengel, Peng Wang, Anthony Dick

Much recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

Ranked #9 on Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 open ended

General Knowledge Image Captioning +2

Paper
Add Code

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions

no code implementations • CVPR 2017 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel

To train a method to perform even one of these operations accurately from {image, question, answer} tuples would be challenging, but to aim to achieve them all with a limited set of such training data seems ambitious at best.

BIG-bench Machine Learning Question Answering +1

Paper
Add Code

From Motion Blur to Motion Flow: a Deep Learning Solution for Removing Heterogeneous Motion Blur

no code implementations • CVPR 2017 • Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton Van Den Hengel, Qinfeng Shi

The critical observation underpinning our approach is thus that learning the motion flow instead allows the model to focus on the cause of the blur, irrespective of the image content.

Paper
Add Code

Sequential Person Recognition in Photo Albums with a Recurrent Network

no code implementations • CVPR 2017 • Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

In this work, we propose to model the relational information between people as a sequence prediction task.

Person Recognition

Paper
Add Code

Attend in groups: a weakly-supervised deep learning framework for learning from web data

no code implementations • CVPR 2017 • Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition.

Paper
Add Code

Exploiting Depth from Single Monocular Images for Object Detection and Semantic Segmentation

no code implementations • 6 Oct 2016 • Yuanzhouhan Cao, Chunhua Shen, Heng Tao Shen

Augmenting RGB data with measured depth has been shown to improve the performance of a range of tasks in computer vision including object detection and semantic segmentation.

Depth Estimation Object +4

Paper
Add Code

Fast Training of Triplet-based Deep Binary Embedding Networks

no code implementations • CVPR 2016 • Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian Reid

To solve the first stage, we design a large-scale high-order binary codes inference algorithm to reduce the high-order objective to a standard binary quadratic problem such that graph cuts can be used to efficiently infer the binary code which serve as the label of each training datum.

Image Retrieval Multi-Label Classification +1

Paper
Add Code

Image Co-localization by Mimicking a Good Detector's Confidence Score Distribution

no code implementations • 15 Mar 2016 • Yao Li, Linqiao Liu, Chunhua Shen, Anton Van Den Hengel

More specifically, we observe that given a set of object proposals extracted from an image that contains the object of interest, an accurate strongly supervised object detector should give high scores to only a small minority of proposals, and low scores to most of them.

Object

Paper
Add Code

Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps

no code implementations • 22 Jun 2016 • Jiewei Cao, Lingqiao Liu, Peng Wang, Zi Huang, Chunhua Shen, Heng Tao Shen

Instance retrieval requires one to search for images that contain a particular object within a large corpus.

Retrieval

Paper
Add Code

PersonNet: Person Re-identification with Deep Convolutional Neural Networks

1 code implementation • 27 Jan 2016 • Lin Wu, Chunhua Shen, Anton Van Den Hengel

In this paper, we propose a deep end-to-end neu- ral network to simultaneously learn high-level features and a corresponding similarity metric for person re-identification.

Person Re-Identification

Paper
Code

Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

no code implementations • 6 Jun 2016 • Lin Wu, Chunhua Shen, Anton Van Den Hengel

In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and corresponding similarity metric for video-based person re-identification.

Metric Learning Time Series +2

Paper
Add Code

Pushing the Limits of Deep CNNs for Pedestrian Detection

no code implementations • 15 Mar 2016 • Qichang Hu, Peng Wang, Chunhua Shen, Anton Van Den Hengel, Fatih Porikli

In this work, we show that by re-using the convolutional feature maps (CFMs) of a deep convolutional neural network (DCNN) model as image features to train an ensemble of boosted decision models, we are able to achieve the best reported accuracy without using specially designed learning algorithms.

Occlusion Handling Optical Flow Estimation +1

Paper
Add Code

Deep Linear Discriminant Analysis on Fisher Networks: A Hybrid Architecture for Person Re-identification

no code implementations • 6 Jun 2016 • Lin Wu, Chunhua Shen, Anton Van Den Hengel

Person re-identification is to seek a correct match for a person of interest across views among a large number of imposters.

Person Re-Identification

Paper
Add Code

Efficient piecewise training of deep structured models for semantic segmentation

no code implementations • CVPR 2016 • Guosheng Lin, Chunhua Shen, Anton van dan Hengel, Ian Reid

Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs).

Ranked #54 on Semantic Segmentation on PASCAL Context

Image Segmentation Segmentation +1

Paper
Add Code

Structured learning of metric ensembles with application to person re-identification

no code implementations • 27 Nov 2015 • Sakrapee Paisitkriangkrai, Lin Wu, Chunhua Shen, Anton Van Den Hengel

However, seeking an optimal combination of visual features which is generic yet adaptive to different benchmarks is a unsoved problem, and metric learning models easily get over-fitted due to the scarcity of training data in person re-identification.

Metric Learning Person Re-Identification

Paper
Add Code

Bridging Category-level and Instance-level Semantic Image Segmentation

no code implementations • 23 May 2016 • Zifeng Wu, Chunhua Shen, Anton Van Den Hengel

(iii) As the performance of semantic category segmentation has a significant impact on the instance-level segmentation, which is the second step of our approach, we train fully convolutional residual networks to achieve the best semantic category segmentation accuracy.

Ranked #53 on Semantic Segmentation on PASCAL Context

Image Segmentation Instance Segmentation +2

Paper
Add Code

Large-scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications

no code implementations • 27 Nov 2014 • Peng Wang, Chunhua Shen, Anton Van Den Hengel, Philip H. S. Torr

Two standard relaxation methods are widely used for solving general BQPs--spectral methods and semidefinite programming (SDP), each with their own advantages and disadvantages.

Clustering Image Segmentation +2

Paper
Add Code

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun

Crowd counting is an important task in computer vision, which has many applications in video surveillance.

Attribute Crowd Counting

Paper
Add Code

High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks

no code implementations • 15 Apr 2016 • Zifeng Wu, Chunhua Shen, Anton Van Den Hengel

(i) First, we evaluate different variations of a fully convolutional residual network so as to find the best configuration, including the number of layers, the resolution of feature maps, and the size of field-of-view.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources

no code implementations • CVPR 2016 • Qi Wu, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

Priming a recurrent neural network with this combined information, and the submitted question, leads to a very flexible visual question answering approach.

General Knowledge Question Answering +1

Paper
Add Code

Less is more: zero-shot learning from online textual documents with noise suppression

no code implementations • CVPR 2016 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

Classifying a visual concept merely from its associated online textual source, such as a Wikipedia article, is an attractive research topic in zero-shot learning because it alleviates the burden of manually collecting semantic attributes.

Zero-Shot Learning

Paper
Add Code

Structured Learning of Binary Codes with Column Generation

no code implementations • 22 Feb 2016 • Guosheng Lin, Fayao Liu, Chunhua Shen, Jianxin Wu, Heng Tao Shen

Our column generation based method can be further generalized from the triplet loss to a general structured learning based framework that allows one to directly optimize multivariate performance measures.

Image Retrieval Information Retrieval +1

Paper
Add Code

Hi Detector, What's Wrong with that Object? Identifying Irregular Object From Images by Modelling the Detection Score Distribution

no code implementations • 14 Feb 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel, Heng Tao Shen

To address this problem, we propose a novel approach by inspecting the distribution of the detection scores at multiple image regions based on the detector trained from the "regular object" and "other objects".

Gaussian Processes Object

Paper
Add Code

Order-aware Convolutional Pooling for Video Based Action Recognition

no code implementations • 31 Jan 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Heng Tao Shen

Most video based action recognition approaches create the video-level representation by temporally pooling the features extracted at each frame.

Action Recognition Temporal Action Localization

Paper
Add Code

Discriminative Training of Deep Fully-connected Continuous CRF with Task-specific Loss

no code implementations • 28 Jan 2016 • Fayao Liu, Guosheng Lin, Chunhua Shen

We exemplify the usefulness of the proposed model on multi-class semantic labelling (discrete) and the robust depth estimation (continuous) problems.

Depth Estimation Multi-class Classification

Paper
Add Code

Data Driven Robust Image Guided Depth Map Restoration

no code implementations • 26 Dec 2015 • Wei Liu, Yun Gu, Chunhua Shen, Xiaogang Chen, Qiang Wu, Jie Yang

Depth maps captured by modern depth cameras such as Kinect and Time-of-Flight (ToF) are usually contaminated by missing data, noises and suffer from being of low resolution.

Paper
Add Code

Explicit Knowledge-based Reasoning for Visual Question Answering

no code implementations • 9 Nov 2015 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We describe a method for visual question answering which is capable of reasoning about contents of an image on the basis of information extracted from a large-scale knowledge base.

Question Answering Visual Question Answering

Paper
Add Code

Fast detection of multiple objects in traffic scenes with a common detection framework

no code implementations • 12 Oct 2015 • Qichang Hu, Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel, Fatih Porikli

The proposed framework consists of a dense feature extractor and detectors of three important classes.

Traffic Sign Detection

Paper
Add Code

Efficient Semidefinite Branch-and-Cut for MAP-MRF Inference

no code implementations • 20 Apr 2014 • Peng Wang, Chunhua Shen, Anton Van Den Hengel, Philip Torr

We propose a Branch-and-Cut (B&C) method for solving general MAP-MRF inference problems.

Paper
Add Code

Deeply Learning the Messages in Message Passing Inference

no code implementations • NeurIPS 2015 • Guosheng Lin, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The network output dimension for message estimation is the same as the number of classes, in contrast to the network output for general CNN potential functions in CRFs, which is exponential in the order of the potentials.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Online Metric-Weighted Linear Representations for Robust Visual Tracking

no code implementations • 21 Jul 2015 • Xi Li, Chunhua Shen, Anthony Dick, Zhongfei Zhang, Yueting Zhuang

Object identification results for an entire video sequence are achieved by systematically combining the tracking information and visual recognition at each frame.

Metric Learning Object +2

Paper
Add Code

Pedestrian Detection with Spatially Pooled Features and Structured Ensemble Learning

no code implementations • 18 Sep 2014 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of our approach, and we show that it is possible to train state-of-the-art pedestrian detectors using the proposed structured ensemble learning method with spatially pooled features.

Ensemble Learning object-detection +2

Paper
Add Code

Extrinsic Methods for Coding and Dictionary Learning on Grassmann Manifolds

no code implementations • 31 Jan 2014 • Mehrtash Harandi, Richard Hartley, Chunhua Shen, Brian Lovell, Conrad Sanderson

With the aim of building a bridge between the two realms, we address the problem of sparse coding and dictionary learning over the space of linear subspaces, which form Riemannian structures known as Grassmann manifolds.

Action Recognition Classification +6

Paper
Add Code

Unsupervised Feature Learning for Dense Correspondences across Scenes

1 code implementation • 4 Jan 2015 • Chao Zhang, Chunhua Shen, Tingzhi Shen

We experimentally demonstrate that the learned features, together with our matching model, outperforms state-of-the-art methods such as the SIFT flow, coherency sensitive hashing and the recent deformable spatial pyramid matching methods both in terms of accuracy and computation efficiency.

Dictionary Learning

Paper
Code

Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition

no code implementations • 20 Apr 2015 • Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton von den Hengel

The introduction of low-cost RGB-D sensors has promoted the research in skeleton-based human action recognition.

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Temporal Pyramid Pooling Based Convolutional Neural Networks for Action Recognition

no code implementations • 4 Mar 2015 • Peng Wang, Yuanzhouhan Cao, Chunhua Shen, Lingqiao Liu, Heng Tao Shen

One challenge is that video contains a varying number of frames which is incompatible to the standard input format of CNNs.

Action Recognition Image Classification +1

Paper
Add Code

Mid-level Deep Pattern Mining

no code implementations • CVPR 2015 • Yao Li, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

We apply our approach to scene and object classification tasks, and demonstrate that our approach outperforms all previous works on mid-level visual element discovery by a sizeable margin with far fewer elements being used.

Paper
Add Code

Efficient SDP Inference for Fully-connected CRFs Based on Low-rank Decomposition

no code implementations • CVPR 2015 • Peng Wang, Chunhua Shen, Anton Van Den Hengel

Conditional Random Fields (CRF) have been widely used in a variety of computer vision tasks.

Paper
Add Code

CRF Learning with CNN Features for Image Segmentation

no code implementations • 28 Mar 2015 • Fayao Liu, Guosheng Lin, Chunhua Shen

The deep CNN is trained on the ImageNet dataset and transferred to image segmentations here for constructing potentials of superpixels.

Image Segmentation Segmentation +2

Paper
Add Code

Learning to rank in person re-identification with metric ensembles

no code implementations • CVPR 2015 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

We propose an effective structured learning based approach to the problem of person re-identification which outperforms the current state-of-the-art on most benchmark data sets evaluated.

Learning-To-Rank Person Re-Identification

Paper
Add Code

Supervised Hashing Using Graph Cuts and Boosted Decision Trees

1 code implementation • 24 Aug 2014 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel

The proposed framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.

Descriptive Image Retrieval +1

Paper
Code

Deep Convolutional Neural Fields for Depth Estimation from a Single Image

no code implementations • CVPR 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin

Therefore, we in this paper present a deep convolutional neural field model for estimating depths from a single image, aiming to jointly explore the capacity of deep CNN and continuous CRF.

Depth Estimation

Paper
Add Code

Hashing on Nonlinear Manifolds

no code implementations • 2 Dec 2014 • Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, Zhenmin Tang, Heng Tao Shen

In addition, a supervised inductive manifold hashing framework is developed by incorporating the label information, which is shown to greatly advance the semantic retrieval performance.

Image Classification Quantization +2

Paper
Add Code

Worst-Case Linear Discriminant Analysis as Scalable Semidefinite Feasibility Problems

no code implementations • 27 Nov 2014 • Hui Li, Chunhua Shen, Anton Van Den Hengel, Qinfeng Shi

Our method is also much faster and more scalable than standard interior-point SDP solvers based WLDA.

Dimensionality Reduction General Classification

Paper
Add Code

Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors

no code implementations • NeurIPS 2014 • Lingqiao Liu, Chunhua Shen, Lei Wang, Anton Van Den Hengel, Chao Wang

By calculating the gradient vector of the proposed model, we derive a new fisher vector encoding strategy, termed Sparse Coding based Fisher Vector Coding (SCFVC).

Fine-Grained Image Classification General Classification +2

Paper
Add Code

Face Identification with Second-Order Pooling

no code implementations • 26 Jun 2014 • Fumin Shen, Chunhua Shen, Heng Tao Shen

Spatial pyramid pooling of features encoded by an over-complete dictionary has been the key component of many state-of-the-art image classification systems.

Face Identification Face Recognition +4

Paper
Add Code

Face Image Classification by Pooling Raw Features

no code implementations • 26 Jun 2014 • Fumin Shen, Chunhua Shen, Heng Tao Shen

We propose a very simple, efficient yet surprisingly effective feature extraction method for face recognition (about 20 lines of Matlab code), which is mainly inspired by spatial pyramid pooling in generic image classification.

Classification Face Recognition +2

Paper
Add Code

A Computational Model of the Short-Cut Rule for 2D Shape Decomposition

no code implementations • 7 Sep 2014 • Lei Luo, Chunhua Shen, Xinwang Liu, Chunyuan Zhang

We propose and implement a computational model for the short-cut rule and apply it to the problem of shape decomposition.

Paper
Add Code

Optimizing Ranking Measures for Compact Binary Code Learning

no code implementations • 4 Jul 2014 • Guosheng Lin, Chunhua Shen, Jianxin Wu

Hashing has proven a valuable tool for large-scale information retrieval.

Image Retrieval Information Retrieval +1

Paper
Add Code

Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features

no code implementations • 3 Jul 2014 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

The combination of these factors leads to a pedestrian detector which outperforms all competitors on all of the standard benchmark datasets.

Pedestrian Detection

Paper
Add Code

Fast Supervised Hashing with Decision Trees for High-Dimensional Data

1 code implementation • CVPR 2014 • Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, David Suter

Here we propose to use boosted decision trees for achieving non-linearity in hashing, which are fast to train and evaluate, hence more suitable for hashing with high dimensional data.

Retrieval Vocal Bursts Intensity Prediction

Paper
Code

Learning Deep Convolutional Features for MRI Based Alzheimer's Disease Classification

no code implementations • 13 Apr 2014 • Fayao Liu, Chunhua Shen

In this work, we propose to learn deep convolutional image features using unsupervised and supervised learning.

BIG-bench Machine Learning Classification +2

Paper
Add Code

Constraint Reduction using Marginal Polytope Diagrams for MAP LP Relaxations

no code implementations • 17 Dec 2013 • Zhen Zhang, Qinfeng Shi, Yanning Zhang, Chunhua Shen, Anton Van Den Hengel

We show that using Marginal Polytope Diagrams allows the number of constraints to be reduced without loosening the LP relaxations.

Paper
Add Code

Large-margin Learning of Compact Binary Image Encodings

no code implementations • 26 Feb 2014 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

The use of high-dimensional features has become a normal practice in many computer vision applications.

Content-Based Image Retrieval General Classification +2

Paper
Add Code

Asymmetric Pruning for Learning Cascade Detectors

no code implementations • 25 Mar 2013 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

In this work, we present a new approach to train an effective node classifier in a cascade detector.

object-detection Real-Time Object Detection

Paper
Add Code

Efficient Semidefinite Spectral Clustering via Lagrange Duality

no code implementations • 22 Feb 2014 • Yan Yan, Chunhua Shen, Hanzi Wang

constraint for spectral clustering.

Clustering

Paper
Add Code

StructBoost: Boosting Methods for Predicting Structured Output Variables

no code implementations • 14 Feb 2013 • Chunhua Shen, Guosheng Lin, Anton Van Den Hengel

Inspired by structured support vector machines (SSVM), here we propose a new boosting algorithm for structured output prediction, which we refer to as StructBoost.

Image Segmentation Multi-class Classification +2

Paper
Add Code

A Generalized Probabilistic Framework for Compact Codebook Creation

no code implementations • 30 Jan 2014 • Lingqiao Liu, Lei Wang, Chunhua Shen

In the third criterion, which shows the best merging performance, we propose a max-margin-based parameter estimation method and apply it with multinomial distribution.

Paper
Add Code

From Kernel Machines to Ensemble Learning

no code implementations • 4 Jan 2014 • Chunhua Shen, Fayao Liu

This finding not only enables us to design new ensemble learning methods directly from kernel methods, but also makes it possible to take advantage of those highly-optimized fast linear SVM solvers for ensemble learning.

Ensemble Learning Translation

Paper
Add Code

Context-Aware Hypergraph Construction for Robust Spectral Clustering

no code implementations • 4 Jan 2014 • Xi Li, Weiming Hu, Chunhua Shen, Anthony Dick, Zhongfei Zhang

Using both CAHSM and DHPC, a robust spectral clustering algorithm is developed.

Clustering hypergraph partitioning

Paper
Add Code

Fast Training of Effective Multi-class Boosting Using Coordinate Descent Optimization

no code implementations • 23 Nov 2013 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, David Suter

Different from most existing multi-class boosting methods, which use the same set of weak learners for all the classes, we train class specified weak learners (i. e., each class has a different set of weak learners).

Multi-class Classification

Paper
Add Code

Contextual Hypergraph Modelling for Salient Object Detection

no code implementations • 22 Oct 2013 • Xi Li, Yao Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions.

Object object-detection +2

Paper
Add Code

Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution

no code implementations • 18 Oct 2013 • Mehrtash Harandi, Conrad Sanderson, Chunhua Shen, Brian C. Lovell

Recent advances in computer vision and machine learning suggest that a wide range of problems can be addressed more appropriately by considering non-Euclidean geometry.

Action Recognition Dictionary Learning +5

Paper
Add Code

Online Unsupervised Feature Learning for Visual Tracking

no code implementations • 7 Oct 2013 • Fayao Liu, Chunhua Shen, Ian Reid, Anton Van Den Hengel

Feature encoding with respect to an over-complete dictionary learned by unsupervised methods, followed by spatial pyramid pooling, and linear classification, has exhibited powerful strength in various vision applications.

Dictionary Learning Visual Tracking

Paper
Add Code

Efficient pedestrian detection by directly optimize the partial area under the ROC curve

no code implementations • 3 Oct 2013 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

We propose a novel ensemble learning method which achieves a maximal detection rate at a user-defined range of false positive rates by directly optimizing the partial AUC using structured learning.

Ensemble Learning object-detection +2

Paper
Add Code

Multiple Kernel Learning in the Primal for Multi-modal Alzheimer's Disease Classification

no code implementations • 3 Oct 2013 • Fayao Liu, Luping Zhou, Chunhua Shen, Jianping Yin

In this work, we propose a novel multiple kernel learning framework to combine multi-modal features for AD classification, which is scalable and easy to implement.

General Classification

Paper
Add Code

Generic Image Classification Approaches Excel on Face Recognition

no code implementations • 22 Sep 2013 • Fumin Shen, Chunhua Shen

The main finding of this work is that the standard image classification pipeline, which consists of dictionary learning, feature encoding, spatial pyramid pooling and linear classification, outperforms all state-of-the-art face recognition methods on the tested benchmark datasets (we have tested on AR, Extended Yale B, the challenging FERET, and LFW-a datasets).

Classification Dictionary Learning +3

Paper
Add Code

Characterness: An Indicator of Text in the Wild

no code implementations • 25 Sep 2013 • Yao Li, Wenjing Jia, Chunhua Shen, Anton Van Den Hengel

In order to measure the characterness we develop three novel cues that are tailored for character detection, and a Bayesian method for their integration.

Saliency Detection Scene Text Detection +1

Paper
Add Code

A General Two-Step Approach to Learning-Based Hashing

no code implementations • 7 Sep 2013 • Guosheng Lin, Chunhua Shen, David Suter, Anton Van Den Hengel

This framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.

Vocal Bursts Valence Prediction

Paper
Add Code

A scalable stage-wise approach to large-margin multi-class loss based boosting

no code implementations • 21 Jul 2013 • Sakrapee Paisitkriangkrai, Chunhua Shen, Anton Van Den Hengel

In this work, we propose a scalable and simple stage-wise multi-class boosting method, which also directly maximizes the multi-class margin.

Classification General Classification +1

Paper
Add Code

Fast Approximate L_infty Minimization: Speeding Up Robust Regression

no code implementations • 4 Apr 2013 • Fumin Shen, Chunhua Shen, Rhys Hill, Anton Van Den Hengel, Zhenmin Tang

Minimization of the $L_\infty$ norm, which can be viewed as approximately solving the non-convex least median estimation problem, is a powerful method for outlier removal and hence robust regression.

regression

Paper
Add Code

A Fast Semidefinite Approach to Solving Binary Quadratic Problems

1 code implementation • CVPR 2013 • Peng Wang, Chunhua Shen, Anton Van Den Hengel

Second, compared with conventional SDP methods, the new SDP formulation leads to a significantly more efficient and scalable dual optimization approach, which has the same degree of complexity as spectral methods.

Clustering Image Segmentation +2

Paper
Code

Inductive Hashing on Manifolds

no code implementations • CVPR 2013 • Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, Zhenmin Tang

We particularly show that hashing on the basis of t-SNE .

Paper
Add Code

Deep attention-based classification network for robust depth prediction

1 code implementation • 11 Jul 2018 • Ruibo Li, Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Lingxiao Hang

However, robust depth prediction suffers from two challenging problems: a) How to extract more discriminative features for different scenes (compared to a single scene)?

Classification Deep Attention +5

Paper
Code

Troy: Give Attention to Saliency and for Saliency

no code implementations • 4 Aug 2018 • Pingping Zhang, Huchuan Lu, Chunhua Shen

In addition, our work has text overlap with arXiv:1804. 06242, arXiv:1705. 00938 by other authors.

Paper
Add Code

Training Compact Neural Networks with Binary Weights and Low Precision Activations

no code implementations • 8 Aug 2018 • Bohan Zhuang, Chunhua Shen, Ian Reid

In this paper, we propose to train a network with binary weights and low-bitwidth activations, designed especially for mobile devices with limited power consumption.

Paper
Add Code

Towards Effective Deep Embedding for Zero-Shot Learning

no code implementations • 30 Aug 2018 • Lei Zhang, Peng Wang, Lingqiao Liu, Chunhua Shen, Wei Wei, Yannning Zhang, Anton Van Den Hengel

Towards this goal, we present a simple but effective two-branch network to simultaneously map semantic descriptions and visual samples into a joint space, on which visual embeddings are forced to regress to their class-level semantic embeddings and the embeddings crossing classes are required to be distinguishable by a trainable classifier.

Zero-Shot Learning

Paper
Add Code

Diagnostics in Semantic Segmentation

no code implementations • 27 Sep 2018 • Vladimir Nekrasov, Chunhua Shen, Ian Reid

Over the past years, computer vision community has contributed to enormous progress in semantic image segmentation, a per-pixel classification task, crucial for dense scene understanding and rapidly becoming vital in lots of real-world applications, including driverless cars and medical imaging.

Image Segmentation Scene Understanding +2

Paper
Add Code

Correlation Propagation Networks for Scene Text Detection

no code implementations • 30 Sep 2018 • Zichuan Liu, Guosheng Lin, Wang Ling Goh, Fayao Liu, Chunhua Shen, Xiaokang Yang

In this work, we propose a novel hybrid method for scene text detection namely Correlation Propagation Network (CPN).

Scene Text Detection Text Detection

Paper
Add Code

Face Detection with Effective Feature Extraction

no code implementations • 29 Sep 2010 • Sakrapee Paisitkriangkrai, Chunhua Shen, Jian Zhang

There is an abundant literature on face detection due to its important role in many vision applications.

Face Detection

Paper
Add Code

Face Recognition using Optimal Representation Ensemble

no code implementations • 3 Oct 2011 • Hanxi Li, Chunhua Shen, Yongsheng Gao

It also overwhelms other modular heuristics on the faces with random occlusions, extreme expressions and disguises.

Face Recognition Model Selection

Paper
Add Code

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

no code implementations • 24 Nov 2018 • Haokui Zhang, Ying Li, Peng Wang, Yu Liu, Chunhua Shen

Different from RGB videos, depth data in RGB-D videos provide key complementary information for tristimulus visual data which potentially could achieve accuracy improvement for action recognition.

Action Recognition Temporal Action Localization

Paper
Add Code

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

no code implementations • CVPR 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Lingqiao Liu, Ian Reid

In this paper, we propose to train convolutional neural networks (CNNs) with both binarized weights and activations, leading to quantized models specifically} for mobile devices with limited power capacity and computation resources.

General Classification Image Classification +2

Paper
Add Code

Visual Question Answering as Reading Comprehension

no code implementations • CVPR 2019 • Hui Li, Peng Wang, Chunhua Shen, Anton Van Den Hengel

In contrast to struggling on multimodal feature fusion, in this paper, we propose to unify all the input information by natural language so as to convert VQA into a machine reading comprehension problem.

Common Sense Reasoning General Knowledge +4

Paper
Add Code

Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification

no code implementations • 11 Dec 2018 • Xiu-Shen Wei, Chen-Lin Zhang, Lingqiao Liu, Chunhua Shen, Jianxin Wu

Inspired by the coarse-to-fine hierarchical process, we propose an end-to-end RNN-based Hierarchical Attention (RNN-HA) classification model for vehicle re-identification.

Vehicle Re-Identification

Paper
Add Code

Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks

no code implementations • CVPR 2019 • Peng Wang, Qi Wu, Jiewei Cao, Chunhua Shen, Lianli Gao, Anton Van Den Hengel

Being composed of node attention component and edge attention component, the proposed graph attention mechanism explicitly represents inter-object relationships, and properties with a flexibility and power impossible with competing approaches.

Graph Attention Object +2

Paper
Add Code

Positive Semidefinite Metric Learning with Boosting

no code implementations • NeurIPS 2009 • Chunhua Shen, Junae Kim, Lei Wang, Anton Hengel

In this work, we propose a boosting-based technique, termed BoostMetric, for learning a Mahalanobis distance metric.

General Classification Metric Learning

Paper
Add Code

PSDBoost: Matrix-Generation Linear Programming for Positive Semidefinite Matrices Learning

no code implementations • NeurIPS 2008 • Chunhua Shen, Alan Welsh, Lei Wang

In this work, we consider the problem of learning a positive semidefinite matrix.

Paper
Add Code

Monocular Relative Depth Perception With Web Stereo Data Supervision

no code implementations • CVPR 2018 • Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo

In this paper we study the problem of monocular relative depth perception in the wild.

Depth Estimation Semantic Segmentation

Paper
Add Code

Goal-Oriented Visual Question Generation via Intermediate Rewards

no code implementations • ECCV 2018 • Jun-Jie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton Van Den Hengel

Informativeness Question Generation +2

Paper
Add Code

Salient Object Detection with Lossless Feature Reflection and Weighted Structural Loss

no code implementations • 21 Jan 2019 • Pingping Zhang, Wei Liu, Huchuan Lu, Chunhua Shen

Inspired by the intrinsic reflection of natural images, in this paper we propose a novel feature learning framework for large-scale salient object detection.

Object object-detection +3

Paper
Add Code

Bilinear Programming for Human Activity Recognition with Unknown MRF Graphs

no code implementations • CVPR 2013 • Zhenhua Wang, Qinfeng Shi, Chunhua Shen, Anton Van Den Hengel

Markov Random Fields (MRFs) have been successfully applied to human activity modelling, largely due to their ability to model complex dependencies and deal with local uncertainty.

Human Activity Recognition

Paper
Add Code

Learning Compact Binary Codes for Visual Tracking

no code implementations • CVPR 2013 • Xi Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

A key problem in visual tracking is to represent the appearance of an object in a way that is robust to visual changes.

Visual Tracking

Paper
Add Code

Part-Based Visual Tracking with Online Latent Structural Learning

no code implementations • CVPR 2013 • Rui Yao, Qinfeng Shi, Chunhua Shen, Yanning Zhang, Anton Van Den Hengel

Despite many advances made in the area, deformable targets and partial occlusions continue to represent key problems in visual tracking.

Structured Prediction Visual Tracking

Paper
Add Code

Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs

no code implementations • CVPR 2015 • Bo Li, Chunhua Shen, Yuchao Dai, Anton Van Den Hengel, Mingyi He

Predicting the depth (or surface normal) of a scene from single monocular color images is a challenging task.

regression Surface Normal Estimation

Paper
Add Code

Learning Graph Structure for Multi-Label Image Classification via Clique Generation

no code implementations • CVPR 2015 • Mingkui Tan, Qinfeng Shi, Anton Van Den Hengel, Chunhua Shen, Junbin Gao, Fuyuan Hu, Zhen Zhang

Exploiting label dependency for multi-label image classification can significantly improve classification performance.

General Classification Multi-Label Image Classification

Paper
Add Code

What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution

no code implementations • CVPR 2016 • Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton Van Den Hengel, Heng Tao Shen

The key observation motivating our approach is that "regular object" images, "unusual object" images and "other objects" images exhibit different region-level scores in terms of both the score values and the spatial distributions.

Gaussian Processes Object +2

Paper
Add Code

Multi-Attention Network for One Shot Learning

no code implementations • CVPR 2017 • Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton Van Den Hengel, Heng Tao Shen

One-shot learning is a challenging problem where the aim is to recognize a class identified by a single training image.

One-Shot Learning TAG +1

Paper
Add Code

Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior

no code implementations • ICCV 2015 • Lei Zhang, Wei Wei, Yanning Zhang, Fei Li, Chunhua Shen, Qinfeng Shi

To reconstruct hyperspectral image (HSI) accurately from a few noisy compressive measurements, we present a novel manifold-structured sparsity prior based hyperspectral compressive sensing (HCS) method in this study.

Compressive Sensing

Paper
Add Code

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

1 code implementation • ICCV 2017 • Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

The proposed method still builds one classifier for one interaction (as per type (ii) above), but the classifier built is adaptive to context via weights which are context dependent.

Relationship Detection Visual Relationship Detection

Paper
Code

Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation

no code implementations • CVPR 2019 • Zhi Tian, Tong He, Chunhua Shen, Youliang Yan

In this work, we propose a data-dependent upsampling (DUpsampling) to replace bilinear, which takes advantages of the redundancy in the label space of semantic segmentation and is able to recover the pixel-wise prediction from low-resolution outputs of CNNs.

Ranked #46 on Semantic Segmentation on PASCAL Context

Decoder Segmentation +1

Paper
Add Code

Training Quantized Neural Networks with a Full-precision Auxiliary Module

no code implementations • CVPR 2020 • Bohan Zhuang, Lingqiao Liu, Mingkui Tan, Chunhua Shen, Ian Reid

In this paper, we seek to tackle a challenge in training low-precision networks: the notorious difficulty in propagating gradient through a low-precision network due to the non-differentiable quantization function.

Image Classification object-detection +2

Paper
Add Code

Architecture Search of Dynamic Cells for Semantic Video Segmentation

no code implementations • 4 Apr 2019 • Vladimir Nekrasov, Hao Chen, Chunhua Shen, Ian Reid

In semantic video segmentation the goal is to acquire consistent dense semantic labelling across image frames.

Neural Architecture Search Optical Flow Estimation +3

Paper
Add Code

Towards End-to-End Text Spotting in Natural Scenes

no code implementations • 14 Jun 2019 • Peng Wang, Hui Li, Chunhua Shen

Text spotting in natural scene images is of great importance for many image understanding tasks.

Image Cropping Text Detection +1

Paper
Add Code

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

no code implementations • 29 Jul 2019 • Damien Teney, Peng Wang, Jiewei Cao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

One of the primary challenges faced by deep learning is the degree to which current methods exploit superficial statistics and dataset bias, rather than learning to generalise over the specific representations they have experienced.

Visual Reasoning

Paper
Add Code

MobileFAN: Transferring Deep Hidden Representation for Face Alignment

no code implementations • 11 Aug 2019 • Yang Zhao, Yifan Liu, Chunhua Shen, Yongsheng Gao, Shengwu Xiong

To this end, we propose an effective lightweight model, namely Mobile Face Alignment Network (MobileFAN), using a simple backbone MobileNetV2 as the encoder and three deconvolutional layers as the decoder.

Decoder Face Alignment +1

Paper
Add Code

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

no code implementations • 10 Aug 2019 • Bohan Zhuang, Jing Liu, Mingkui Tan, Lingqiao Liu, Ian Reid, Chunhua Shen

Furthermore, we propose a second progressive quantization scheme which gradually decreases the bit-width from high-precision to low-precision during training.

Knowledge Distillation Quantization

Paper
Add Code

Auxiliary Learning for Deep Multi-task Learning

no code implementations • 5 Sep 2019 • Yifan Liu, Bohan Zhuang, Chunhua Shen, Hao Chen, Wei Yin

The most current methods can be categorized as either: (i) hard parameter sharing where a subset of the parameters is shared among tasks while other parameters are task-specific; or (ii) soft parameter sharing where all parameters are task-specific but they are jointly regularized.

Auxiliary Learning Depth Estimation +3

Paper
Add Code

Structured Binary Neural Networks for Image Recognition

no code implementations • 22 Sep 2019 • Bohan Zhuang, Chunhua Shen, Mingkui Tan, Peng Chen, Lingqiao Liu, Ian Reid

Experiments on both classification, semantic segmentation and object detection tasks demonstrate the superior performance of the proposed methods over various quantized networks in the literature.

object-detection Object Detection +2

Paper
Add Code

Unified Multifaceted Feature Learning for Person Re-Identification

no code implementations • 20 Nov 2019 • Cheng Yan, Guansong Pang, Xiao Bai, Chunhua Shen

The loss structures the augmented images resulted by the two types of image erasing in a two-level hierarchy and enforces multifaceted attention to different parts.

Person Re-Identification

Paper
Add Code

To Balance or Not to Balance: A Simple-yet-Effective Approach for Learning with Long-Tailed Distributions

no code implementations • 10 Dec 2019 • Jun-Jie Zhang, Lingqiao Liu, Peng Wang, Chunhua Shen

Such imbalanced distribution causes a great challenge for learning a deep neural network, which can be boiled down into a dilemma: on the one hand, we prefer to increase the exposure of tail class samples to avoid the excessive dominance of head classes in the classifier training.

Auxiliary Learning Self-Supervised Learning

Paper
Add Code

Ordered or Orderless: A Revisit for Video based Person Re-Identification

no code implementations • 24 Dec 2019 • Le Zhang, Zenglin Shi, Joey Tianyi Zhou, Ming-Ming Cheng, Yun Liu, Jia-Wang Bian, Zeng Zeng, Chunhua Shen

Specifically, with a diagnostic analysis, we show that the recurrent structure may not be effective to learn temporal dependencies than what we expected and implicitly yields an orderless representation.

Video-Based Person Re-Identification

Paper
Add Code

Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation

no code implementations • ECCV 2020 • Tong He, Dong Gong, Zhi Tian, Chunhua Shen

3D point cloud semantic and instance segmentation is crucial and fundamental for 3D scene understanding.

Ranked #28 on 3D Instance Segmentation on ScanNet(v2)

3D Instance Segmentation Scene Understanding +1

Paper
Add Code

Memorizing Comprehensively to Learn Adaptively: Unsupervised Cross-Domain Person Re-ID with Multi-level Memory

no code implementations • 13 Jan 2020 • Xin-Yu Zhang, Dong Gong, Jiewei Cao, Chunhua Shen

Due to the lack of supervision in the target domain, it is crucial to identify the underlying similarity-and-dissimilarity relationships among the unlabelled samples in the target domain.

Person Re-Identification

Paper
Add Code

Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild

no code implementations • 13 Jan 2020 • Canjie Luo, Qingxiang Lin, Yuliang Liu, Lianwen Jin, Chunhua Shen

Furthermore, to tackle the issue of lacking paired training samples, we design an interactive joint training scheme, which shares attention masks from the recognizer to the discriminator, and enables the discriminator to extract the features of each character for further adversarial training.

Style Transfer

Paper
Add Code

Joint Deep Learning of Facial Expression Synthesis and Recognition

no code implementations • 6 Feb 2020 • Yan Yan, Ying Huang, Si Chen, Chunhua Shen, Hanzi Wang

Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Add Code

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

no code implementations • CVPR 2020 • Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton Van Den Hengel, Liangwei Wang

Visual Question Answering (VQA) methods have made incredible progress, but suffer from a failure to generalize.

Question Answering Referring Expression +1

Paper
Add Code

Self-trained Deep Ordinal Regression for End-to-End Video Anomaly Detection

no code implementations • CVPR 2020 • Guansong Pang, Cheng Yan, Chunhua Shen, Anton Van Den Hengel, Xiao Bai

Video anomaly detection is of critical practical importance to a variety of real applications because it allows human attention to be focused on events that are likely to be of interest, in spite of an otherwise overwhelming volume of video.

Anomaly Detection regression +2

Paper
Add Code

Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

no code implementations • 11 Mar 2020 • Genshun Dong, Yan Yan, Chunhua Shen, Hanzi Wang

Meanwhile, a Spatial detail-Preserving Network (SPN) with shallow convolutional layers is designed to generate high-resolution feature maps preserving the detailed spatial information.

Image Segmentation Segmentation +2

Paper
Add Code

Scope Head for Accurate Localization in Object Detection

no code implementations • 11 May 2020 • Geng Zhan, Dan Xu, Guo Lu, Wei Wu, Chunhua Shen, Wanli Ouyang

Existing anchor-based and anchor-free object detectors in multi-stage or one-stage pipelines have achieved very promising detection performance.

Object object-detection +2

Paper
Add Code

A Robust Attentional Framework for License Plate Recognition in the Wild

no code implementations • 6 Jun 2020 • Linjiang Zhang, Peng Wang, Hui Li, Zhen Li, Chunhua Shen, Yanning Zhang

On the other hand, the 2D attentional based license plate recognizer with an Xception-based CNN encoder is capable of recognizing license plates with different patterns under various scenarios accurately and robustly.

Image Generation License Plate Recognition

Paper
Add Code

FCOS: A simple and strong anchor-free object detector

no code implementations • 14 Jun 2020 • Zhi Tian, Chunhua Shen, Hao Chen, Tong He

In computer vision, object detection is one of most important tasks, which underpins a few instance-level recognition tasks and many downstream applications.

Object Object Detection +1

Paper
Add Code

Deep Learning for Anomaly Detection: A Review

no code implementations • 6 Jul 2020 • Guansong Pang, Chunhua Shen, Longbing Cao, Anton Van Den Hengel

This paper surveys the research of deep anomaly detection with a comprehensive taxonomy, covering advancements in three high-level categories and 11 fine-grained categories of the methods.

Anomaly Detection Novelty Detection +1

Paper
Add Code

Soft Expert Reward Learning for Vision-and-Language Navigation

no code implementations • ECCV 2020 • Hu Wang, Qi Wu, Chunhua Shen

In this paper, we introduce a Soft Expert Reward Learning (SERL) model to overcome the reward engineering designing and generalisation problems of the VLN task.

Reinforcement Learning (RL) Vision and Language Navigation

Paper
Add Code

Pairwise Relation Learning for Semi-supervised Gland Segmentation

no code implementations • 6 Aug 2020 • Yutong Xie, Jianpeng Zhang, Zhibin Liao, Chunhua Shen, Johan Verjans, Yong Xia

In this paper, we propose the pairwise relation-based semi-supervised (PRS^2) model for gland segmentation on histology images.

Relation Relation Network +1

Paper
Add Code

Representative Graph Neural Network

no code implementations • ECCV 2020 • Changqian Yu, Yifan Liu, Changxin Gao, Chunhua Shen, Nong Sang

In this paper, we present a Representative Graph (RepGraph) layer to dynamically sample a few representative features, which dramatically reduces redundancy.

object-detection Object Detection +1

Paper
Add Code

FATNN: Fast and Accurate Ternary Neural Networks

no code implementations • ICCV 2021 • Peng Chen, Bohan Zhuang, Chunhua Shen

Ternary Neural Networks (TNNs) have received much attention due to being potentially orders of magnitude faster in inference, as well as more power efficient, than full-precision counterparts.

Image Classification Quantization

Paper
Add Code

Instance-Aware Embedding for Point Cloud Instance Segmentation

no code implementations • ECCV 2020 • Tong He, Yifan Liu, Chunhua Shen, Xinlong Wang, Changming Sun

However, these methods are unaware of the instance context and fail to realize the boundary and geometric information of an instance, which are critical to separate adjacent objects.

Instance Segmentation Semantic Segmentation

Paper
Add Code

Unifying Instance and Panoptic Segmentation with Dynamic Rank-1 Convolutions

no code implementations • 19 Nov 2020 • Hao Chen, Chunhua Shen, Zhi Tian

To our knowledge, DR1Mask is the first panoptic segmentation framework that exploits a shared feature map for both instance and semantic segmentation by considering both efficacy and efficiency.

Instance Segmentation Multi-Task Learning +2

Paper
Add Code

Graph Attention Tracking

no code implementations • CVPR 2021 • Dongyan Guo, Yanyan Shao, Ying Cui, Zhenhua Wang, Liyan Zhang, Chunhua Shen

We propose to establish part-to-part correspondence between the target and the search region with a complete bipartite graph, and apply the graph attention mechanism to propagate target information from the template feature to the search feature.

Graph Attention Object Tracking +1

Paper
Add Code

PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image Segmentation

no code implementations • 25 Nov 2020 • Yutong Xie, Jianpeng Zhang, Zehui Liao, Yong Xia, Chunhua Shen

In this paper, we propose a PriorGuided Local (PGL) self-supervised model that learns the region-wise local consistency in the latent feature space.

Image Segmentation Medical Image Segmentation +3

Paper
Add Code

Diverse Knowledge Distillation for End-to-End Person Search

no code implementations • 21 Dec 2020 • Xinyu Zhang, Xinlong Wang, Jia-Wang Bian, Chunhua Shen, Mingyu You

Person search aims to localize and identify a specific person from a gallery of images.

Human Detection Knowledge Distillation +1

Paper
Add Code

Single-path Bit Sharing for Automatic Loss-aware Model Compression

no code implementations • 13 Jan 2021 • Jing Liu, Bohan Zhuang, Peng Chen, Chunhua Shen, Jianfei Cai, Mingkui Tan

By jointly training the binary gates in conjunction with network parameters, the compression configurations of each layer can be automatically determined.

Model Compression Network Pruning +1

Paper
Add Code

Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline

no code implementations • 24 Jan 2021 • Hu Wang, Hao Chen, Qi Wu, Congbo Ma, Yidong Li, Chunhua Shen

To address these issues, in this work we carefully design our settings and propose a new dataset including both synthetic and real traffic data in more complex scenarios.

Decoder

Paper
Add Code

Object Detection Made Simpler by Eliminating Heuristic NMS

no code implementations • 28 Jan 2021 • Qiang Zhou, Chaohui Yu, Chunhua Shen, Zhibin Wang, Hao Li

On the COCO dataset, our simple design achieves superior performance compared to both the FCOS baseline detector with NMS post-processing and the recent end-to-end NMS-free detectors.

Object object-detection +1

Paper
Add Code

Instance and Panoptic Segmentation Using Conditional Convolutions

no code implementations • 5 Feb 2021 • Zhi Tian, BoWen Zhang, Hao Chen, Chunhua Shen

In the literature, top-performing instance segmentation methods typically follow the paradigm of Mask R-CNN and rely on ROI operations (typically ROIAlign) to attend to each instance.

Instance Segmentation Panoptic Segmentation +1

Paper
Add Code

Generic Perceptual Loss for Modeling Structured Output Dependencies

no code implementations • CVPR 2021 • Yifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen

We hope that this simple, extended perceptual loss may serve as a generic structured-output loss that is applicable to most structured output learning tasks.

Depth Estimation Image Generation +4

Paper
Add Code

TFPose: Direct Human Pose Estimation with Transformers

no code implementations • 29 Mar 2021 • Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang

We propose a human pose estimation framework that solves the task in the regression-based fashion.

Ranked #26 on Pose Estimation on MPII Human Pose (using extra training data)

Pose Estimation regression

Paper
Add Code

An Adversarial Human Pose Estimation Network Injected with Graph Structure

no code implementations • 29 Mar 2021 • Lei Tian, Guoqiang Liang, Peng Wang, Chunhua Shen

Because of the invisible human keypoints in images caused by illumination, occlusion and overlap, it is likely to produce unreasonable human pose prediction for most of the current human pose estimation methods.

Generative Adversarial Network Pose Estimation +1

Paper
Add Code

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition

no code implementations • CVPR 2021 • Delian Ruan, Yan Yan, Shenqi Lai, Zhenhua Chai, Chunhua Shen, Hanzi Wang

In this paper, we propose a novel Feature Decomposition and Reconstruction Learning (FDRL) method for effective facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Add Code

HCRF-Flow: Scene Flow from Point Clouds with Continuous High-order CRFs and Position-aware Flow Embedding

no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen

Scene flow in 3D point clouds plays an important role in understanding dynamic environments.

Position

Paper
Add Code

SOLO: A Simple Framework for Instance Segmentation

no code implementations • 30 Jun 2021 • Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei LI

Besides instance segmentation, our method yields state-of-the-art results in object detection (from our mask byproduct) and panoptic segmentation.

Image Matting Instance Segmentation +4

Paper
Add Code

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data

no code implementations • CVPR 2021 • Ying Shu, Yan Yan, Si Chen, Jing-Hao Xue, Chunhua Shen, Hanzi Wang

First, three auxiliary tasks, consisting of a Patch Rotation Task (PRT), a Patch Segmentation Task (PST), and a Patch Classification Task (PCT), are jointly developed to learn the spatial-semantic relationship from large-scale unlabeled facial data.

Ranked #3 on Facial Attribute Classification on LFWA

Attribute Facial Attribute Classification +1

Paper
Add Code

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning

no code implementations • ICCV 2021 • Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen

Inspired by the recent success in Automated Machine Learning literature (AutoML), in this paper, we present Meta Navigator, a framework that attempts to solve the aforementioned limitation in few-shot learning by seeking a higher-level strategy and proffer to automate the selection from various few-shot learning designs.

AutoML Few-Shot Learning

Paper
Add Code

BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification

no code implementations • ICCV 2021 • Cheng Yan, Guansong Pang, Lei Wang, Jile Jiao, Xuetao Feng, Chunhua Shen, Jingjing Li

In this work we introduce a new ReID task, bird-view person ReID, which aims at searching for a person in a gallery of horizontal-view images with the query images taken from a bird's-eye view, i. e., an elevated view of an object from above.

Person Re-Identification

Paper
Add Code

Occluded Person Re-Identification With Single-Scale Global Representations

no code implementations • ICCV 2021 • Cheng Yan, Guansong Pang, Jile Jiao, Xiao Bai, Xuetao Feng, Chunhua Shen

However, real-world ReID applications typically have highly diverse occlusions and involve a hybrid of occluded and non-occluded pedestrians.

Graph Matching Person Re-Identification +1

Paper
Add Code

Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation

no code implementations • CVPR 2022 • Yutong Dai, Brian Price, He Zhang, Chunhua Shen

Deep image matting methods have achieved increasingly better results on benchmarks (e. g., Composition-1k/alphamatting. com).

Data Augmentation Decoder +1

Paper
Add Code

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

no code implementations • 3 Feb 2022 • Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Wu, Feng Zhao

However, in some video-based scenarios such as video depth estimation and 3D scene reconstruction from a video, the unknown scale and shift residing in per-frame prediction may cause the depth inconsistency.

3D Scene Reconstruction Depth Completion +1

Paper
Add Code

Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings

no code implementations • 4 Feb 2022 • Wei Yin, Yifan Liu, Chunhua Shen, Baichuan Sun, Anton Van Den Hengel

The resulting merged semantic segmentation dataset of over 2 Million images enables training a model that achieves performance equal to that of state-of-the-art supervised methods on 7 benchmark datasets, despite not using any images therefrom.

Ranked #1 on Semantic Segmentation on WildDash

Instance Segmentation Monocular Depth Estimation +4

Paper
Add Code

Retrieval Augmented Classification for Long-Tail Visual Recognition

no code implementations • CVPR 2022 • Alexander Long, Wei Yin, Thalaiyasingam Ajanthan, Vu Nguyen, Pulak Purkait, Ravi Garg, Alan Blair, Chunhua Shen, Anton Van Den Hengel

We introduce Retrieval Augmented Classification (RAC), a generic approach to augmenting standard image classification pipelines with an explicit retrieval module.

Ranked #4 on Long-tail Learning on iNaturalist 2018

Classification Image Classification +2

Paper
Add Code

Efficient Video Segmentation Models with Per-frame Inference

no code implementations • 24 Feb 2022 • Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang

To this end, we perform inference at each frame.

Image Matting Instance Segmentation +6

Paper
Add Code

Improving Monocular Visual Odometry Using Learned Depth

no code implementations • 4 Apr 2022 • Libo Sun, Wei Yin, Enze Xie, Zhengrong Li, Changming Sun, Chunhua Shen

The core of our framework is a monocular depth estimation module with a strong generalization capability for diverse scenes.

Monocular Depth Estimation Monocular Visual Odometry

Paper
Add Code

On the Dual Formulation of Boosting Algorithms

no code implementations • 23 Jan 2009 • Chunhua Shen, Hanxi Li

We study boosting algorithms from a new perspective.

Paper
Add Code

PointInst3D: Segmenting 3D Instances by Points

no code implementations • 25 Apr 2022 • Tong He, Wei Yin, Chunhua Shen, Anton Van Den Hengel

The current state-of-the-art methods in 3D instance segmentation typically involve a clustering step, despite the tendency towards heuristics, greedy algorithms, and a lack of robustness to the changes in data statistics.

3D Instance Segmentation Clustering +2

Paper
Add Code

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining

no code implementations • 29 Apr 2022 • Yuting Gao, Jinfeng Liu, Zihan Xu, Jun Zhang, Ke Li, Rongrong Ji, Chunhua Shen

Large-scale vision-language pre-training has achieved promising results on downstream tasks.

Image Classification Language Modelling +3

Paper
Add Code

Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images

no code implementations • 27 May 2022 • Zhi Tian, Xiangxiang Chu, Xiaoming Wang, Xiaolin Wei, Chunhua Shen

In this work, we tackle this challenging issue with a novel range view projection mechanism, and for the first time demonstrate the benefits of fusing multi-frame point clouds for a range-view based detector.

3D Object Detection Autonomous Driving +2

Paper
Add Code

Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations

no code implementations • 1 Jun 2022 • Yongtao Ge, Qiang Zhou, Xinlong Wang, Zhibin Wang, Hao Li, Chunhua Shen

Point annotations are considerably more time-efficient than bounding box annotations.

Data Augmentation Multiple Instance Learning +4

Paper
Add Code

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior

no code implementations • CVPR 2022 • Ruibo Li, Chi Zhang, Guosheng Lin, Zhe Wang, Chunhua Shen

In this work, we focus on scene flow learning on point clouds in a self-supervised manner.

Motion Estimation Self-Supervised Learning

Paper
Add Code

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval

no code implementations • 27 Sep 2022 • Chengzhi Lin, AnCong Wu, Junwei Liang, Jun Zhang, Wenhang Ge, Wei-Shi Zheng, Chunhua Shen

To address this problem, we propose a Text-Adaptive Multiple Visual Prototype Matching model, which automatically captures multiple prototypes to describe a video by adaptive aggregation of video token features.

Cross-Modal Retrieval Retrieval +2

Paper
Add Code

Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition

no code implementations • 13 Oct 2022 • Shuai Jia, Bangjie Yin, Taiping Yao, Shouhong Ding, Chunhua Shen, Xiaokang Yang, Chao Ma

For face recognition attacks, existing methods typically generate the l_p-norm perturbations on pixels, however, resulting in low attack transferability and high vulnerability to denoising defense models.

Adversarial Attack Attribute +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.