Search Results for author: Jianbing Shen

Found 59 papers, 41 papers with code

CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers

1 code implementation ECCV 2020 Xingping Dong, Jianbing Shen, Ling Shao, Fatih Porikli

To make full use of these sequence-specific samples, {we propose a compact latent network to quickly adjust the tracking model to adapt to new scenes.}

Full-Duplex Strategy for Video Object Segmentation

1 code implementation ICCV 2021 Ge-Peng Ji, Deng-Ping Fan, Keren Fu, Zhe Wu, Jianbing Shen, Ling Shao

Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues.

Salient Object Detection Semantic Segmentation +2

Video Object Segmentation Using Global and Instance Embedding Learning

no code implementations CVPR 2021 Wenbin Ge, Xiankai Lu, Jianbing Shen

In this paper, we propose a feature embedding based video object segmentation (VOS) method which is simple, fast and effective.

Semantic Segmentation Video Object Segmentation +1

Face Forensics in the Wild

1 code implementation CVPR 2021 Tianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen

On existing public benchmarks, face forgery detection techniques have achieved great success.

Multiple Instance Learning

Structured Scene Memory for Vision-Language Navigation

1 code implementation CVPR 2021 Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen

Recently, numerous algorithms have been developed to tackle the problem of vision-language navigation (VLN), i. e., entailing an agent to navigate 3D environments through following linguistic instructions.

Decision Making Vision-Language Navigation

Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

1 code implementation CVPR 2021 Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

We propose a learnable module, called the asymmetric convolution (ACM), which learns to better capture the semantic correlation information in offline training on large-scale data.

Visual Tracking

Siamese Network for RGB-D Salient Object Detection and Beyond

2 code implementations26 Aug 2020 Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao, Jianbing Shen, Ce Zhu

Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture.

Ranked #2 on RGB-D Salient Object Detection on SIP (using extra training data)

RGB-D Salient Object Detection Salient Object Detection +1

RGB-D Salient Object Detection: A Survey

9 code implementations1 Aug 2020 Tao Zhou, Deng-Ping Fan, Ming-Ming Cheng, Jianbing Shen, Ling Shao

Further, considering that the light field can also provide depth maps, we review SOD models and popular benchmark datasets from this domain as well.

RGB-D Salient Object Detection RGB Salient Object Detection +1

Weakly Supervised 3D Object Detection from Lidar Point Cloud

1 code implementation ECCV 2020 Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc van Gool, Dengxin Dai

This work proposes a weakly supervised approach for 3D object detection, only requiring a small set of weakly annotated scenes, associated with a few precisely labeled object instances.

3D Object Detection

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

1 code implementation ECCV 2020 Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo

In this paper, we propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID.

Person Re-Identification

Active Visual Information Gathering for Vision-Language Navigation

1 code implementation ECCV 2020 Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen

Vision-language navigation (VLN) is the task of entailing an agent to carry out navigational instructions inside photo-realistic environments.

Vision-Language Navigation

Video Object Segmentation with Episodic Graph Memory Networks

1 code implementation ECCV 2020 Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc van Gool

How to make a segmentation model efficiently adapt to a specific video and to online target appearance variations are fundamentally crucial issues in the field of video object segmentation.

Semantic Segmentation Video Object Segmentation +2

Re-thinking Co-Salient Object Detection

2 code implementations7 Jul 2020 Deng-Ping Fan, Tengpeng Li, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Ming-Ming Cheng, Huazhu Fu, Jianbing Shen

CoSOD is an emerging and rapidly growing extension of salient object detection (SOD), which aims to detect the co-occurring salient objects in a group of images.

Co-Salient Object Detection Salient Object Detection

PraNet: Parallel Reverse Attention Network for Polyp Segmentation

3 code implementations13 Jun 2020 Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

To address these challenges, we propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.

Ranked #3 on Camouflaged Object Segmentation on CAMO (using extra training data)

Camouflaged Object Segmentation Camouflage Segmentation +1

M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients

no code implementations1 Jun 2020 Tao Zhou, Huazhu Fu, Yu Zhang, Changqing Zhang, Xiankai Lu, Jianbing Shen, Ling Shao

Then, we use a modality-specific network to extract implicit and high-level features from different MR scans.

Modeling and Enhancing Low-quality Retinal Fundus Images

1 code implementation12 May 2020 Ziyi Shen, Huazhu Fu, Jianbing Shen, Ling Shao

Retinal fundus images are widely used for the clinical screening and diagnosis of eye diseases.

Image Enhancement Retinal Vessel Segmentation

Self-Learning with Rectification Strategy for Human Parsing

no code implementations CVPR 2020 Tao Li, Zhiyuan Liang, Sanyuan Zhao, Jiahao Gong, Jianbing Shen

For the global error, we first transform category-wise features into a high-level graph model with coarse-grained structural information, and then decouple the high-level graph to reconstruct the category features.

Human Parsing Rectification

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

1 code implementation CVPR 2020 Junbo Yin, Wenguan Wang, Qinghao Meng, Ruigang Yang, Jianbing Shen

In this paper, we propose a novel MOT framework that unifies object motion and affinity model into a single network, named UMA, in order to learn a compact feature that is discriminative for both object motion and affinity measure.

Metric Learning Multi-Object Tracking +2

Learning Video Object Segmentation from Unlabeled Videos

1 code implementation CVPR 2020 Xiankai Lu, Wenguan Wang, Jianbing Shen, Yu-Wing Tai, David Crandall, Steven C. H. Hoi

We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data.

Representation Learning Semantic Segmentation +3

Hierarchical Human Parsing with Typed Part-Relation Reasoning

1 code implementation CVPR 2020 Wenguan Wang, Hailong Zhu, Jifeng Dai, Yanwei Pang, Jianbing Shen, Ling Shao

As human bodies are underlying hierarchically structured, how to model human structures is the central theme in this task.

Human Parsing

Cascaded Human-Object Interaction Recognition

1 code implementation CVPR 2020 Tianfei Zhou, Wenguan Wang, Siyuan Qi, Haibin Ling, Jianbing Shen

The interaction recognition network has two crucial parts: a relation ranking module for high-quality HOI proposal selection and a triple-stream classifier for relation prediction.

Human-Object Interaction Detection

Infinitely Wide Graph Convolutional Networks: Semi-supervised Learning via Gaussian Processes

no code implementations26 Feb 2020 Jilin Hu, Jianbing Shen, Bin Yang, Ling Shao

Graph convolutional neural networks~(GCNs) have recently demonstrated promising results on graph-based semi-supervised classification, but little work has been done to explore their theoretical properties.

Gaussian Processes General Classification

Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis

2 code implementations11 Feb 2020 Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao

Medical image synthesis has been proposed as an effective solution to this, where any missing modalities are synthesized from the existing ones.

Image Generation

Learning Compositional Neural Information Fusion for Human Parsing

1 code implementation ICCV 2019 Wenguan Wang, Zhijie Zhang, Siyuan Qi, Jianbing Shen, Yanwei Pang, Ling Shao

The bottom-up and top-down inferences explicitly model the compositional and decompositional relations in human bodies, respectively.

Human Parsing

Human-Aware Motion Deblurring

1 code implementation ICCV 2019 Ziyi Shen, Wenguan Wang, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao

This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG).

Deblurring

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks

1 code implementation ICCV 2019 Wenguan Wang, Xiankai Lu, Jianbing Shen, David Crandall, Ling Shao

Through parametric message passing, AGNN is able to efficiently capture and mine much richer and higher-order relations between video frames, thus enabling a more complete understanding of video content and more accurate foreground estimation.

Semantic Segmentation Video Object Segmentation +3

NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

no code implementations CVPR 2020 Yazhao Li, Yanwei Pang, Jianbing Shen, Jiale Cao, Ling Shao

With this observation, we propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.

Object Detection

Deep Learning for Person Re-identification: A Survey and Outlook

3 code implementations13 Jan 2020 Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, Steven C. H. Hoi

The widely studied closed-world setting is usually applied under various research-oriented assumptions, and has achieved inspiring success using deep learning techniques on a number of datasets.

Metric Learning Person Re-Identification +1

Teacher-Students Knowledge Distillation for Siamese Trackers

no code implementations24 Jul 2019 Yuanpei Liu, Xingping Dong, Xiankai Lu, Fahad Shahbaz Khan, Jianbing Shen, Steven Hoi

To the best of our knowledge, we are the first to investigate knowledge distillation for Siamese trackers and propose a distilled Siamese tracking framework.

Knowledge Distillation Object Tracking

Evaluation of Retinal Image Quality Assessment Networks in Different Color-spaces

2 code implementations10 Jul 2019 Huazhu Fu, Boyang Wang, Jianbing Shen, Shanshan Cui, Yanwu Xu, Jiang Liu, Ling Shao

Retinal image quality assessment (RIQA) is essential for controlling the quality of retinal imaging and guaranteeing the reliability of diagnoses by ophthalmologists or automated analysis systems.

Image Quality Assessment

Understanding More about Human and Machine Attention in Deep Neural Networks

no code implementations20 Jun 2019 Qiuxia Lai, Salman Khan, Yongwei Nie, Jianbing Shen, Hanqiu Sun, Ling Shao

With three example computer vision tasks, diverse representative backbones, and famous architectures, corresponding real human gaze data, and systematically conducted large-scale quantitative studies, we quantify the consistency between artificial attention and human visual attention and offer novel insights into existing artificial attention mechanisms by giving preliminary answers to several key questions related to human and artificial attention mechanisms.

Fine-Grained Image Classification Semantic Segmentation +1

Extreme Points Derived Confidence Map as a Cue For Class-Agnostic Segmentation Using Deep Neural Network

1 code implementation6 Jun 2019 Shadab Khan, Ahmed H. Shahin, Javier Villafruela, Jianbing Shen, Ling Shao

To automate the process of segmenting an anatomy of interest, we can learn a model from previously annotated data.

Salient Object Detection in the Deep Learning Era: An In-Depth Survey

1 code implementation19 Apr 2019 Wenguan Wang, Qiuxia Lai, Huazhu Fu, Jianbing Shen, Haibin Ling, Ruigang Yang

As an essential problem in computer vision, salient object detection (SOD) has attracted an increasing amount of research attention over the years.

RGB Salient Object Detection Saliency Prediction +1

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks

1 code implementation ICCV 2019 Aamir Mustafa, Salman Khan, Munawar Hayat, Roland Goecke, Jianbing Shen, Ling Shao

Deep neural networks are vulnerable to adversarial attacks, which can fool them by adding minuscule perturbations to the input images.

Adversarial Defense

Striking the Right Balance with Uncertainty

no code implementations CVPR 2019 Salman Khan, Munawar Hayat, Waqas Zamir, Jianbing Shen, Ling Shao

Rare classes tend to get a concentrated representation in the classification space which hampers the generalization of learned boundaries to new test examples.

Classification Face Verification +2

Image Super-Resolution as a Defense Against Adversarial Attacks

1 code implementation7 Jan 2019 Aamir Mustafa, Salman H. Khan, Munawar Hayat, Jianbing Shen, Ling Shao

The proposed scheme is simple and has the following advantages: (1) it does not require any model training or parameter optimization, (2) it complements other existing defense mechanisms, (3) it is agnostic to the attacked model and attack type and (4) it provides superior performance across all popular attack algorithms.

Adversarial Defense Image Enhancement +2

Triplet Loss in Siamese Network for Object Tracking

1 code implementation ECCV 2018 Xingping Dong, Jianbing Shen

In this paper, a novel triplet loss is proposed to extract expressive deep feature for object tracking by adding it into Siamese network framework instead of pairwise loss for training.

Object Tracking

Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection

1 code implementation ECCV 2018 Hongmei Song, Wenguan Wang, Sanyuan Zhao, Jianbing Shen, Kin-Man Lam

This paper proposes a fast video salient object detection model, based on a novel recurrent network architecture, named Pyramid Dilated Bidirectional ConvLSTM (PDB-ConvLSTM).

 Ranked #1 on Video Salient Object Detection on UVSD (using extra training data)

Salient Object Detection Semantic Segmentation +3

Learning Human-Object Interactions by Graph Parsing Neural Networks

1 code implementation ECCV 2018 Siyuan Qi, Wenguan Wang, Baoxiong Jia, Jianbing Shen, Song-Chun Zhu

For a given scene, GPNN infers a parse graph that includes i) the HOI graph structure represented by an adjacency matrix, and ii) the node labels.

Human-Object Interaction Detection

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification

1 code implementation CVPR 2018 Wenguan Wang, Yuanlu Xu, Jianbing Shen, Song-Chun Zhu

This paper proposes a knowledge-guided fashion network to solve the problem of visual fashion analysis, e. g., fashion landmark localization and clothing category classification.

General Classification

Salient Object Detection Driven by Fixation Prediction

1 code implementation CVPR 2018 Wenguan Wang, Jianbing Shen, Xingping Dong, Ali Borji

Salient object detection is then viewed as fine-grained object-level saliency segmentation and is progressively optimized with the guidance of the fixation map in a top-down manner.

RGB Salient Object Detection Salient Object Detection

Revisiting Video Saliency: A Large-scale Benchmark and a New Model

1 code implementation CVPR 2018 Wenguan Wang, Jianbing Shen, Fang Guo, Ming-Ming Cheng, Ali Borji

Existing video saliency datasets lack variety and generality of common dynamic scenes and fall short in covering challenging situations in unconstrained environments.

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

no code implementations ICCV 2017 Wenguan Wang, Jianbing Shen

We model the photo cropping problem as a cascade of attention box regression and aesthetic quality classification, based on deep learning.

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

no code implementations28 Jul 2017 Weilin Cong, Sanyuan Zhao, Hui Tian, Jianbing Shen

Real-world face detection and alignment demand an advanced discriminative model to address challenges by pose, lighting and expression.

Face Detection

Quadruplet Network with One-Shot Learning for Fast Visual Object Tracking

no code implementations19 May 2017 Xingping Dong, Jianbing Shen, Dongming Wu, Kan Guo, Xiaogang Jin, Fatih Porikli

In this paper, we propose a new quadruplet deep network to examine the potential connections among the training instances, aiming to achieve a more powerful representation.

One-Shot Learning Visual Object Tracking

Deep Visual Attention Prediction

1 code implementation journal 2017 Wenguan Wang, Jianbing Shen

Our model is based on a skip-layer network structure, which predicts human attention from multiple convolutional layers with various reception fields.

Saliency Prediction

Super-Trajectory for Video Segmentation

no code implementations ICCV 2017 Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli

We introduce a novel semi-supervised video segmentation approach based on an efficient video representation, called as "super-trajectory".

Video Segmentation Video Semantic Segmentation

Linearization to Nonlinear Learning for Visual Tracking

no code implementations ICCV 2015 Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli

Building on the theory of globally linear approximations to nonlinear functions, we introduce an elegant method that jointly learns a nonlinear classifier and a visual dictionary for tracking objects in a semi-supervised sparse coding fashion.

Dictionary Learning Visual Tracking

Saliency-Aware Geodesic Video Object Segmentation

1 code implementation CVPR 2015 Wenguan Wang, Jianbing Shen, Fatih Porikli

Building on the observation that foreground areas are surrounded by the regions with high spatiotemporal edge values, geodesic distance provides an initial estimation for foreground and background.

Ranked #4 on Video Salient Object Detection on SegTrack v2 (using extra training data)

Semantic Segmentation Video Salient Object Detection +1

Lazy Random Walks for Superpixel Segmentation

1 code implementation IEEE Trans. on Image Processing 2014 Jianbing Shen, Yunfan Du, Wenguan Wang, Xuelong. Li

Then, the boundaries of initial superpixels are obtained according to the probabilities and the commute time.

Cannot find the paper you are looking for? You can Submit a new open access paper.