Search Results for author: Jianbing Shen

Found 76 papers, 51 papers with code

CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers

1 code implementation ECCV 2020 Xingping Dong, Jianbing Shen, Ling Shao, Fatih Porikli

To make full use of these sequence-specific samples, {we propose a compact latent network to quickly adjust the tracking model to adapt to new scenes.}

Weakly Supervised Monocular 3D Object Detection using Multi-View Projection and Direction Consistency

no code implementations15 Mar 2023 Runzhou Tao, Wencheng Han, Zhongying Qiu, Cheng-Zhong Xu, Jianbing Shen

When used as a pre-training method, our model can significantly outperform the corresponding fully-supervised baseline with only 1/3 3D labels.

Monocular 3D Object Detection object-detection

Referring Multi-Object Tracking

1 code implementation6 Mar 2023 Dongming Wu, Wencheng Han, Tiancai Wang, Xingping Dong, Xiangyu Zhang, Jianbing Shen

In this paper, we propose a new and general referring understanding task, termed referring multi-object tracking (RMOT).

Multi-Object Tracking

Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving

no code implementations8 Feb 2023 Jiawei Liu, Xingping Dong, Sanyuan Zhao, Jianbing Shen

To achieve simultaneous detection for both common and rare objects, we propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.

3D Object Detection Autonomous Driving +1

Adaptive Siamese Tracking with a Compact Latent Network

no code implementations2 Feb 2023 Xingping Dong, Jianbing Shen, Fatih Porikli, Jiebo Luo, Ling Shao

Under this viewing, we perform an in-depth analysis for them through visual simulations and real tracking examples, and find that the failure cases in some challenging situations can be regarded as the issue of missing decisive samples in offline training.

LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving

1 code implementation7 Dec 2022 Xiang Li, Junbo Yin, Botian Shi, Yikang Li, Ruigang Yang, Jianbing Shen

In this paper, we present a more artful framework, LiDAR-guided Weakly Supervised Instance Segmentation (LWSIS), which leverages the off-the-shelf 3D data, i. e., Point Cloud, together with the 3D boxes, as natural weak supervisions for training the 2D image instance segmentation models.

Autonomous Driving Instance Segmentation +4

SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud

1 code implementation6 Dec 2022 Yan Wang, Junbo Yin, Wei Li, Pascal Frossard, Ruigang Yang, Jianbing Shen

However, these UDA solutions just yield unsatisfactory 3D detection results when there is a severe domain shift, e. g., from Waymo (64-beam) to nuScenes (32-beam).

3D Object Detection Autonomous Driving +4

ProposalContrast: Unsupervised Pre-training for LiDAR-based 3D Object Detection

1 code implementation26 Jul 2022 Junbo Yin, Dingfu Zhou, Liangjun Zhang, Jin Fang, Cheng-Zhong Xu, Jianbing Shen, Wenguan Wang

Existing approaches for unsupervised point cloud pre-training are constrained to either scene-level or point/voxel-level instance discrimination.

3D Object Detection object-detection +2

Semi-supervised 3D Object Detection with Proficient Teachers

1 code implementation26 Jul 2022 Junbo Yin, Jin Fang, Dingfu Zhou, Liangjun Zhang, Cheng-Zhong Xu, Jianbing Shen, Wenguan Wang

To reduce the dependence on large supervision, semi-supervised learning (SSL) based approaches have been proposed.

3D Object Detection Autonomous Driving +2

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation

1 code implementation CVPR 2022 Hanqing Wang, Wei Liang, Jianbing Shen, Luc van Gool, Wenguan Wang

Since the rise of vision-language navigation (VLN), great progress has been made in instruction following -- building a follower to navigate environments under the guidance of instructions.

Data Augmentation Instruction Following +2

Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation

1 code implementation CVPR 2022 Zhiyuan Liang, Tiancai Wang, Xiangyu Zhang, Jian Sun, Jianbing Shen

The tree energy loss is effective and easy to be incorporated into existing frameworks by combining it with a traditional segmentation loss.

Semantic Segmentation

Consistency and Diversity induced Human Motion Segmentation

no code implementations10 Feb 2022 Tao Zhou, Huazhu Fu, Chen Gong, Ling Shao, Fatih Porikli, Haibin Ling, Jianbing Shen

Besides, a novel constraint based on the Hilbert Schmidt Independence Criterion (HSIC) is introduced to ensure the diversity of multi-level subspace representations, which enables the complementarity of multi-level representations to be explored to boost the transfer learning performance.

Motion Segmentation Transfer Learning

Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation

no code implementations CVPR 2022 Dongming Wu, Xingping Dong, Ling Shao, Jianbing Shen

To address this, we propose a novel multi-level representation learning approach, which explores the inherent structure of the video content to provide a set of discriminative visual embedding, enabling more effective vision-language semantic alignment.

Referring Expression Segmentation Referring Video Object Segmentation +4

A Graph Matching Perspective With Transformers on Video Instance Segmentation

no code implementations CVPR 2022 Zheyun Qin, Xiankai Lu, Xiushan Nie, Yilong Yin, Jianbing Shen

Video Instance Segmentation (VIS) needs to automatically track and segment multiple objects in videos that rely on modeling the spatial-temporal interactions of the instances.

Association Graph Matching +3

Full-Duplex Strategy for Video Object Segmentation

1 code implementation ICCV 2021 Ge-Peng Ji, Deng-Ping Fan, Keren Fu, Zhe Wu, Jianbing Shen, Ling Shao

Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues.

Salient Object Detection Semantic Segmentation +4

Video Object Segmentation Using Global and Instance Embedding Learning

no code implementations CVPR 2021 Wenbin Ge, Xiankai Lu, Jianbing Shen

In this paper, we propose a feature embedding based video object segmentation (VOS) method which is simple, fast and effective.

Semantic Segmentation Video Object Segmentation +1

Person Re-Identification by Context-aware Part Attention and Multi-Head Collaborative Learning

no code implementations IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2021 Dongming Wu, Mang Ye, Gaojie Lin, Xin Gao, Jianbing Shen

In addition, we propose a novel multi-head collaborative training scheme to improve the performance, which is collaboratively supervised by multiple heads with the same structure but different parameters.

Video-Based Person Re-Identification

Face Forensics in the Wild

1 code implementation CVPR 2021 Tianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen

On existing public benchmarks, face forgery detection techniques have achieved great success.

Multiple Instance Learning

Structured Scene Memory for Vision-Language Navigation

1 code implementation CVPR 2021 Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen

Recently, numerous algorithms have been developed to tackle the problem of vision-language navigation (VLN), i. e., entailing an agent to navigate 3D environments through following linguistic instructions.

Decision Making Navigate +1

Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

1 code implementation CVPR 2021 Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

We propose a learnable module, called the asymmetric convolution (ACM), which learns to better capture the semantic correlation information in offline training on large-scale data.

Visual Object Tracking Visual Tracking

Siamese Network for RGB-D Salient Object Detection and Beyond

2 code implementations26 Aug 2020 Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao, Jianbing Shen, Ce Zhu

Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture.

object-detection RGB-D Salient Object Detection +2

RGB-D Salient Object Detection: A Survey

9 code implementations1 Aug 2020 Tao Zhou, Deng-Ping Fan, Ming-Ming Cheng, Jianbing Shen, Ling Shao

Further, considering that the light field can also provide depth maps, we review SOD models and popular benchmark datasets from this domain as well.

object-detection RGB-D Salient Object Detection +2

Weakly Supervised 3D Object Detection from Lidar Point Cloud

1 code implementation ECCV 2020 Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc van Gool, Dengxin Dai

This work proposes a weakly supervised approach for 3D object detection, only requiring a small set of weakly annotated scenes, associated with a few precisely labeled object instances.

3D Object Detection object-detection

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

3 code implementations ECCV 2020 Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo

In this paper, we propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID.

Person Re-Identification Retrieval

Active Visual Information Gathering for Vision-Language Navigation

1 code implementation ECCV 2020 Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen

Vision-language navigation (VLN) is the task of entailing an agent to carry out navigational instructions inside photo-realistic environments.

Vision-Language Navigation

Video Object Segmentation with Episodic Graph Memory Networks

1 code implementation ECCV 2020 Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc van Gool

How to make a segmentation model efficiently adapt to a specific video and to online target appearance variations are fundamentally crucial issues in the field of video object segmentation.

Semantic Segmentation Video Object Segmentation +2

Re-thinking Co-Salient Object Detection

2 code implementations7 Jul 2020 Deng-Ping Fan, Tengpeng Li, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Ming-Ming Cheng, Huazhu Fu, Jianbing Shen

CoSOD is an emerging and rapidly growing extension of salient object detection (SOD), which aims to detect the co-occurring salient objects in a group of images.

Benchmarking Co-Salient Object Detection +2

PraNet: Parallel Reverse Attention Network for Polyp Segmentation

4 code implementations13 Jun 2020 Deng-Ping Fan, Ge-Peng Ji, Tao Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, Ling Shao

To address these challenges, we propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.

Ranked #4 on Camouflaged Object Segmentation on CAMO (using extra training data)

Camouflaged Object Segmentation Camouflage Segmentation +1

M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients

no code implementations1 Jun 2020 Tao Zhou, Huazhu Fu, Yu Zhang, Changqing Zhang, Xiankai Lu, Jianbing Shen, Ling Shao

Then, we use a modality-specific network to extract implicit and high-level features from different MR scans.

Modeling and Enhancing Low-quality Retinal Fundus Images

1 code implementation12 May 2020 Ziyi Shen, Huazhu Fu, Jianbing Shen, Ling Shao

Retinal fundus images are widely used for the clinical screening and diagnosis of eye diseases.

Image Enhancement Retinal Vessel Segmentation

Self-Learning with Rectification Strategy for Human Parsing

no code implementations CVPR 2020 Tao Li, Zhiyuan Liang, Sanyuan Zhao, Jiahao Gong, Jianbing Shen

For the global error, we first transform category-wise features into a high-level graph model with coarse-grained structural information, and then decouple the high-level graph to reconstruct the category features.

Human Parsing Self-Learning

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

1 code implementation CVPR 2020 Junbo Yin, Wenguan Wang, Qinghao Meng, Ruigang Yang, Jianbing Shen

In this paper, we propose a novel MOT framework that unifies object motion and affinity model into a single network, named UMA, in order to learn a compact feature that is discriminative for both object motion and affinity measure.

Metric Learning Multi-Object Tracking +2

Hierarchical Human Parsing with Typed Part-Relation Reasoning

1 code implementation CVPR 2020 Wenguan Wang, Hailong Zhu, Jifeng Dai, Yanwei Pang, Jianbing Shen, Ling Shao

As human bodies are underlying hierarchically structured, how to model human structures is the central theme in this task.

Human Parsing

Learning Video Object Segmentation from Unlabeled Videos

1 code implementation CVPR 2020 Xiankai Lu, Wenguan Wang, Jianbing Shen, Yu-Wing Tai, David Crandall, Steven C. H. Hoi

We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data.

Representation Learning Semantic Segmentation +4

Cascaded Human-Object Interaction Recognition

1 code implementation CVPR 2020 Tianfei Zhou, Wenguan Wang, Siyuan Qi, Haibin Ling, Jianbing Shen

The interaction recognition network has two crucial parts: a relation ranking module for high-quality HOI proposal selection and a triple-stream classifier for relation prediction.

Human-Object Interaction Detection

Infinitely Wide Graph Convolutional Networks: Semi-supervised Learning via Gaussian Processes

no code implementations26 Feb 2020 Jilin Hu, Jianbing Shen, Bin Yang, Ling Shao

Graph convolutional neural networks~(GCNs) have recently demonstrated promising results on graph-based semi-supervised classification, but little work has been done to explore their theoretical properties.

Gaussian Processes General Classification

Hi-Net: Hybrid-fusion Network for Multi-modal MR Image Synthesis

2 code implementations11 Feb 2020 Tao Zhou, Huazhu Fu, Geng Chen, Jianbing Shen, Ling Shao

Medical image synthesis has been proposed as an effective solution to this, where any missing modalities are synthesized from the existing ones.

Image Generation

Learning Compositional Neural Information Fusion for Human Parsing

1 code implementation ICCV 2019 Wenguan Wang, Zhijie Zhang, Siyuan Qi, Jianbing Shen, Yanwei Pang, Ling Shao

The bottom-up and top-down inferences explicitly model the compositional and decompositional relations in human bodies, respectively.

Human Parsing

Human-Aware Motion Deblurring

1 code implementation ICCV 2019 Ziyi Shen, Wenguan Wang, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao

This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG).

Deblurring Image Deblurring

Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks

1 code implementation ICCV 2019 Wenguan Wang, Xiankai Lu, Jianbing Shen, David Crandall, Ling Shao

Through parametric message passing, AGNN is able to efficiently capture and mine much richer and higher-order relations between video frames, thus enabling a more complete understanding of video content and more accurate foreground estimation.

Semantic Segmentation Unsupervised Video Object Segmentation +3

NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

no code implementations CVPR 2020 Yazhao Li, Yanwei Pang, Jianbing Shen, Jiale Cao, Ling Shao

With this observation, we propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.

object-detection Object Detection

Deep Learning for Person Re-identification: A Survey and Outlook

4 code implementations13 Jan 2020 Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, Steven C. H. Hoi

The widely studied closed-world setting is usually applied under various research-oriented assumptions, and has achieved inspiring success using deep learning techniques on a number of datasets.

Cross-Modal Person Re-Identification Metric Learning +2

Distilled Siamese Networks for Visual Tracking

no code implementations24 Jul 2019 Jianbing Shen, Yuanpei Liu, Xingping Dong, Xiankai Lu, Fahad Shahbaz Khan, Steven Hoi

This model is intuitively inspired by the one teacher vs. multiple students learning method typically employed in schools.

Knowledge Distillation Object Tracking +1

Evaluation of Retinal Image Quality Assessment Networks in Different Color-spaces

2 code implementations10 Jul 2019 Huazhu Fu, Boyang Wang, Jianbing Shen, Shanshan Cui, Yanwu Xu, Jiang Liu, Ling Shao

Retinal image quality assessment (RIQA) is essential for controlling the quality of retinal imaging and guaranteeing the reliability of diagnoses by ophthalmologists or automated analysis systems.

Image Quality Assessment

Understanding More about Human and Machine Attention in Deep Neural Networks

no code implementations20 Jun 2019 Qiuxia Lai, Salman Khan, Yongwei Nie, Jianbing Shen, Hanqiu Sun, Ling Shao

With three example computer vision tasks, diverse representative backbones, and famous architectures, corresponding real human gaze data, and systematically conducted large-scale quantitative studies, we quantify the consistency between artificial attention and human visual attention and offer novel insights into existing artificial attention mechanisms by giving preliminary answers to several key questions related to human and artificial attention mechanisms.

Fine-Grained Image Classification Semantic Segmentation +1

Extreme Points Derived Confidence Map as a Cue For Class-Agnostic Segmentation Using Deep Neural Network

1 code implementation6 Jun 2019 Shadab Khan, Ahmed H. Shahin, Javier Villafruela, Jianbing Shen, Ling Shao

To automate the process of segmenting an anatomy of interest, we can learn a model from previously annotated data.

Anatomy

Salient Object Detection in the Deep Learning Era: An In-Depth Survey

1 code implementation19 Apr 2019 Wenguan Wang, Qiuxia Lai, Huazhu Fu, Jianbing Shen, Haibin Ling, Ruigang Yang

As an essential problem in computer vision, salient object detection (SOD) has attracted an increasing amount of research attention over the years.

object-detection RGB Salient Object Detection +2

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks

1 code implementation ICCV 2019 Aamir Mustafa, Salman Khan, Munawar Hayat, Roland Goecke, Jianbing Shen, Ling Shao

Deep neural networks are vulnerable to adversarial attacks, which can fool them by adding minuscule perturbations to the input images.

Adversarial Defense

Striking the Right Balance with Uncertainty

no code implementations CVPR 2019 Salman Khan, Munawar Hayat, Waqas Zamir, Jianbing Shen, Ling Shao

Rare classes tend to get a concentrated representation in the classification space which hampers the generalization of learned boundaries to new test examples.

Classification Face Verification +2

Image Super-Resolution as a Defense Against Adversarial Attacks

1 code implementation7 Jan 2019 Aamir Mustafa, Salman H. Khan, Munawar Hayat, Jianbing Shen, Ling Shao

The proposed scheme is simple and has the following advantages: (1) it does not require any model training or parameter optimization, (2) it complements other existing defense mechanisms, (3) it is agnostic to the attacked model and attack type and (4) it provides superior performance across all popular attack algorithms.

Adversarial Defense Image Enhancement +2

Triplet Loss in Siamese Network for Object Tracking

1 code implementation ECCV 2018 Xingping Dong, Jianbing Shen

In this paper, a novel triplet loss is proposed to extract expressive deep feature for object tracking by adding it into Siamese network framework instead of pairwise loss for training.

Object Tracking

Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection

1 code implementation ECCV 2018 Hongmei Song, Wenguan Wang, Sanyuan Zhao, Jianbing Shen, Kin-Man Lam

This paper proposes a fast video salient object detection model, based on a novel recurrent network architecture, named Pyramid Dilated Bidirectional ConvLSTM (PDB-ConvLSTM).

 Ranked #1 on Video Salient Object Detection on UVSD (using extra training data)

object-detection Salient Object Detection +4

Learning Human-Object Interactions by Graph Parsing Neural Networks

1 code implementation ECCV 2018 Siyuan Qi, Wenguan Wang, Baoxiong Jia, Jianbing Shen, Song-Chun Zhu

For a given scene, GPNN infers a parse graph that includes i) the HOI graph structure represented by an adjacency matrix, and ii) the node labels.

Human-Object Interaction Detection

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification

1 code implementation CVPR 2018 Wenguan Wang, Yuanlu Xu, Jianbing Shen, Song-Chun Zhu

This paper proposes a knowledge-guided fashion network to solve the problem of visual fashion analysis, e. g., fashion landmark localization and clothing category classification.

General Classification

Salient Object Detection Driven by Fixation Prediction

1 code implementation CVPR 2018 Wenguan Wang, Jianbing Shen, Xingping Dong, Ali Borji

Salient object detection is then viewed as fine-grained object-level saliency segmentation and is progressively optimized with the guidance of the fixation map in a top-down manner.

object-detection RGB Salient Object Detection +1

Revisiting Video Saliency: A Large-scale Benchmark and a New Model

1 code implementation CVPR 2018 Wenguan Wang, Jianbing Shen, Fang Guo, Ming-Ming Cheng, Ali Borji

Existing video saliency datasets lack variety and generality of common dynamic scenes and fall short in covering challenging situations in unconstrained environments.

Deep Cropping via Attention Box Prediction and Aesthetics Assessment

no code implementations ICCV 2017 Wenguan Wang, Jianbing Shen

We model the photo cropping problem as a cascade of attention box regression and aesthetic quality classification, based on deep learning.

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

no code implementations28 Jul 2017 Weilin Cong, Sanyuan Zhao, Hui Tian, Jianbing Shen

Real-world face detection and alignment demand an advanced discriminative model to address challenges by pose, lighting and expression.

Face Detection

Quadruplet Network with One-Shot Learning for Fast Visual Object Tracking

no code implementations19 May 2017 Xingping Dong, Jianbing Shen, Dongming Wu, Kan Guo, Xiaogang Jin, Fatih Porikli

In this paper, we propose a new quadruplet deep network to examine the potential connections among the training instances, aiming to achieve a more powerful representation.

One-Shot Learning Visual Object Tracking

Deep Visual Attention Prediction

1 code implementation journal 2017 Wenguan Wang, Jianbing Shen

Our model is based on a skip-layer network structure, which predicts human attention from multiple convolutional layers with various reception fields.

Saliency Prediction

Super-Trajectory for Video Segmentation

no code implementations ICCV 2017 Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli

We introduce a novel semi-supervised video segmentation approach based on an efficient video representation, called as "super-trajectory".

Video Segmentation Video Semantic Segmentation

Linearization to Nonlinear Learning for Visual Tracking

no code implementations ICCV 2015 Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli

Building on the theory of globally linear approximations to nonlinear functions, we introduce an elegant method that jointly learns a nonlinear classifier and a visual dictionary for tracking objects in a semi-supervised sparse coding fashion.

Dictionary Learning Visual Tracking

Saliency-Aware Geodesic Video Object Segmentation

1 code implementation CVPR 2015 Wenguan Wang, Jianbing Shen, Fatih Porikli

Building on the observation that foreground areas are surrounded by the regions with high spatiotemporal edge values, geodesic distance provides an initial estimation for foreground and background.

Ranked #5 on Video Salient Object Detection on DAVSOD-Difficult20 (using extra training data)

Semantic Segmentation Video Salient Object Detection +1

Lazy Random Walks for Superpixel Segmentation

1 code implementation IEEE Trans. on Image Processing 2014 Jianbing Shen, Yunfan Du, Wenguan Wang, Xuelong. Li

Then, the boundaries of initial superpixels are obtained according to the probabilities and the commute time.

Superpixels

Cannot find the paper you are looking for? You can Submit a new open access paper.