Search Results for author: Jianbin Jiao

Found 41 papers, 30 papers with code

Conformer: Local Features Coupling Global Representations for Visual Recognition

4 code implementations • ICCV 2021 • Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, YaoWei Wang, Jianbin Jiao, Qixiang Ye

Within Convolutional Neural Network (CNN), the convolution operations are good at extracting local features but experience difficulty to capture global representations.

Ranked #322 on Image Classification on ImageNet

Image Classification Instance Segmentation +4

3,140

Paper
Code

Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification

2 code implementations • CVPR 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao

To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image.

Ranked #3 on Unsupervised Person Re-Identification on MSMT17->DukeMTMC-reID

Generative Adversarial Network Person Re-Identification +2

3,128

Paper
Code

Beyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection

2 code implementations • CVPR 2021 • Zonghao Guo, Chang Liu, Xiaosong Zhang, Jianbin Jiao, Xiangyang Ji, Qixiang Ye

Detecting oriented and densely packed objects remains challenging for spatial feature aliasing caused by the intersection of reception fields between objects.

Ranked #34 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images

1,719

Paper
Code

Spatial Self-Distillation for Object Detection with Inaccurate Bounding Boxes

1 code implementation • ICCV 2023 • Di wu, Pengfei Chen, Xuehui Yu, Guorong Li, Zhenjun Han, Jianbin Jiao

Object detection via inaccurate bounding boxes supervision has boosted a broad interest due to the expensive high-quality annotation data or the occasional inevitability of low annotation quality (\eg tiny objects).

Multiple Instance Learning Object +2

632

Paper
Code

CPR++: Object Localization via Single Coarse Point Supervision

2 code implementations • 30 Jan 2024 • Xuehui Yu, Pengfei Chen, Kuiran Wang, Xumeng Han, Guorong Li, Zhenjun Han, Qixiang Ye, Jianbin Jiao

CPR reduces the semantic variance by selecting a semantic centre point in a neighbourhood region to replace the initial annotated point.

Object Object Localization

632

Paper
Code

Weakly Supervised Instance Segmentation using Class Peak Response

1 code implementation • CVPR 2018 • Yanzhao Zhou, Yi Zhu, Qixiang Ye, Qiang Qiu, Jianbin Jiao

Motivated by this, we first design a process to stimulate peaks to emerge from a class response map.

Ranked #11 on Image-level Supervised Instance Segmentation on PASCAL VOC 2012 val (using extra training data)

General Classification Image-level Supervised Instance Segmentation +3

345

Paper
Code

Oriented Response Networks

1 code implementation • CVPR 2017 • Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao

DCNNs using ARFs, referred to as Oriented Response Networks (ORNs), can produce within-class rotation-invariant deep features while maintaining inter-class discrimination for classification tasks.

Ranked #83 on Image Classification on CIFAR-100 (using extra training data)

General Classification Image Classification

224

Paper
Code

Soft Proposal Networks for Weakly Supervised Object Localization

1 code implementation • ICCV 2017 • Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao

Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training.

Ranked #2 on Weakly Supervised Object Detection on MS COCO

Object Weakly Supervised Object Detection +1

210

Paper
Code

Prototype Mixture Models for Few-shot Semantic Segmentation

1 code implementation • ECCV 2020 • Boyu Yang, Chang Liu, Bohao Li, Jianbin Jiao, Qixiang Ye

Few-shot segmentation is challenging because objects within the support and query images could significantly differ in appearance and pose.

Ranked #4 on Few-Shot Semantic Segmentation on PASCAL-5i (10-Shot)

Few-Shot Semantic Segmentation Segmentation +1

163

Paper
Code

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration

1 code implementation • CVPR 2023 • Yunjie Tian, Lingxi Xie, Jihao Qiu, Jianbin Jiao, YaoWei Wang, Qi Tian, Qixiang Ye

iTPN is born with two elaborated designs: 1) The first pre-trained feature pyramid upon vision transformer (ViT).

object-detection Object Detection +1

149

Paper
Code

SIXray : A Large-scale Security Inspection X-ray Benchmark for Prohibited Item Discovery in Overlapping Images

1 code implementation • 2 Jan 2019 • Caijing Miao, Lingxi Xie, Fang Wan, Chi Su, Hongye Liu, Jianbin Jiao, Qixiang Ye

In particular, the advantage of CHR is more significant in the scenarios with fewer positive training samples, which demonstrates its potential application in real-world security inspection.

Object Localization

116

Paper
Code

Adaptive Linear Span Network for Object Skeleton Detection

1 code implementation • 8 Nov 2020 • Chang Liu, Yunjie Tian, Jianbin Jiao, Qixiang Ye

Conventional networks for object skeleton detection are usually hand-crafted.

Edge Detection Neural Architecture Search +2

116

Paper
Code

Generic-to-Specific Distillation of Masked Autoencoders

1 code implementation • CVPR 2023 • Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye

Lightweight ViT models limited by the model capacity, however, benefit little from those pre-training mechanisms.

Image Classification Knowledge Distillation +3

Paper
Code

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild

1 code implementation • CVPR 2017 • Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye

By stacking RUs in a deep-to-shallow manner, SRN exploits the 'flow' of errors among multiple scales to ease the problems of fitting complex outputs with limited layers, suppressing the complex backgrounds, and effectively matching object symmetry of different scales.

Object Symmetry Detection

Paper
Code

SRN: Side-output Residual Network for Object Reflection Symmetry Detection and Beyond

1 code implementation • 17 Jul 2018 • Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye

The end-to-end deep learning approach, referred to as a side-output residual network (SRN), leverages the output residual units (RUs) to fit the errors between the object ground-truth symmetry and the side-outputs of multiple stages.

Edge Detection Hand Pose Estimation +2

Paper
Code

C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection

1 code implementation • CVPR 2019 • Fang Wan, Chang Liu, Wei Ke, Xiangyang Ji, Jianbin Jiao, Qixiang Ye

Weakly supervised object detection (WSOD) is a challenging task when provided with image category supervision but required to simultaneously learn object locations and object detectors.

Ranked #9 on Weakly Supervised Object Detection on PASCAL VOC 2007

Multiple Instance Learning Object +3

Paper
Code

GraFormer: Graph Convolution Transformer for 3D Pose Estimation

1 code implementation • 17 Sep 2021 • Weixi Zhao, Yunjie Tian, Qixiang Ye, Jianbin Jiao, Weiqiang Wang

Exploiting relations among 2D joints plays a crucial role yet remains semi-developed in 2D-to-3D pose estimation.

3D Pose Estimation Implicit Relations

Paper
Code

ChatterBox: Multi-round Multimodal Referring and Grounding

1 code implementation • 24 Jan 2024 • Yunjie Tian, Tianren Ma, Lingxi Xie, Jihao Qiu, Xi Tang, Yuan Zhang, Jianbin Jiao, Qi Tian, Qixiang Ye

In this study, we establish a baseline for a new task named multimodal multi-round referring and grounding (MRG), opening up a promising direction for instance-level multimodal dialogues.

Language Modelling Visual Grounding

Paper
Code

Min-Entropy Latent Model for Weakly Supervised Object Detection

1 code implementation • CVPR 2018 • Fang Wan, Pengxu Wei, Zhenjun Han, Jianbin Jiao, Qixiang Ye

Weakly supervised object detection is a challenging task when provided with image category supervision but required to learn, at the same time, object locations and object detectors.

Ranked #19 on Weakly Supervised Object Detection on PASCAL VOC 2012 test

Image Classification Object +3

Paper
Code

Semantic-Aware Generation for Self-Supervised Visual Representation Learning

1 code implementation • 25 Nov 2021 • Yunjie Tian, Lingxi Xie, Xiaopeng Zhang, Jiemin Fang, Haohang Xu, Wei Huang, Jianbin Jiao, Qi Tian, Qixiang Ye

In this paper, we propose a self-supervised visual representation learning approach which involves both generative and discriminative proxies, where we focus on the former part by requiring the target network to recover the original image based on the mid-level features.

Ranked #63 on Semantic Segmentation on Cityscapes test

Representation Learning Semantic Segmentation

Paper
Code

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

1 code implementation • 27 Mar 2022 • Yunjie Tian, Lingxi Xie, Jiemin Fang, Mengnan Shi, Junran Peng, Xiaopeng Zhang, Jianbin Jiao, Qi Tian, Qixiang Ye

The past year has witnessed a rapid development of masked image modeling (MIM).

Paper
Code

Rethinking Sampling Strategies for Unsupervised Person Re-identification

2 code implementations • 7 Jul 2021 • Xumeng Han, Xuehui Yu, Guorong Li, Jian Zhao, Gang Pan, Qixiang Ye, Jianbin Jiao, Zhenjun Han

While extensive research has focused on the framework design and loss function, this paper shows that sampling strategy plays an equally important role.

Ranked #6 on Unsupervised Person Re-Identification on DukeMTMC-reID

Pseudo Label Representation Learning +1

Paper
Code

Discretization-Aware Architecture Search

1 code implementation • 7 Jul 2020 • Yunjie Tian, Chang Liu, Lingxi Xie, Jianbin Jiao, Qixiang Ye

The search cost of neural architecture search (NAS) has been largely reduced by weight-sharing methods.

Image Classification Neural Architecture Search

Paper
Code

Vision-Dialog Navigation by Exploring Cross-modal Memory

1 code implementation • CVPR 2020 • Yi Zhu, Fengda Zhu, Zhaohuan Zhan, Bingqian Lin, Jianbin Jiao, Xiaojun Chang, Xiaodan Liang

Benefiting from the collaborative learning of the L-mem and the V-mem, our CMN is able to explore the memory about the decision making of historical navigation actions which is for the current step.

Decision Making

Paper
Code

Anti-aliasing Semantic Reconstruction for Few-Shot Semantic Segmentation

1 code implementation • CVPR 2021 • Binghao Liu, Yao Ding, Jianbin Jiao, Xiangyang Ji, Qixiang Ye

Encouraging progress in few-shot semantic segmentation has been made by leveraging features learned upon base classes with sufficient training data to represent novel classes with few-shot examples.

Ranked #68 on Few-Shot Semantic Segmentation on COCO-20i (1-shot)

Few-Shot Semantic Segmentation Segmentation +1

Paper
Code

Harmonic Feature Activation for Few-Shot Semantic Segmentation

1 code implementation • IEEE Transactions on Image Processing 2021 • Binghao Liu, Jianbin Jiao, Qixiang Ye

HFA is formulated as a bilinear model, which takes charge of the pixel-wise dense correlation (bilinear feature activation) between query and support images in a systematic way.

Few-Shot Semantic Segmentation Segmentation +1

Paper
Code

Long-tailed Distribution Adaptation

1 code implementation • 6 Oct 2021 • Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye

We propose to jointly optimize empirical risks of the unbalanced and balanced domains and approximate their domain divergence by intra-class and inter-class distances, with the aim to adapt models trained on the long-tailed distribution to general distributions in an interpretable way.

Domain Adaptation Instance Segmentation +3

Paper
Code

Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers

1 code implementation • 30 Jan 2024 • Jianbin Jiao, Xina Cheng, WeiJie Chen, Xiaoting Yin, Hao Shi, Kailun Yang

Due to the challenges in data collection, mainstream datasets of 3D human pose estimation are primarily composed of multi-view video data collected in laboratory environments, which contains rich spatial-temporal correlation information besides the image frame content.

3D Human Pose Estimation Scene Understanding

Paper
Code

BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning

1 code implementation • 19 Dec 2023 • Jing Cui, Yufei Han, Yuzhe ma, Jianbin Jiao, Junge Zhang

Our algorithm, BadRL, strategically chooses state observations with high attack values to inject triggers during training and testing, thereby reducing the chances of detection.

Backdoor Attack reinforcement-learning +1

Paper
Code

Feature-Gate Coupling for Dynamic Network Pruning

1 code implementation • 29 Nov 2021 • Mengnan Shi, Chang Liu, Qixiang Ye, Jianbin Jiao

Gating modules have been widely explored in dynamic network pruning to reduce the run-time computational cost of deep neural networks while preserving the representation of features.

Contrastive Learning Network Pruning

Paper
Code

A scalable convolutional neural network for task-specified scenarios via knowledge distillation

no code implementations • 19 Sep 2016 • Mengnan Shi, Fei Qin, Qixiang Ye, Zhenjun Han, Jianbin Jiao

In this paper, we explore the redundancy in convolutional neural network, which scales with the complexity of vision tasks.

Knowledge Distillation

Paper
Add Code

Similarity-preserving Image-image Domain Adaptation for Person Re-identification

no code implementations • 26 Nov 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Yi Yang, Jianbin Jiao

It first preserves two types of unsupervised similarity, namely, self-similarity of an image before and after translation, and domain-dissimilarity of a translated source image and a target image.

Domain Adaptation Generative Adversarial Network +2

Paper
Add Code

Domain Alignment with Triplets

no code implementations • 3 Dec 2018 • Weijian Deng, Liang Zheng, Jianbin Jiao

When aligning the distributions in the embedding space, SCA enforces a similarity-preserving constraint to maintain class-level relations among the source and target images, i. e., if a source image and a target image are of the same class label, their corresponding embeddings are supposed to be aligned nearby, and vise versa.

Unsupervised Domain Adaptation

Paper
Add Code

Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation

no code implementations • ICCV 2021 • Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao

Vision-Dialog Navigation (VDN) requires an agent to ask questions and navigate following the human responses to find target objects.

Imitation Learning Navigate

Paper
Add Code

Exploring Complicated Search Spaces with Interleaving-Free Sampling

no code implementations • 5 Dec 2021 • Yunjie Tian, Lingxi Xie, Jiemin Fang, Jianbin Jiao, Qixiang Ye, Qi Tian

In this paper, we build the search algorithm upon a complicated search space with long-distance connections, and show that existing weight-sharing search algorithms mostly fail due to the existence of \textbf{interleaved connections}.

Neural Architecture Search

Paper
Add Code

P2P-Loc: Point to Point Tiny Person Localization

no code implementations • 31 Dec 2021 • Xuehui Yu, Di wu, Qixiang Ye, Jianbin Jiao, Zhenjun Han

As a result, we propose a point self-refinement approach that iteratively updates point annotations in a self-paced way.

Object Object Localization

Paper
Add Code

DQnet: Cross-Model Detail Querying for Camouflaged Object Detection

no code implementations • 16 Dec 2022 • Wei Sun, Chengao Liu, Linyan Zhang, Yu Li, Pengxu Wei, Chang Liu, Jialing Zou, Jianbin Jiao, Qixiang Ye

Optimizing a convolutional neural network (CNN) for camouflaged object detection (COD) tends to activate local discriminative regions while ignoring complete object extent, causing the partial activation issue which inevitably leads to missing or redundant regions of objects.

Object object-detection +2

Paper
Add Code

P2RBox: A Single Point is All You Need for Oriented Object Detection

no code implementations • 22 Nov 2023 • Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han

In this study, we introduce the P2RBox network, which leverages point annotations and a mask generator to create mask proposals, followed by filtration through our Inspector Module and Constrainer Module.

Object object-detection +2

Paper
Add Code

Semantic-aware SAM for Point-Prompted Instance Segmentation

no code implementations • 26 Dec 2023 • Zhaoyang Wei, Pengfei Chen, Xuehui Yu, Guorong Li, Jianbin Jiao, Zhenjun Han

In this paper, we introduce a cost-effective category-specific segmenter using SAM.

Instance Segmentation Multiple Instance Learning +3

Paper
Add Code

Self-supervised Pretraining for Decision Foundation Model: Formulation, Pipeline and Challenges

no code implementations • 29 Dec 2023 • Xiaoqian Liu, Jianbin Jiao, Junge Zhang

Decision-making is a dynamic process requiring perception, memory, and reasoning to make choices and find optimal policies.

Decision Making Few-Shot Learning

Paper
Add Code

P2Seg: Pointly-supervised Segmentation via Mutual Distillation

no code implementations • 18 Jan 2024 • Zipeng Wang, Xuehui Yu, Xumeng Han, Wenwen Yu, Zhixun Huang, Jianbin Jiao, Zhenjun Han

Nevertheless, weakly supervised semantic segmentation methods are proficient in utilizing intra-class feature consistency to capture the boundary contours of the same semantic regions.

Box-supervised Instance Segmentation Segmentation +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.