Search Results for author: Xiaopeng Zhang

Found 94 papers, 42 papers with code

Don’t Miss the Potential Customers! Retrieving Similar Ads to Improve User Targeting

no code implementations • Findings (EMNLP) 2021 • Yi Feng, Ting Wang, Chuanyi Li, Vincent Ng, Jidong Ge, Bin Luo, Yucheng Hu, Xiaopeng Zhang

User targeting is an essential task in the modern advertising industry: given a package of ads for a particular category of products (e. g., green tea), identify the online users to whom the ad package should be targeted.

Paper
Add Code

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation

no code implementations • 8 Apr 2024 • Jiannan Ge, Lingxi Xie, Hongtao Xie, Pandeng Li, Xiaopeng Zhang, Yongdong Zhang, Qi Tian

(1) Mutually-Refined Proposal Extraction.

Image Segmentation Segmentation +3

Paper
Add Code

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

1 code implementation • 15 Feb 2024 • Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Then we construct a Gaussian repair model based on diffusion models to supplement the omitted object information, where Gaussians are further refined.

Neural Rendering Object

610

Paper
Code

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

no code implementations • 12 Jan 2024 • Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang

Vision-language foundation models, represented by Contrastive language-image pre-training (CLIP), have gained increasing attention for jointly understanding both vision and textual tasks.

Panoptic Segmentation Retrieval +1

Paper
Add Code

DeLR: Active Learning for Detection with Decoupled Localization and Recognition Query

no code implementations • 28 Dec 2023 • Yuhang Zhang, Yuang Deng, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Qi Tian

In DeLR, the query is based on region-level, and we only annotate the object region that is queried; 2) Instead of directly providing both localization and recognition annotations, we separately query the two components, and thus reduce the recognition budget with the pseudo class labels provided by the model.

Active Learning Object +2

Paper
Add Code

Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation

1 code implementation • 20 Dec 2023 • Wenhao Xu, Rongtao Xu, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

Recently, CLIP has found practical utility in the domain of pixel-level zero-shot segmentation tasks.

Semantic Segmentation Zero Shot Segmentation +1

Paper
Code

Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views

no code implementations • 7 Dec 2023 • Yabo Chen, Jiemin Fang, YuYang Huang, Taoran Yi, Xiaopeng Zhang, Lingxi Xie, Xinggang Wang, Wenrui Dai, Hongkai Xiong, Qi Tian

We propose a cascade generation framework constructed with two Zero-1-to-3 models, named Cascade-Zero123, to tackle this issue, which progressively extracts 3D information from the source image.

Transparent objects

Paper
Add Code

Segment Any 3D Gaussians

no code implementations • 1 Dec 2023 • Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Interactive 3D segmentation in radiance fields is an appealing task since its importance in 3D scene understanding and manipulation.

Interactive Segmentation Scene Understanding +1

Paper
Add Code

GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

no code implementations • 27 Nov 2023 • Jiemin Fang, Junjie Wang, Xiaopeng Zhang, Lingxi Xie, Qi Tian

Specifically, we first extract the region of interest (RoI) corresponding to the text instruction, aligning it to 3D Gaussians.

3D scene Editing

Paper
Add Code

AiluRus: A Scalable ViT Framework for Dense Prediction

1 code implementation • NeurIPS 2023 • Jin Li, Yaoming Wang, Xiaopeng Zhang, Bowen Shi, Dongsheng Jiang, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian

Specifically, at the intermediate layer of the ViT, we utilize a spatial-aware density-based clustering algorithm to select representative tokens from the token sequence.

object-detection Object Detection +1

Paper
Code

From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

1 code implementation • 13 Oct 2023 • Dongsheng Jiang, Yuchen Liu, Songlin Liu, Jin'e Zhao, Hao Zhang, Zhen Gao, Xiaopeng Zhang, Jin Li, Hongkai Xiong

By simply equipping it with an MLP layer for alignment, DINO surpasses CLIP in fine-grained related perception tasks.

Hallucination Image Captioning +3

174

Paper
Code

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

1 code implementation • 12 Oct 2023 • Guanjun Wu, Taoran Yi, Jiemin Fang, Lingxi Xie, Xiaopeng Zhang, Wei Wei, Wenyu Liu, Qi Tian, Xinggang Wang

Representing and rendering dynamic scenes has been an important but challenging task.

1,644

Paper
Code

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

1 code implementation • 12 Oct 2023 • Taoran Yi, Jiemin Fang, Junjie Wang, Guanjun Wu, Lingxi Xie, Xiaopeng Zhang, Wenyu Liu, Qi Tian, Xinggang Wang

In recent times, the generation of 3D assets from text prompts has shown impressive results.

Text to 3D

515

Paper
Code

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

1 code implementation • 26 Sep 2023 • Yuhui Xu, Lingxi Xie, Xiaotao Gu, Xin Chen, Heng Chang, Hengheng Zhang, Zhengsu Chen, Xiaopeng Zhang, Qi Tian

Recently years have witnessed a rapid development of large language models (LLMs).

Quantization

5,881

Paper
Code

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation

1 code implementation • ICCV 2023 • Shuangrui Ding, Peisen Zhao, Xiaopeng Zhang, Rui Qian, Hongkai Xiong, Qi Tian

Based on the STA score, we are able to progressively prune the tokens without introducing any additional parameters or requiring further re-training.

Video Recognition

Paper
Code

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

no code implementations • 28 Jun 2023 • Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian

In order to better obtain both discrimination and diversity, we propose a simple but effective Hybrid Distillation strategy, which utilizes both the supervised/CL teacher and the MIM teacher to jointly guide the student model.

Contrastive Learning Representation Learning

Paper
Add Code

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

no code implementations • 14 Jun 2023 • Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Kaifeng Bi, Xiaotao Gu, Jianlong Chang, Qi Tian

In this paper, we start with a conceptual definition of AGI and briefly review how NLP solves a wide range of tasks via a chat system.

Paper
Add Code

ControlVideo: Training-free Controllable Text-to-Video Generation

1 code implementation • 22 May 2023 • Yabo Zhang, Yuxiang Wei, Dongsheng Jiang, Xiaopeng Zhang, WangMeng Zuo, Qi Tian

Text-driven diffusion models have unlocked unprecedented abilities in image generation, whereas their video counterpart still lags behind due to the excessive training cost of temporal modeling.

Image Generation Text-to-Video Generation +1

691

Paper
Code

Segment Anything in 3D with Radiance Fields

1 code implementation • NeurIPS 2023 • Jiazhong Cen, Jiemin Fang, Zanwei Zhou, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

The Segment Anything Model (SAM) emerges as a powerful vision foundation model to generate high-quality 2D segmentation results.

Inverse Rendering Segmentation

784

Paper
Code

Attention Weighted Local Descriptors

1 code implementation • journal 2023 • Changwei Wang, Rongtao Xu, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

Local features detection and description are widely used in many vision applications with high industrial and commercial demands.

3D Reconstruction Homography Estimation +2

Paper
Code

Multi-modal Prompting for Low-Shot Temporal Action Localization

no code implementations • 21 Mar 2023 • Chen Ju, Zeqian Li, Peisen Zhao, Ya zhang, Xiaopeng Zhang, Qi Tian, Yanfeng Wang, Weidi Xie

In this paper, we consider the problem of temporal action localization under low-shot (zero-shot & few-shot) scenario, with the goal of detecting and classifying the action instances from arbitrary categories within some untrimmed videos, even not seen at training time.

Action Classification Temporal Action Localization

Paper
Add Code

SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations

1 code implementation • CVPR 2023 • Pu Li, Jianwei Guo, Xiaopeng Zhang, Dong-Ming Yan

Reverse engineering CAD models from raw geometry is a classic but strenuous research problem.

CAD Reconstruction

Paper
Code

M-Tuning: Prompt Tuning with Mitigated Label Bias in Open-Set Scenarios

no code implementations • 9 Mar 2023 • Ning Liao, Xiaopeng Zhang, Min Cao, Junchi Yan, Qi Tian

In realistic open-set scenarios where labels of a part of testing data are totally unknown, when vision-language (VL) prompt learning methods encounter inputs related to unknown classes (i. e., not seen during training), they always predict them as one of the training classes.

Open Set Learning

Paper
Add Code

Rethinking Visual Prompt Learning as Masked Visual Token Modeling

no code implementations • 9 Mar 2023 • Ning Liao, Bowen Shi, Xiaopeng Zhang, Min Cao, Junchi Yan, Qi Tian

To explore prompt learning on the generative pre-trained visual model, as well as keeping the task consistency, we propose Visual Prompt learning as masked visual Token Modeling (VPTM) to transform the downstream visual classification into the pre-trained masked visual token prediction.

Paper
Add Code

Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation

1 code implementation • 27 Feb 2023 • Rongtao Xu, Changwei Wang, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

In addition, to further improve the segmentation accuracy, we design a Variation-aware Refine Module to enhance the local consistency of pseudo-labels by computing pixel-level variation.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Code

Adapting Shortcut With Normalizing Flow: An Efficient Tuning Framework for Visual Recognition

1 code implementation • CVPR 2023 • Yaoming Wang, Bowen Shi, Xiaopeng Zhang, Jin Li, Yuchen Liu, Wenrui Dai, Chenglin Li, Hongkai Xiong, Qi Tian

To mitigate the computational and storage demands, recent research has explored Parameter-Efficient Fine-Tuning (PEFT), which focuses on tuning a minimal number of parameters for efficient adaptation.

Paper
Code

Treating Pseudo-labels Generation as Image Matting for Weakly Supervised Semantic Segmentation

no code implementations • ICCV 2023 • Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

To solve this problem, we develop a Double Decoupled Class Activation Map (D2CAM) for Mat-Label to generate a high-quality trimap.

Image Matting Metric Learning +2

Paper
Add Code

Feature Calibration Network for Occluded Pedestrian Detection

no code implementations • 12 Dec 2022 • Tianliang Zhang, Qixiang Ye, Baochang Zhang, Jianzhuang Liu, Xiaopeng Zhang, Qi Tian

FC-Net is based on the observation that the visible parts of pedestrians are selective and decisive for detection, and is implemented as a self-paced feature learning framework with a self-activation (SA) module and a feature calibration (FC) module.

Pedestrian Detection

Paper
Add Code

Security Closure of IC Layouts Against Hardware Trojans

no code implementations • 15 Nov 2022 • Fangzhou Wang, Qijing Wang, Bangqi Fu, Shui Jiang, Xiaopeng Zhang, Lilas Alrahis, Ozgur Sinanoglu, Johann Knechtel, Tsung-Yi Ho, Evangeline F. Y. Young

In this work, we proactively and systematically harden the physical layouts of ICs against post-design insertion of Trojans.

Paper
Add Code

Motion-inductive Self-supervised Object Discovery in Videos

no code implementations • 1 Oct 2022 • Shuangrui Ding, Weidi Xie, Yabo Chen, Rui Qian, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

In this paper, we consider the task of unsupervised object discovery in videos.

Ranked #3 on Unsupervised Object Segmentation on DAVIS 2016

Object Object Discovery +5

Paper
Add Code

SdAE: Self-distillated Masked Autoencoder

1 code implementation • 31 Jul 2022 • Yabo Chen, Yuchen Liu, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian

We also analyze how to build good views for the teacher branch to produce latent representation from the perspective of information bottleneck.

Descriptive Self-Supervised Learning

Paper
Code

Visual Recognition by Request

1 code implementation • CVPR 2023 • Chufeng Tang, Lingxi Xie, Xiaopeng Zhang, Xiaolin Hu, Qi Tian

Humans have the ability of recognizing visual semantics in an unlimited granularity, but existing visual recognition algorithms cannot achieve this goal.

Instance Segmentation Semantic Segmentation

Paper
Code

Active Pointly-Supervised Instance Segmentation

1 code implementation • 23 Jul 2022 • Chufeng Tang, Lingxi Xie, Gang Zhang, Xiaopeng Zhang, Qi Tian, Xiaolin Hu

In this paper, we present an economic active learning setting, named active pointly-supervised instance segmentation (APIS), which starts with box-level annotations and iteratively samples a point within the box and asks if it falls on the object.

Active Learning Instance Segmentation +2

Paper
Code

SARNet: Semantic Augmented Registration of Large-Scale Urban Point Clouds

1 code implementation • 27 Jun 2022 • Chao Liu, Jianwei Guo, Dong-Ming Yan, Zhirong Liang, Xiaopeng Zhang, Zhanglin Cheng

Registering urban point clouds is a quite challenging task due to the large-scale, noise and data incompleteness of LiDAR scanning data.

Semantic Segmentation

Paper
Code

Masked Autoencoders are Robust Data Augmentors

1 code implementation • 10 Jun 2022 • Haohang Xu, Shuangrui Ding, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

Specifically, MRA consistently enhances the performance on supervised, semi-supervised as well as few-shot classification.

Image Augmentation Image Classification +1

Paper
Code

Fast Dynamic Radiance Fields with Time-Aware Neural Voxels

1 code implementation • 30 May 2022 • Jiemin Fang, Taoran Yi, Xinggang Wang, Lingxi Xie, Xiaopeng Zhang, Wenyu Liu, Matthias Nießner, Qi Tian

A multi-distance interpolation method is proposed and applied on voxel features to model both small and large motions.

307

Paper
Code

Gradient Concealment: Free Lunch for Defending Adversarial Attacks

no code implementations • 21 May 2022 • Sen Pei, Jiaxi Sun, Xiaopeng Zhang, Gaofeng Meng

Recent studies show that the deep neural networks (DNNs) have achieved great success in various tasks.

Robust classification

Paper
Add Code

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

1 code implementation • 27 Mar 2022 • Yunjie Tian, Lingxi Xie, Jiemin Fang, Mengnan Shi, Junran Peng, Xiaopeng Zhang, Jianbin Jiao, Qi Tian, Qixiang Ye

The past year has witnessed a rapid development of masked image modeling (MIM).

Paper
Code

Deep Point Cloud Simplification for High-quality Surface Reconstruction

no code implementations • 17 Mar 2022 • Yuanqi Li, Jianwei Guo, Xinran Yang, Shun Liu, Jie Guo, Xiaopeng Zhang, Yanwen Guo

In this paper, we propose a novel point cloud simplification network (PCS-Net) dedicated to high-quality surface mesh reconstruction while maintaining geometric fidelity.

Scene Understanding Surface Reconstruction +1

Paper
Add Code

MTLDesc: Looking Wider to Describe Better

1 code implementation • 14 Mar 2022 • Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang

Limited by the locality of convolutional neural networks, most existing local features description methods only learn local descriptors with local information and lack awareness of global and surrounding spatial context.

Indoor Localization

Paper
Code

TAPE: Task-Agnostic Prior Embedding for Image Restoration

no code implementations • 11 Mar 2022 • Lin Liu, Lingxi Xie, Xiaopeng Zhang, Shanxin Yuan, Xiangyu Chen, Wengang Zhou, Houqiang Li, Qi Tian

In this paper, we propose a novel approach that embeds a task-agnostic prior into a transformer.

Image Restoration

Paper
Add Code

The KFIoU Loss for Rotated Object Detection

3 code implementations • 29 Jan 2022 • Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian

This is in contrast to recent Gaussian modeling based rotation detectors e. g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors.

Object object-detection +1

1,719

Paper
Code

One-Bit Active Query With Contrastive Pairs

no code implementations • CVPR 2022 • Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian

The Yes query is treated as positive pairs of the queried category for contrastive pulling, while the No query is treated as hard negative pairs for contrastive repelling.

Active Learning Contrastive Learning

Paper
Add Code

NeuSample: Neural Sample Field for Efficient View Synthesis

1 code implementation • 30 Nov 2021 • Jiemin Fang, Lingxi Xie, Xinggang Wang, Xiaopeng Zhang, Wenyu Liu, Qi Tian

Neural radiance fields (NeRF) have shown great potentials in representing 3D scenes and synthesizing novel views, but the computational overhead of NeRF at the inference stage is still heavy.

Paper
Code

Semantic-Aware Generation for Self-Supervised Visual Representation Learning

1 code implementation • 25 Nov 2021 • Yunjie Tian, Lingxi Xie, Xiaopeng Zhang, Jiemin Fang, Haohang Xu, Wei Huang, Jianbin Jiao, Qi Tian, Qixiang Ye

In this paper, we propose a self-supervised visual representation learning approach which involves both generative and discriminative proxies, where we focus on the former part by requiring the target network to recover the original image based on the mid-level features.

Ranked #63 on Semantic Segmentation on Cityscapes test

Representation Learning Semantic Segmentation

Paper
Code

Self-supervised Re-renderable Facial Albedo Reconstruction from Single Image

1 code implementation • 16 Nov 2021 • Mingxin Yang, Jianwei Guo, Zhanglin Cheng, Xiaopeng Zhang, Dong-Ming Yan

To further make facial textures disentangled with illumination, we propose a novel detailed illumination representation which is reconstructed with the detailed albedo together.

3D Face Reconstruction Attribute +2

Paper
Code

Understanding Self-supervised Learning via Information Bottleneck Principle

no code implementations • 29 Sep 2021 • Jin Li, Yaoming Wang, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong

To address this issue, we introduce the information bottleneck principle and propose the Self-supervised Variational Information Bottleneck (SVIB) learning framework.

Contrastive Learning Self-Supervised Learning

Paper
Add Code

Single-Image Specular Highlight Removal via Real-World Dataset Construction

1 code implementation • TMM 2021 • Zhongqi Wu, Chuanqing Zhuang, Jian Shi, Jianwei Guo, Jun Xiao, Xiaopeng Zhang, Dong-Ming Yan

Specular reflections pose great challenges on various multimedia and computer vision tasks, e. g. , image segmentation, detection and matching.

Generative Adversarial Network Highlight Detection +3

Paper
Code

Bag of Instances Aggregation Boosts Self-supervised Distillation

1 code implementation • ICLR 2022 • Haohang Xu, Jiemin Fang, Xiaopeng Zhang, Lingxi Xie, Xinggang Wang, Wenrui Dai, Hongkai Xiong, Qi Tian

Here bag of instances indicates a set of similar samples constructed by the teacher and are grouped within a bag, and the goal of distillation is to aggregate compact representations over the student with respect to instances in a bag.

Contrastive Learning Self-Supervised Learning

Paper
Code

Multi-dataset Pretraining: A Unified Model for Semantic Segmentation

no code implementations • 8 Jun 2021 • Bowen Shi, Xiaopeng Zhang, Haohang Xu, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian

This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets regardless of their taxonomy labels, and followed by fine-tuning the pretrained model over specific dataset as usual.

Semantic Segmentation

Paper
Add Code

Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling

1 code implementation • 4 Jun 2021 • Zhenhui Xu, Meng Zhao, Liqun Liu, Lei Xiao, Xiaopeng Zhang, Bifeng Zhang

This paper introduces a novel multi-task model called Mixture of Virtual-Kernel Experts (MVKE) to learn user preferences on various actions and topics unitedly.

Recommendation Systems TAG

Paper
Code

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

3 code implementations • CVPR 2022 • Jiemin Fang, Lingxi Xie, Xinggang Wang, Xiaopeng Zhang, Wenyu Liu, Qi Tian

Transformers have offered a new methodology of designing neural networks for visual recognition.

Image Classification object-detection +1

Paper
Code

What Is Considered Complete for Visual Recognition?

no code implementations • 28 May 2021 • Lingxi Xie, Xiaopeng Zhang, Longhui Wei, Jianlong Chang, Qi Tian

This is an opinion paper.

Paper
Add Code

Semi-supervised Contrastive Learning with Similarity Co-calibration

no code implementations • 16 May 2021 • Yuhang Zhang, Xiaopeng Zhang, Robert. C. Qiu, Jie Li, Haohang Xu, Qi Tian

Semi-supervised learning acts as an effective way to leverage massive unlabeled data.

Contrastive Learning Few-Shot Learning +1

Paper
Add Code

Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation

5 code implementations • 12 May 2021 • Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang

In the past few years, convolutional neural networks (CNNs) have achieved milestones in medical image analysis.

Ranked #3 on Medical Image Segmentation on ACDC

Cardiac Segmentation Image Segmentation +1

1,490

Paper
Code

Deep Deformation Detail Synthesis for Thin Shell Models

no code implementations • 23 Feb 2021 • Lan Chen, Lin Gao, Jie Yang, Shibiao Xu, Juntao Ye, Xiaopeng Zhang, Yu-Kun Lai

Moreover, as such methods only add details, they require coarse meshes to be close to fine meshes, which can be either impossible, or require unrealistic constraints when generating fine meshes.

Paper
Add Code

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss

2 code implementations • 28 Jan 2021 • Xue Yang, Junchi Yan, Qi Ming, Wentao Wang, Xiaopeng Zhang, Qi Tian

Boundary discontinuity and its inconsistency to the final detection metric have been the bottleneck for rotating detection regression loss design.

Ranked #16 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images +2

1,719

Paper
Code

Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning

no code implementations • 4 Dec 2020 • Haohang Xu, Xiaopeng Zhang, Hao Li, Lingxi Xie, Hongkai Xiong, Qi Tian

In this paper, we propose a hierarchical semantic alignment strategy via expanding the views generated by a single image to \textbf{Cross-samples and Multi-level} representation, and models the invariance to semantically similar images in a hierarchical way.

Contrastive Learning Representation Learning +2

Paper
Add Code

Scene text removal via cascaded text stroke detection and erasing

1 code implementation • 19 Nov 2020 • Xuewei Bian, Chaoqun Wang, Weize Quan, Juntao Ye, Xiaopeng Zhang, Dong-Ming Yan

Specifically, we decouple the text removal problem into text stroke detection and stroke removal.

Paper
Code

Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations

no code implementations • 19 Nov 2020 • Xinyue Huo, Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Hao Li, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian

Contrastive learning has achieved great success in self-supervised visual representation learning, but existing approaches mostly ignored spatial information which is often crucial for visual representation.

Contrastive Learning Data Augmentation +1

Paper
Add Code

Can Semantic Labels Assist Self-Supervised Visual Representation Learning?

no code implementations • 17 Nov 2020 • Longhui Wei, Lingxi Xie, Jianzhong He, Jianlong Chang, Xiaopeng Zhang, Wengang Zhou, Houqiang Li, Qi Tian

Recently, contrastive learning has largely advanced the progress of unsupervised visual representation learning.

Contrastive Learning Representation Learning +1

Paper
Add Code

Center-wise Local Image Mixture For Contrastive Representation Learning

no code implementations • 5 Nov 2020 • Hao Li, Xiaopeng Zhang, Hongkai Xiong

Contrastive learning based on instance discrimination trains model to discriminate different transformations of the anchor sample from other samples, which does not consider the semantic similarity among samples.

Contrastive Learning Data Augmentation +3

Paper
Add Code

Weight-Sharing Neural Architecture Search: A Battle to Shrink the Optimization Gap

no code implementations • 4 Aug 2020 • Lingxi Xie, Xin Chen, Kaifeng Bi, Longhui Wei, Yuhui Xu, Zhengsu Chen, Lanfei Wang, An Xiao, Jianlong Chang, Xiaopeng Zhang, Qi Tian

Neural architecture search (NAS) has attracted increasing attentions in both academia and industry.

Neural Architecture Search

Paper
Add Code

Accurate Lung Nodules Segmentation with Detailed Representation Transfer and Soft Mask Supervision

no code implementations • 29 Jul 2020 • Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

Then, a novel Network with detailed representation transfer and Soft Mask supervision (DSNet) is proposed to process the input low-resolution images of lung nodules into high-quality segmentation results.

Computed Tomography (CT) Lesion Segmentation +3

Paper
Add Code

Searching towards Class-Aware Generators for Conditional Generative Adversarial Networks

1 code implementation • 25 Jun 2020 • Peng Zhou, Lingxi Xie, Xiaopeng Zhang, Bingbing Ni, Qi Tian

To learn the sampling policy, a Markov decision process is embedded into the search algorithm and a moving average is applied for better stability.

Image Generation

Paper
Code

Distilling Object Detectors with Task Adaptive Regularization

no code implementations • 23 Jun 2020 • Ruoyu Sun, Fuhui Tang, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

Knowledge distillation, which aims at training a smaller student network by transferring knowledge from a larger teacher model, is one of the promising solutions for model miniaturization.

Knowledge Distillation Object +1

Paper
Add Code

A survey on deep hashing for image retrieval

no code implementations • 10 Jun 2020 • Xiaopeng Zhang

To this end, I propose a concept: shadow of the CNN output.

Deep Hashing Image Retrieval

Paper
Add Code

Effective and Robust Detection of Adversarial Examples via Benford-Fourier Coefficients

no code implementations • 12 May 2020 • Chengcheng Ma, Baoyuan Wu, Shibiao Xu, Yanbo Fan, Yong Zhang, Xiaopeng Zhang, Zhifeng Li

In this work, we study the detection of adversarial examples, based on the assumption that the output and internal responses of one DNN model for both adversarial and benign examples follow the generalized Gaussian distribution (GGD), but with different parameters (i. e., shape factor, mean, and variance).

Image Classification

Paper
Add Code

Attribute Mix: Semantic Data Augmentation for Fine Grained Recognition

1 code implementation • 6 Apr 2020 • Hao Li, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

In this paper, we propose Attribute Mix, a data augmentation strategy at attribute level to expand the fine-grained samples.

Ranked #22 on Fine-Grained Image Classification on CUB-200-2011

Attribute Data Augmentation +1

567

Paper
Code

Circumventing Outliers of AutoAugment with Knowledge Distillation

1 code implementation • ECCV 2020 • Longhui Wei, An Xiao, Lingxi Xie, Xin Chen, Xiaopeng Zhang, Qi Tian

AutoAugment has been a powerful algorithm that improves the accuracy of many vision tasks, yet it is sensitive to the operator space as well as hyper-parameters, and an improper setting may degenerate network optimization.

Ranked #185 on Image Classification on ImageNet

Data Augmentation General Classification +2

Paper
Code

MGCN: Descriptor Learning using Multiscale GCNs

no code implementations • 28 Jan 2020 • Yiqun Wang, Jing Ren, Dong-Ming Yan, Jianwei Guo, Xiaopeng Zhang, Peter Wonka

Second, we propose a new multiscale graph convolutional network (MGCN) to transform a non-learned feature to a more discriminative descriptor.

Paper
Add Code

Latency-Aware Differentiable Neural Architecture Search

1 code implementation • 17 Jan 2020 • Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Bowen Shi, Qi Tian, Hongkai Xiong

However, these methods suffer the difficulty in optimizing network, so that the searched network is often unfriendly to hardware.

Neural Architecture Search

Paper
Code

Wasserstein-Bounded Generative Adversarial Networks

no code implementations • ICLR 2020 • Peng Zhou, Bingbing Ni, Lingxi Xie, Xiaopeng Zhang, Hang Wang, Cong Geng, Qi Tian

In the field of Generative Adversarial Networks (GANs), how to design a stable training strategy remains an open problem.

Paper
Add Code

Capacity Preserving Mapping for High-dimensional Data Visualization

1 code implementation • 29 Sep 2019 • Rongrong Wang, Xiaopeng Zhang

We provide a rigorous mathematical treatment to the crowding issue in data visualization when high dimensional data sets are projected down to low dimensions for visualization.

Data Visualization Dimensionality Reduction +1

Paper
Code

Central Similarity Quantization for Efficient Image and Video Retrieval

1 code implementation • CVPR 2020 • Li Yuan, Tao Wang, Xiaopeng Zhang, Francis EH Tay, Zequn Jie, Wei Liu, Jiashi Feng

In this work, we propose a new \emph{global} similarity metric, termed as \emph{central similarity}, with which the hash codes of similar data pairs are encouraged to approach a common center and those for dissimilar pairs to converge to different centers, to improve hash learning efficiency and retrieval accuracy.

Quantization Retrieval +1

227

Paper
Code

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

8 code implementations • ICLR 2020 • Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong

Differentiable architecture search (DARTS) provided a fast solution in finding effective network architectures, but suffered from large memory and computing overheads in jointly training a super-network and searching for an optimal architecture.

Ranked #20 on Neural Architecture Search on CIFAR-10

Neural Architecture Search

429

Paper
Code

Distilling Object Detectors with Fine-grained Feature Imitation

3 code implementations • CVPR 2019 • Tao Wang, Li Yuan, Xiaopeng Zhang, Jiashi Feng

To address the challenge of distilling knowledge in detection model, we propose a fine-grained feature imitation method exploiting the cross-location discrepancy of feature response.

Knowledge Distillation Object +2

412

Paper
Code

Low-Power Computer Vision: Status, Challenges, Opportunities

no code implementations • 15 Apr 2019 • Sergei Alyamkin, Matthew Ardi, Alexander C. Berg, Achille Brighton, Bo Chen, Yiran Chen, Hsin-Pai Cheng, Zichen Fan, Chen Feng, Bo Fu, Kent Gauen, Abhinav Goel, Alexander Goncharenko, Xuyang Guo, Soonhoi Ha, Andrew Howard, Xiao Hu, Yuanjun Huang, Donghyun Kang, Jaeyoun Kim, Jong Gook Ko, Alexander Kondratyev, Junhyeok Lee, Seungjae Lee, Suwoong Lee, Zichao Li, Zhiyu Liang, Juzheng Liu, Xin Liu, Yang Lu, Yung-Hsiang Lu, Deeptanshu Malik, Hong Hanh Nguyen, Eunbyung Park, Denis Repin, Liang Shen, Tao Sheng, Fei Sun, David Svitov, George K. Thiruvathukal, Baiwu Zhang, Jingchi Zhang, Xiaopeng Zhang, Shaojie Zhuo

In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots).

Paper
Add Code

Few-shot Adaptive Faster R-CNN

no code implementations • CVPR 2019 • Tao Wang, Xiaopeng Zhang, Li Yuan, Jiashi Feng

To address these challenges, we first introduce a pairing mechanism over source and target features to alleviate the issue of insufficient target domain samples.

object-detection Object Detection +1

Paper
Add Code

Low Power Inference for On-Device Visual Recognition with a Quantization-Friendly Solution

no code implementations • 12 Mar 2019 • Chen Feng, Tao Sheng, Zhiyu Liang, Shaojie Zhuo, Xiaopeng Zhang, Liang Shen, Matthew Ardi, Alexander C. Berg, Yiran Chen, Bo Chen, Kent Gauen, Yung-Hsiang Lu

The IEEE Low-Power Image Recognition Challenge (LPIRC) is an annual competition started in 2015 that encourages joint hardware and software solutions for computer vision systems with low latency and power.

Quantization

Paper
Add Code

Detecting Colorized Images via Convolutional Neural Networks: Toward High Accuracy and Good Generalization

no code implementations • 17 Feb 2019 • Weize Quan, Dong-Ming Yan, Kai Wang, Xiaopeng Zhang, Denis Pellerin

First, we design and implement a base network, which can attain better performance in terms of classification accuracy and generalization (in most cases) compared with state-of-the-art methods.

Colorization General Classification +1

Paper
Add Code

2018 Low-Power Image Recognition Challenge

no code implementations • 3 Oct 2018 • Sergei Alyamkin, Matthew Ardi, Achille Brighton, Alexander C. Berg, Yiran Chen, Hsin-Pai Cheng, Bo Chen, Zichen Fan, Chen Feng, Bo Fu, Kent Gauen, Jongkook Go, Alexander Goncharenko, Xuyang Guo, Hong Hanh Nguyen, Andrew Howard, Yuanjun Huang, Donghyun Kang, Jaeyoun Kim, Alexander Kondratyev, Seungjae Lee, Suwoong Lee, Junhyeok Lee, Zhiyu Liang, Xin Liu, Juzheng Liu, Zichao Li, Yang Lu, Yung-Hsiang Lu, Deeptanshu Malik, Eunbyung Park, Denis Repin, Tao Sheng, Liang Shen, Fei Sun, David Svitov, George K. Thiruvathukal, Baiwu Zhang, Jingchi Zhang, Xiaopeng Zhang, Shaojie Zhuo

The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing. ieee. org/lpirc) is an annual competition started in 2015.

Paper
Add Code

Learning 3D Keypoint Descriptors for Non-Rigid Shape Matching

no code implementations • ECCV 2018 • Hanyu Wang, Jianwei Guo, Dong-Ming Yan, Weize Quan, Xiaopeng Zhang

In this paper, we present a novel deep learning framework that derives discriminative local descriptors for 3D surface shapes.

Metric Learning

Paper
Add Code

ML-LocNet: Improving Object Localization with Multi-view Learning Network

no code implementations • ECCV 2018 • Xiaopeng Zhang, Yang Yang, Jiashi Feng

This paper addresses Weakly Supervised Object Localization (WSOL) with only image-level supervision.

MULTI-VIEW LEARNING Weakly-Supervised Object Localization

Paper
Add Code

Zigzag Learning for Weakly Supervised Object Detection

no code implementations • CVPR 2018 • Xiaopeng Zhang, Jiashi Feng, Hongkai Xiong, Qi Tian

Unlike them, we propose a zigzag learning strategy to simultaneously discover reliable object instances and prevent the model from overfitting initial seeds.

Ranked #16 on Weakly Supervised Object Detection on PASCAL VOC 2012 test

Object object-detection +1

Paper
Add Code

A Quantization-Friendly Separable Convolution for MobileNets

1 code implementation • 22 Mar 2018 • Tao Sheng, Chen Feng, Shaojie Zhuo, Xiaopeng Zhang, Liang Shen, Mickey Aleksic

As deep learning (DL) is being rapidly pushed to edge computing, researchers invented various ways to make inference computation more efficient on mobile/IoT devices, such as network pruning, parameter compression, and etc.

Edge-computing Image Classification +2

548

Paper
Code

Speeding Up the Bilateral Filter: A Joint Acceleration Way

no code implementations • 28 Feb 2018 • Longquan Dai, Mengke Yuan, Xiaopeng Zhang

To achieve the constant-time BF whose complexity is irrelevant to the kernel size, many techniques have been proposed, such as 2D box filtering, dimension promotion, and shiftability property.

Paper
Add Code

Hardware-Efficient Guided Image Filtering For Multi-Label Problem

no code implementations • CVPR 2017 • Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, Jinhui Tang

In this paper we propose a hardware-efficient Guided Filter (HGF), which solves the efficiency problem of multichannel guided image filtering and yields competent results when applying it to multi-label problems with synthesized polynomial multichannel guidance.

Paper
Add Code

Ensemble of Part Detectors for Simultaneous Classification and Localization

no code implementations • 29 May 2017 • Xiaopeng Zhang, Hongkai Xiong, Weiyao Lin, Qi Tian

Part-based representation has been proven to be effective for a variety of visual applications.

Classification Clustering +4

Paper
Add Code

Picking Deep Filter Responses for Fine-Grained Image Recognition

no code implementations • CVPR 2016 • Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian

Recognizing fine-grained sub-categories such as birds and dogs is extremely challenging due to the highly localized and subtle differences in some specific parts.

Fine-Grained Image Recognition

Paper
Add Code

Fully Connected Guided Image Filtering

no code implementations • ICCV 2015 • Longquan Dai, Mengke Yuan, Feihu Zhang, Xiaopeng Zhang

This paper presents a linear time fully connected guided filter by introducing the minimum spanning tree (MST) to the guided filter (GF).

Paper
Add Code

Segment Graph Based Image Filtering: Fast Structure-Preserving Smoothing

no code implementations • ICCV 2015 • Feihu Zhang, Longquan Dai, Shiming Xiang, Xiaopeng Zhang

In our SGF, we use the tree distance on the segment graph to define the internal weight function of the filtering kernel, which enables the filter to smooth out high-contrast details and textures while preserving major image structures very well.

Optical Flow Estimation Stereo Matching +1

Paper
Add Code

Image Retargeting by Content-Aware Synthesis

no code implementations • 26 Mar 2014 • Weiming Dong, Fuzhang Wu, Yan Kong, Xing Mei, Tong-Yee Lee, Xiaopeng Zhang

We propose to retarget the textural regions by content-aware synthesis and non-textural regions by fast multi-operators.

Image Retargeting

Paper
Add Code

Segment-Tree Based Cost Aggregation for Stereo Matching

no code implementations • CVPR 2013 • Xing Mei, Xun Sun, Wei-Ming Dong, Haitao Wang, Xiaopeng Zhang

Instead of employing the minimum spanning tree (MST) and its variants, a new tree structure, "Segment-Tree", is proposed for non-local matching cost aggregation.

Scene Segmentation Stereo Matching +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.