Search Results for author: Xin Pan

Found 11 papers, 4 papers with code

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

no code implementations • CVPR 2017 • Esteban Real, Jonathon Shlens, Stefano Mazzocchi, Xin Pan, Vincent Vanhoucke

We introduce a new large-scale data set of video URLs with densely-sampled object bounding box annotations called YouTube-BoundingBoxes (YT-BB).

General Classification object-detection +1

Paper
Add Code

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

no code implementations • 22 Jun 2022 • Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan

Recently, Synthetic data-based Instance Segmentation has become an exceedingly favorable optimization paradigm since it leverages simulation rendering and physics to generate high-quality image-annotation pairs.

Instance Segmentation Segmentation +1

Paper
Add Code

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

4 code implementations • 12 Jul 2022 • Jiashi Li, Xin Xia, Wei Li, Huixia Li, Xing Wang, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

Then, Next Hybrid Strategy (NHS) is designed to stack NCB and NTB in an efficient hybrid paradigm, which boosts performance in various downstream tasks.

Ranked #281 on Image Classification on ImageNet

Image Classification

29,846

Paper
Code

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

1 code implementation • CVPR 2023 • Yuexiao Ma, Huixia Li, Xiawu Zheng, Xuefeng Xiao, Rui Wang, Shilei Wen, Xin Pan, Fei Chao, Rongrong Ji

In particular, we first formulate the oscillation in PTQ and prove the problem is caused by the difference in module capacity.

Quantization

Paper
Code

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

no code implementations • CVPR 2023 • Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary categories of text-based descriptions, which popularizes the segmentation system to more general-purpose application scenarios.

Ranked #6 on Open Vocabulary Panoptic Segmentation on ADE20K

Image Segmentation Instance Segmentation +3

Paper
Add Code

Channel-Spatial-Based Few-Shot Bird Sound Event Detection

no code implementations • 18 Jun 2023 • Lingwen Liu, Yuxuan Feng, Haitao Fu, Yajie Yang, Xin Pan, Chenlei Jin

In this paper, we propose a model for bird sound event detection that focuses on a small number of training samples within the everyday long-tail distribution.

Event Detection Few-Shot Learning +2

Paper
Add Code

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection

1 code implementation • ICCV 2023 • Ming Li, Jie Wu, Xionghui Wang, Chen Chen, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

To this end, we propose AlignDet, a unified pre-training framework that can be adapted to various existing detectors to alleviate the discrepancies.

object-detection Object Detection

129

Paper
Code

UGC: Unified GAN Compression for Efficient Image-to-Image Translation

no code implementations • ICCV 2023 • Yuxi Ren, Jie Wu, Peng Zhang, Manlin Zhang, Xuefeng Xiao, Qian He, Rui Wang, Min Zheng, Xin Pan

Recent years have witnessed the prevailing progress of Generative Adversarial Networks (GANs) in image-to-image translation.

Image-to-Image Translation Translation

Paper
Add Code

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration

1 code implementation • ICCV 2023 • Lijiang Li, Huixia Li, Xiawu Zheng, Jie Wu, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan, Fei Chao, Rongrong Ji

Therefore, we propose to search the optimal time steps sequence and compressed model architecture in a unified framework to achieve effective image generation for diffusion models without any further training.

Image Generation single-image-generation

Paper
Code

SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity

no code implementations • 30 Oct 2023 • Haitao Xu, Songwei Liu, Yuyang Xu, Shuai Wang, Jiashi Li, Chenqian Yan, Liangqiang Li, Lean Fu, Xin Pan, Fangmin Chen

Our framework consists of two parts: (a) A fine-grained kernel sparsity schema with a sparsity granularity between structured pruning and unstructured pruning.

Network Pruning

Paper
Add Code

The stability and instability of the language control network: a longitudinal resting-state functional magnetic resonance imaging study

no code implementations • 23 Jan 2024 • Zilong Li, Cong Liu, Xin Pan, Guosheng Ding, Ruiming Wang

These findings provide preliminary evidence of the coexistence of stability and instability in the language control network.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.