Search Results for author: Qiang Chen

Found 58 papers, 30 papers with code

Towards Accurate Post-training Network Quantization via Bit-Split and Stitching

1 code implementation ICML 2020 Peisong Wang, Qiang Chen, Xiangyu He, Jian Cheng

Network quantization is essential for deploying deep models to IoT devices because of its high efficiency, whether on specialized hardware such as TPUs or on general-purpose hardware such as CPUs and GPUs.

Image Classification Instance Segmentation +4

FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs

no code implementations 20 Sep 2024 Jing Hao, Yuxiang Zhao, Song Chen, Yanpeng Sun, Qiang Chen, Gang Zhang, Kun Yao, Errui Ding, Jingdong Wang

To this end, we devised the FullAnno system, which is a data engine that can generate large-scale, high-quality, and fine-grained image annotations consisting of the category and position of objects, region descriptions, text information, as well as image dense captions.

Image Captioning Image Comprehension

Automated Quantification of Hyperreflective Foci in SD-OCT With Diabetic Retinopathy

no code implementations 31 Jul 2024 Idowu Paul Okuwobi, Zexuan Ji, Wen Fan, Songtao Yuan, Loza Bekalo, Qiang Chen

However, the lack of efficient quantitative tools for evaluating hyperreflective foci (HFs) has prevented ophthalmologists from assessing HF volume.

Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models

1 code implementation 26 May 2024 Kun Huang, Xiao Ma, Yuhan Zhang, Na Su, Songtao Yuan, Yong Liu, Qiang Chen, Huazhu Fu

In tandem with autoencoders, we propose cascaded diffusion processes to synthesize high-resolution OCT volumes with a global-to-local refinement process, amortizing the memory and computational demands.

Adaptive Fuzzy C-Means with Graph Embedding

no code implementations 22 May 2024 Qiang Chen, Weizhong Yu, Feiping Nie, Xuelong Li

Fuzzy clustering algorithms can be roughly categorized into two main groups: Fuzzy C-Means (FCM) based methods and mixture model based methods.

Clustering Graph Embedding

VRP-SAM: SAM with Visual Reference Prompt

1 code implementation CVPR 2024 Yanpeng Sun, Jiahui Chen, Shan Zhang, Xinyu Zhang, Qiang Chen, Gang Zhang, Errui Ding, Jingdong Wang, Zechao Li

In this paper, we propose a novel Visual Reference Prompt (VRP) encoder that empowers the Segment Anything Model (SAM) to utilize annotated reference images as prompts for segmentation, creating the VRP-SAM model.

Meta-Learning Segmentation

MS-DETR: Efficient DETR Training with Mixed Supervision

1 code implementation CVPR 2024 Chuyang Zhao, Yifan Sun, Wenhao Wang, Qiang Chen, Errui Ding, Yi Yang, Jingdong Wang

The traditional training procedure using one-to-one supervision in the original DETR lacks direct supervision for the object detection candidates.

Decoder Object +2

Adjustable Robust Transformer for High Myopia Screening in Optical Coherence Tomography

1 code implementation 12 Dec 2023 Xiao Ma, Zetian Zhang, Zexuan Ji, Kun Huang, Na Su, Songtao Yuan, Qiang Chen

Measurements of spherical equivalent and axial length are the gold standards for identifying high myopia, but the available image data for matching them is scarce.

Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation

no code implementations 18 Sep 2023 Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang

Moreover, to address the semantic conflicts between image and frequency domains, the forgery-aware mutual module is developed to further enable the effective interaction of disparate image and frequency features, resulting in aligned and comprehensive visual forgery representations.

Decoder Misinformation

Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation

2 code implementations ICCV 2023 Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang

State-of-the-art solutions adopt the DETR-like framework and mainly develop complex decoders, e.g., regarding pose estimation as keypoint box detection combined with human detection in ED-Pose, or hierarchically predicting with a pose decoder and a joint (keypoint) decoder in PETR.

Decoder Human Detection +1

Learn Single-horizon Disease Evolution for Predictive Generation of Post-therapeutic Neovascular Age-related Macular Degeneration

no code implementations 12 Aug 2023 Yuhan Zhang, Kun Huang, Mingchao Li, Songtao Yuan, Qiang Chen

We propose a single-horizon disease evolution network (SHENet) to predictively generate post-therapeutic SD-OCT images by inputting pre-therapeutic SD-OCT images with neovascular age-related macular degeneration (nAMD).

Disease Prediction

Enhancing Your Trained DETRs with Box Refinement

1 code implementation 21 Jul 2023 Yiqun Chen, Qiang Chen, Peize Sun, Shoufa Chen, Jingdong Wang, Jian Cheng

We hope our work will bring the attention of the detection community to the localization bottleneck of current DETR-like models and highlight the potential of the RefineBox framework.

Exploring Effective Factors for Improving Visual In-Context Learning

1 code implementation 10 Apr 2023 Yanpeng Sun, Qiang Chen, Jian Wang, Jingdong Wang, Zechao Li

By doing this, the model can leverage the diverse knowledge stored in different parts of the model to improve its performance on new tasks.

In-Context Learning Meta-Learning +1

σ-Adaptive Decoupled Prototype for Few-Shot Object Detection

no code implementations ICCV 2023 Jinhao Du, Shan Zhang, Qiang Chen, Haifeng Le, Yanpeng Sun, Yao Ni, Jian Wang, Bin He, Jingdong Wang

To provide precise information for the query image, the prototype is decoupled into task-specific ones, which provide tailored guidance for 'where to look' and 'what to look for', respectively.

Few-Shot Object Detection Meta-Learning +3

DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection

1 code implementation 25 Nov 2022 Yiqun Chen, Qiang Chen, Qinghao Hu, Jian Cheng

In this paper, we revisit these two assignment methods and find that bringing one-to-many assignment back to end-to-end fully convolutional detectors helps with model convergence.

Object Detection

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

no code implementations arXiv 2022 Qiang Chen, Jian Wang, Chuchu Han, Shan Zhang, Zexian Li, Xiaokang Chen, Jiahui Chen, Xiaodi Wang, Shuming Han, Gang Zhang, Haocheng Feng, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

The training process consists of self-supervised pretraining and finetuning a ViT-Huge encoder on ImageNet-1K, pretraining the detector on Objects365, and finally finetuning it on COCO.

Ranked #8 on Object Detection on COCO test-dev (using extra training data)

Decoder Object +2

SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training

1 code implementation COLING 2022 Dan Qiao, Chenchen Dai, Yuyang Ding, Juntao Li, Qiang Chen, Wenliang Chen, Min Zhang

The conventional success of textual classification relies on annotated data, and the new paradigm of pre-trained language models (PLMs) still requires some labeled data for downstream tasks.

Text Classification

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

2 code implementations ICCV 2023 Qiang Chen, Xiaokang Chen, Jian Wang, Shan Zhang, Kun Yao, Haocheng Feng, Junyu Han, Errui Ding, Gang Zeng, Jingdong Wang

Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth object to one prediction, for end-to-end detection without NMS post-processing.

Data Augmentation Decoder +3
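
The one-to-one assignment described in the snippet above can be illustrated with a small sketch. It assumes a hypothetical cost matrix `cost[g][p]` (cost of matching ground-truth object `g` to prediction `p`) and searches all assignments by brute force; DETR-style detectors typically use Hungarian matching for this step, so the exhaustive search here is only a stand-in that is practical for tiny inputs. The function name and the cost values are illustrative and are not taken from the paper.

```python
from itertools import permutations
from typing import List, Tuple


def one_to_one_assign(cost: List[List[float]]) -> Tuple[Tuple[int, ...], float]:
    """Match each ground-truth object to a distinct prediction at minimum total cost.

    cost[g][p] is the (hypothetical) cost of pairing ground truth g with prediction p.
    Returns (match, total), where match[g] is the prediction index assigned to g.
    """
    num_gt = len(cost)
    num_pred = len(cost[0])
    best_match: Tuple[int, ...] = ()
    best_cost = float("inf")
    # Enumerate every way to pick num_gt distinct predictions, in order.
    for perm in permutations(range(num_pred), num_gt):
        total = sum(cost[g][p] for g, p in enumerate(perm))
        if total < best_cost:
            best_match, best_cost = perm, total
    return best_match, best_cost


# Two ground-truth objects, three predictions (made-up costs).
cost = [
    [9.0, 1.0, 8.0],
    [2.0, 3.0, 7.0],
]
match, total = one_to_one_assign(cost)
print(match, total)
```

Each ground-truth object ends up with exactly one prediction and no prediction is used twice, which is why no NMS post-processing is needed to remove duplicates.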

Commonsense Knowledge Salience Evaluation with a Benchmark Dataset in E-commerce

1 code implementation 22 May 2022 Yincen Qu, Ningyu Zhang, Hui Chen, Zelin Dai, Zezhong Xu, Chengming Wang, Xiaoyu Wang, Qiang Chen, Huajun Chen

In addition to formulating the new task, we also release a new Benchmark dataset of Salience Evaluation in E-commerce (BSEE) and hope to promote related research on commonsense knowledge salience evaluation.

Label Adversarial Learning for Skeleton-level to Pixel-level Adjustable Vessel Segmentation

no code implementations 7 May 2022 Mingchao Li, Kun Huang, Zetian Zhang, Xiao Ma, Qiang Chen

This continuous process allows us to recommend high-quality vessel segmentation with clear caliber and topology.

Segmentation

Improving Transferability for Domain Adaptive Detection Transformers

1 code implementation 29 Apr 2022 Kaixiong Gong, Shuang Li, Shugang Li, Rui Zhang, Chi Harold Liu, Qiang Chen

We incorporate these findings and the alignment modules into our adaptation method, which benchmarks DETR-style detectors under domain-shift settings.

Decoder Object Detection +1

Knowledge Graph Embedding in E-commerce Applications: Attentive Reasoning, Explanations, and Transferable Rules

no code implementations 16 Dec 2021 Wen Zhang, Shumin Deng, Mingyang Chen, Liang Wang, Qiang Chen, Feiyu Xiong, Xiangwen Liu, Huajun Chen

We first identify three important desiderata for e-commerce KG systems: 1) attentive reasoning, reasoning over a few target relations of greater concern instead of all; 2) explanation, providing explanations for a prediction to help both users and business operators understand why the prediction is made; 3) transferable rules, generating reusable rules to accelerate the deployment of a KG to new systems.

Entity Embeddings Graph Attention +4

Image Magnification Network for Vessel Segmentation in OCTA Images

no code implementations 26 Oct 2021 Mingchao Li, Yerui Chen, Weiwei Zhang, Qiang Chen

Optical coherence tomography angiography (OCTA) is a novel non-invasive imaging modality that allows micron-level resolution to visualize the retinal microvasculature.

Decoder Retinal Vessel Segmentation +1

DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy

1 code implementation 16 Oct 2021 Anda Cheng, Jiaxing Wang, Xi Sheryl Zhang, Qiang Chen, Peisong Wang, Jian Cheng

To fill this gap, we propose the first framework that applies neural architecture search to automatic model design for private deep learning, dubbed DPNAS.

Neural Architecture Search

Improving Binary Neural Networks through Fully Utilizing Latent Weights

no code implementations 12 Oct 2021 Weixiang Xu, Qiang Chen, Xiangyu He, Peisong Wang, Jian Cheng

Binary Neural Networks (BNNs) rely on a real-valued auxiliary variable W to help binary training.

HIH: Towards More Accurate Face Alignment via Heatmap in Heatmap

1 code implementation 7 Apr 2021 Xing Lan, Qinghao Hu, Qiang Chen, Jian Xue, Jian Cheng

In particular, our HIH reaches 4.08 NME (Normalized Mean Error) on WFLW, and 3.21 on COFW, which exceeds previous methods by a significant margin.

Face Alignment regression

You Only Look One-level Feature

6 code implementations CVPR 2021 Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun

From the perspective of optimization, we introduce an alternative way to address the problem instead of adopting the complex feature pyramids: utilizing only one-level feature for detection.

Object Detection

Scaling of Magnetic Dissipation and Particle Acceleration in ABC Fields

no code implementations 18 Feb 2021 Qiang Chen, Krzysztof Nalewajko, Bhupendra Mishra

Using particle-in-cell (PIC) numerical simulations with electron-positron pair plasma, we study how the efficiencies of magnetic dissipation and particle acceleration scale with the initial coherence length $\lambda_0$ in relation to the system size $L$ of the two-dimensional (2D) 'Arnold-Beltrami-Childress' (ABC) magnetic field configurations.

High Energy Astrophysical Phenomena Plasma Physics

OCTA-500: A Retinal Dataset for Optical Coherence Tomography Angiography Study

7 code implementations 14 Dec 2020 Mingchao Li, Kun Huang, Qiuzhuo Xu, Jiadong Yang, Yuhan Zhang, Zexuan Ji, Keren Xie, Songtao Yuan, Qinghuai Liu, Qiang Chen

Optical coherence tomography angiography (OCTA) is a novel imaging modality that has been widely utilized in ophthalmology and neuroscience studies to observe retinal vessels and microvascular systems.

Image Segmentation Segmentation +1

Simulation of Skin Stretching around the Forehead Wrinkles in Rhytidectomy

no code implementations 1 Jan 2020 Ping Zhou, Shuo Huang, Qiang Chen, Siyuan He, Guochao Cai

Finally, the stress distribution and the residual wrinkles of forehead skin were employed to evaluate the surgical effect.

Location-aware Upsampling for Semantic Segmentation

1 code implementation 13 Nov 2019 Xiangyu He, Zitao Mo, Qiang Chen, Anda Cheng, Peisong Wang, Jian Cheng

Many successful learning targets such as minimizing dice loss and cross-entropy loss have enabled unprecedented breakthroughs in segmentation tasks.

Decoder Segmentation +1

SpatialFlow: Bridging All Tasks for Panoptic Segmentation

1 code implementation 19 Oct 2019 Qiang Chen, Anda Cheng, Xiangyu He, Peisong Wang, Jian Cheng

Object location is fundamental to panoptic segmentation as it is related to all things and stuff in the image scene.

Instance Segmentation Object +3

Meta Relational Learning for Few-Shot Link Prediction in Knowledge Graphs

1 code implementation IJCNLP 2019 Mingyang Chen, Wen Zhang, Wei Zhang, Qiang Chen, Huajun Chen

Link prediction is an important way to complete knowledge graphs (KGs), while embedding-based methods, effective for link prediction in KGs, perform poorly on relations that only have a few associative triples.

Knowledge Graphs Link Prediction +2

Compact Global Descriptor for Neural Networks

1 code implementation 23 Jul 2019 Xiangyu He, Ke Cheng, Qiang Chen, Qinghao Hu, Peisong Wang, Jian Cheng

Long-range dependency modeling, widely used to capture spatiotemporal correlation, has been shown to be effective in CNN-dominated computer vision tasks.

Audio Classification Deep Attention +2

Feature Map Pooling for Cross-View Gait Recognition Based on Silhouette Sequence Images

no code implementations 26 Nov 2017 Qiang Chen, Yunhong Wang, Zheng Liu, Qingjie Liu, Di Huang

In this paper, we develop a novel convolutional neural network based approach to extract and aggregate useful information from gait silhouette sequence images instead of simply representing the gait process by averaging silhouette images.

Gait Recognition

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations ICCV 2017 Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

A Fast Factorization-based Approach to Robust PCA

no code implementations 27 Sep 2016 Chong Peng, Zhao Kang, Qiang Chen

Our method can be used as a light-weight, scalable tool for RPCA in the absence of the precise value of the true rank.

LogDet Rank Minimization with Application to Subspace Clustering

no code implementations 3 Jul 2015 Zhao Kang, Chong Peng, Jie Cheng, Qiang Chen

Most of the recent studies use the nuclear norm as a convex surrogate of the rank operator.

Clustering Face Clustering +1

Towards Unified Human Parsing and Pose Estimation

no code implementations CVPR 2014 Jian Dong, Qiang Chen, Xiaohui Shen, Jianchao Yang, Shuicheng Yan

We study the problem of human body configuration analysis, more specifically, human parsing and human pose estimation.

Human Parsing Pose Estimation

Network In Network

17 code implementations 16 Dec 2013 Min Lin, Qiang Chen, Shuicheng Yan

With enhanced local modeling via the micro network, we are able to utilize global average pooling over feature maps in the classification layer, which is easier to interpret and less prone to overfitting than traditional fully connected layers.

Face Identification General Classification +1
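
As a rough illustration of the global average pooling mentioned in the snippet above, the sketch below reduces each feature map to a single number by averaging over its spatial positions, so a stack of C feature maps yields C class scores with no fully connected layer. It is a minimal pure-Python sketch with made-up values; the function name and the nested-list representation are assumptions for illustration, not the authors' implementation.

```python
from typing import List

FeatureMap = List[List[float]]  # one H x W channel as nested lists


def global_average_pool(feature_maps: List[FeatureMap]) -> List[float]:
    """Collapse each H x W feature map to its mean, giving one score per channel."""
    pooled = []
    for fmap in feature_maps:
        values = [v for row in fmap for v in row]  # flatten the spatial grid
        pooled.append(sum(values) / len(values))
    return pooled


# Two 2 x 2 feature maps, e.g. one per class (illustrative values).
maps = [
    [[1.0, 3.0], [5.0, 7.0]],  # channel 0
    [[0.0, 0.0], [2.0, 2.0]],  # channel 1
]
scores = global_average_pool(maps)
print(scores)
```

Because the pooling step has no trainable parameters, there are no extra weights to overfit at this stage, and each output score corresponds directly to one feature map.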

Subcategory-Aware Object Classification

no code implementations CVPR 2013 Jian Dong, Wei Xia, Qiang Chen, Jianshi Feng, Zhongyang Huang, Shuicheng Yan

In this paper, we introduce a subcategory-aware object classification framework to boost category level object classification performance.

Classification General Classification +1

Efficient Maximum Appearance Search for Large-Scale Object Detection

no code implementations CVPR 2013 Qiang Chen, Zheng Song, Rogerio Feris, Ankur Datta, Liangliang Cao, Zhongyang Huang, Shuicheng Yan

In recent years, efficiency of large-scale object detection has arisen as an important topic due to the exponential growth in the size of benchmark object detection datasets.

Object object-detection +1
