Search Results for author: Qilong Wang

Found 27 papers, 22 papers with code

AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning

1 code implementation • 13 Apr 2024 • Yuwei Tang, Zhenyi Lin, Qilong Wang, Pengfei Zhu, QinGhua Hu

To this end, we disassemble three key components involved in computation of logit bias (i. e., logit features, logit predictor, and logit fusion) and empirically analyze the effect on performance of few-shot classification.

Few-Shot Learning

Paper
Code

A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation

1 code implementation • 10 Dec 2023 • Yunheng Li, Zhongyu Li, ShangHua Gao, Qilong Wang, Qibin Hou, Ming-Ming Cheng

Effectively modeling discriminative spatio-temporal information is essential for segmenting activities in long action sequences.

Action Segmentation

Paper
Code

Tuning Pre-trained Model via Moment Probing

1 code implementation • ICCV 2023 • Mingze Gao, Qilong Wang, Zhenyi Lin, Pengfei Zhu, QinGhua Hu, Jingbo Zhou

Distinguished from LP which builds a linear classification head based on the mean of final features (e. g., word tokens for ViT) or classification tokens, our MP performs a linear classifier on feature distribution, which provides the stronger representation ability by exploiting richer statistical information inherent in features.

Image Classification

Paper
Code

DropCov: A Simple yet Effective Method for Improving Deep Architectures

1 code implementation • NeurIPS 2022 Conference 2023 • Qilong Wang, Mingze Gao, Zhaolin Zhang, Jiangtao Xie, Peihua Li, QinGhua Hu

Particularly, we for the first time show that \textit{effective post-normalization can make a good trade-off between representation decorrelation and information preservation for GCP, which are crucial to alleviate over-fitting and increase representation ability of deep GCP networks, respectively}.

Paper
Code

Reliable and Interpretable Personalized Federated Learning

no code implementations • CVPR 2023 • Zixuan Qin, Liu Yang, Qilong Wang, Yahong Han, QinGhua Hu

When there are large differences in data distribution among clients, it is crucial for federated learning to design a reliable client selection strategy and an interpretable client communication framework to better utilize group knowledge.

Personalized Federated Learning

Paper
Add Code

Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification

1 code implementation • CVPR 2022 • Jiangtao Xie, Fei Long, Jiaming Lv, Qilong Wang, Peihua Li

Few-shot classification is a challenging problem as only very few training examples are given for each new task.

Ranked #38 on Few-Shot Image Classification on Mini-Imagenet 5-way (1-shot)

Classification domain classification +3

159

Paper
Code

Temporal-attentive Covariance Pooling Networks for Video Recognition

1 code implementation • NeurIPS 2021 • Zilin Gao, Qilong Wang, Bingbing Zhang, QinGhua Hu, Peihua Li

Then, a temporal covariance pooling performs temporal pooling of the attentive covariance representations to characterize both intra-frame correlations and inter-frame cross-correlations of the calibrated features.

Video Recognition

Paper
Code

Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters

1 code implementation • ICCV 2021 • Bowen Dong, Zitong Huang, Yuelin Guo, Qilong Wang, Zhenxing Niu, WangMeng Zuo

In this paper, we defend the problem setting for improving localization performance by leveraging the bounding box regression knowledge from a well-annotated auxiliary dataset.

Object object-detection +3

Paper
Code

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark

1 code implementation • CVPR 2021 • Longyin Wen, Dawei Du, Pengfei Zhu, QinGhua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu

To promote the developments of object detection, tracking and counting algorithms in drone-captured videos, we construct a benchmark with a new drone-captured largescale dataset, named as DroneCrowd, formed by 112 video clips with 33, 600 HD frames in various scenarios.

object-detection Object Detection +1

148

Paper
Code

So-ViT: Mind Visual Tokens for Vision Transformer

1 code implementation • 22 Apr 2021 • Jiangtao Xie, Ruiren Zeng, Qilong Wang, Ziqi Zhou, Peihua Li

Therefore, we propose a new classification paradigm, where the second-order, cross-covariance pooling of visual tokens is combined with class token for final classification.

Classification General Classification

Paper
Code

What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective

1 code implementation • CVPR 2020 • Qilong Wang, Li Zhang, Banggu Wu, Dongwei Ren, Peihua Li, WangMeng Zuo, QinGhua Hu

Recent works have demonstrated that global covariance pooling (GCP) has the ability to improve performance of deep convolutional neural networks (CNNs) on visual classification task.

Instance Segmentation object-detection +2

Paper
Code

Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network

1 code implementation • 4 Dec 2019 • Longyin Wen, Dawei Du, Pengfei Zhu, QinGhua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu

This paper proposes a space-time multi-scale attention network (STANet) to solve density map estimation, localization and tracking in dense crowds of video clips captured by drones with arbitrary crowd density, perspective, and flight altitude.

Crowd Counting

148

Paper
Code

ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

12 code implementations • CVPR 2020 • Qilong Wang, Banggu Wu, Pengfei Zhu, Peihua Li, WangMeng Zuo, QinGhua Hu

By dissecting the channel attention module in SENet, we empirically show avoiding dimensionality reduction is important for learning channel attention, and appropriate cross-channel interaction can preserve performance while significantly decreasing model complexity.

Ranked #735 on Image Classification on ImageNet

Dimensionality Reduction Image Classification +4

29,774

Paper
Code

Neural Blind Deconvolution Using Deep Priors

1 code implementation • CVPR 2020 • Dongwei Ren, Kai Zhang, Qilong Wang, QinGhua Hu, WangMeng Zuo

To connect MAP and deep models, we in this paper present two generative networks for respectively modeling the deep priors of clean image and blur kernel, and propose an unconstrained neural optimization solution to blind deconvolution.

Deblurring Self-Supervised Learning

329

Paper
Code

Data Augmentation for Object Detection via Progressive and Selective Instance-Switching

1 code implementation • 2 Jun 2019 • Hao Wang, Qilong Wang, Fan Yang, Weiqi Zhang, WangMeng Zuo

For guiding our IS to obtain better object performance, we explore issues of instance imbalance and class importance in datasets, which frequently occur and bring adverse effect on detection performance.

Data Augmentation Instance Segmentation +2

Paper
Code

Deep Global Generalized Gaussian Networks

1 code implementation • CVPR 2019 • Qilong Wang, Peihua Li, Qinghua Hu, Pengfei Zhu, Wangmeng Zuo

To handle this issue, this paper proposes a novel deep global generalized Gaussian network (3G-Net), whose core is to estimate a global covariance of generalized Gaussian for modeling the last convolutional activations.

Rolling Shutter Correction

Paper
Code

Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization

3 code implementations • 15 Apr 2019 • Qilong Wang, Jiangtao Xie, WangMeng Zuo, Lei Zhang, Peihua Li

The proposed methods are highly modular, readily plugged into existing deep CNNs.

Ranked #1 on Image Classification on iNaturalist (Top 3 Error metric)

Fine-Grained Visual Recognition General Classification +3

268

Paper
Code

Global Gated Mixture of Second-order Pooling for Improving Deep Convolutional Neural Networks

1 code implementation • NeurIPS 2018 • Qilong Wang, Zilin Gao, Jiangtao Xie, WangMeng Zuo, Peihua Li

However, both GAP and existing HOP methods assume unimodal distributions, which cannot fully capture statistics of convolutional activations, limiting representation ability of deep CNNs, especially for samples with complex contents.

Paper
Code

Global Second-order Pooling Convolutional Networks

1 code implementation • CVPR 2019 • Zilin Gao, Jiangtao Xie, Qilong Wang, Peihua Li

Deep Convolutional Networks (ConvNets) are fundamental to, besides large-scale visual recognition, a lot of vision tasks.

Object Recognition

129

Paper
Code

Multi-scale Location-aware Kernel Representation for Object Detection

2 code implementations • CVPR 2018 • Hao Wang, Qilong Wang, Mingqi Gao, Peihua Li, WangMeng Zuo

Our MLKP can be efficiently computed on a modified multi-scale feature map using a low-dimensional polynomial kernel approximation. Moreover, different from existing orderless global representations based on high-order statistics, our proposed MLKP is location retentive and sensitive so that it can be flexibly adopted to object detection.

General Classification Object +2

107

Paper
Code

Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization

4 code implementations • CVPR 2018 • Peihua Li, Jiangtao Xie, Qilong Wang, Zilin Gao

Towards addressing this problem, we propose an iterative matrix square root normalization method for fast end-to-end training of global covariance pooling networks.

Ranked #14 on Fine-Grained Image Classification on CUB-200-2011

Fine-Grained Image Classification Fine-Grained Image Recognition

2,917

Paper
Code

G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition

no code implementations • CVPR 2017 • Qilong Wang, Peihua Li, Lei Zhang

Recently, plugging trainable structural layers into deep convolutional neural networks (CNNs) as image representations has made promising progress.

Paper
Add Code

Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation

3 code implementations • CVPR 2017 • Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, WangMeng Zuo

Specifically, we introduce class-specific auxiliary weights into the original MMD for exploiting the class prior probability on source and target domains, whose challenge lies in the fact that the class label in target domain is unavailable.

Unsupervised Domain Adaptation

Paper
Code

Is Second-order Information Helpful for Large-scale Visual Recognition?

1 code implementation • ICCV 2017 • Peihua Li, Jiangtao Xie, Qilong Wang, WangMeng Zuo

The main challenges involved are robust covariance estimation given a small sample of large-dimensional features and usage of the manifold structure of covariance matrices.

Object Recognition

Paper
Code

RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian With Application to Material Recognition

no code implementations • CVPR 2016 • Qilong Wang, Peihua Li, WangMeng Zuo, Lei Zhang

Infinite dimensional covariance descriptors can provide richer and more discriminative information than their low dimensional counterparts.

Material Recognition

Paper
Add Code

Towards Effective Codebookless Model for Image Classification

no code implementations • 9 Jul 2015 • Qilong Wang, Peihua Li, Lei Zhang, WangMeng Zuo

The bag-of-features (BoF) model for image classification has been thoroughly studied over the last decade.

Classification General Classification +2

Paper
Add Code

From Dictionary of Visual Words to Subspaces: Locality-Constrained Affine Subspace Coding

no code implementations • CVPR 2015 • Peihua Li, Xiaoxiao Lu, Qilong Wang

The locality-constrained linear coding (LLC) is a very successful feature coding method in image classification.

General Classification Image Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.