Search Results for author: Guiguang Ding

Found 55 papers, 29 papers with code

YOLOv10: Real-Time End-to-End Object Detection

1 code implementation • 23 May 2024 • Ao Wang, Hui Chen, Lihao Liu, Kai Chen, Zijia Lin, Jungong Han, Guiguang Ding

In this work, we aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture.

Data Augmentation Object +2

6,340

Paper
Code

More is Better: Deep Domain Adaptation with Multiple Sources

no code implementations • 1 May 2024 • Sicheng Zhao, Hui Chen, Hu Huang, Pengfei Xu, Guiguang Ding

Domain adaptation (DA) aims to address this problem by aligning the distributions between the source and target domains.

Domain Adaptation

Paper
Add Code

Temporal Scaling Law for Large Language Models

no code implementations • 27 Apr 2024 • Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang Ding

We first investigate the imbalance of loss on each token positions and develop a reciprocal-law across model scales and training stages.

Paper
Add Code

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal

no code implementations • 27 Apr 2024 • Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang Ding

Due to their infrequent appearance in the text corpus, Scaffold Tokens pose a learning imbalance issue for language models.

Language Modelling Machine Translation

Paper
Add Code

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

no code implementations • 14 Mar 2024 • Yizhe Xiong, Hui Chen, Tianxiang Hao, Zijia Lin, Jungong Han, Yuesong Zhang, Guoxin Wang, Yongjun Bao, Guiguang Ding

Consequently, a simple combination of them cannot guarantee accomplishing both training efficiency and inference efficiency with minimal costs.

Model Compression

Paper
Add Code

Debiased Novel Category Discovering and Localization

no code implementations • 29 Feb 2024 • Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding

In recent years, object detection in deep learning has experienced rapid development.

Contrastive Learning Novel Class Discovery +4

Paper
Add Code

One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications

1 code implementation • 26 Dec 2023 • Mengyao Lyu, Yuhong Yang, Haiwen Hong, Hui Chen, Xuan Jin, Yuan He, Hui Xue, Jungong Han, Guiguang Ding

The prevalent use of commercial and open-source diffusion models (DMs) for text-to-image generation prompts risk mitigation to prevent undesired behaviors.

Text-to-Image Generation

112

Paper
Code

Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters

no code implementations • 17 Dec 2023 • Tianxiang Hao, Mengyao Lyu, Hui Chen, Sicheng Zhao, Jungong Han, Guiguang Ding

On the other hand, complicated structures and update rules largely increase the computation and storage cost.

Language Modelling

Paper
Add Code

RepViT-SAM: Towards Real-Time Segmenting Anything

2 code implementations • 10 Dec 2023 • Ao Wang, Hui Chen, Zijia Lin, Jungong Han, Guiguang Ding

Here, to achieve real-time segmenting anything on mobile devices, following MobileSAM, we replace the heavyweight image encoder in SAM with RepViT model, ending up with the RepViT-SAM model.

617

Paper
Code

MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models

1 code implementation • 30 Oct 2023 • Zhenpeng Su, Xing Wu, Xue Bai, Zijia Lin, Hui Chen, Guiguang Ding, Wei Zhou, Songlin Hu

Experiments reveal that models incorporating the proposed MiLe Loss can gain consistent performance improvement on downstream benchmarks.

Ranked #90 on Multi-task Language Understanding on MMLU

Language Modelling Multi-task Language Understanding

Paper
Code

CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs

no code implementations • 27 Sep 2023 • Ao Wang, Hui Chen, Zijia Lin, Sicheng Zhao, Jungong Han, Guiguang Ding

We further employ a consistent dynamic channel pruning (CDCP) strategy to dynamically prune unimportant channels in ViTs.

Model Compression Semantic Segmentation

Paper
Add Code

Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation

1 code implementation • ICCV 2023 • Yizhe Xiong, Hui Chen, Zijia Lin, Sicheng Zhao, Guiguang Ding

To address this issue, recent works consider the Few-shot Unsupervised Domain Adaptation (FUDA) where only a few source samples are labeled, and conduct knowledge transfer via self-supervised learning methods.

Self-Supervised Learning Transfer Learning +1

Paper
Code

RepViT: Revisiting Mobile CNN From ViT Perspective

7 code implementations • 18 Jul 2023 • Ao Wang, Hui Chen, Zijia Lin, Jungong Han, Guiguang Ding

Recently, lightweight Vision Transformers (ViTs) demonstrate superior performance and lower latency, compared with lightweight Convolutional Neural Networks (CNNs), on resource-constrained mobile devices.

30,231

Paper
Code

Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation

1 code implementation • 30 Apr 2023 • Tianxiang Hao, Hui Chen, Yuchen Guo, Guiguang Ding

To further enhance the model's capacity to transfer knowledge under a constrained storage budget and keep inference efficient, we consolidate the parameters in two stages: 1. between adaptation and storage, and 2. between loading and inference.

Paper
Code

Box-Level Active Detection

1 code implementation • CVPR 2023 • Mengyao Lyu, Jundong Zhou, Hui Chen, YiJie Huang, Dongdong Yu, Yaqian Li, Yandong Guo, Yuchen Guo, Liuyu Xiang, Guiguang Ding

Active learning selects informative samples for annotation within budget, which has proven efficient recently on object detection.

Active Learning object-detection +1

Paper
Code

X-ReID: Cross-Instance Transformer for Identity-Level Person Re-Identification

no code implementations • 4 Feb 2023 • Leqi Shen, Tao He, Yuchen Guo, Guiguang Ding

In this paper, we propose to promote Instance-Level features to Identity-Level features by employing cross-attention to incorporate information from one image to another of the same identity, thus more unified and discriminative pedestrian information can be obtained.

Person Re-Identification

Paper
Add Code

Confidence-guided Centroids for Unsupervised Person Re-Identification

no code implementations • 22 Nov 2022 • Yunqi Miao, Jiankang Deng, Guiguang Ding, Jungong Han

Since samples with high confidence are exclusively involved in the formation of centroids, the identity information of low-confidence samples, i. e., boundary samples, are NOT likely to contribute to the corresponding centroid.

Pseudo Label Retrieval +1

Paper
Add Code

Ground Plane Matters: Picking Up Ground Plane Prior in Monocular 3D Object Detection

no code implementations • 3 Nov 2022 • Fan Yang, Xinhao Xu, Hui Chen, Yuchen Guo, Jungong Han, Kai Ni, Guiguang Ding

To pick up the ground plane prior for M3OD, we propose a Ground Plane Enhanced Network (GPENet) which resolves both issues at one go.

Monocular 3D Object Detection object-detection

Paper
Add Code

Re-parameterizing Your Optimizers rather than Architectures

1 code implementation • 30 May 2022 • Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Kaiqi Huang, Jungong Han, Guiguang Ding

For the extreme simplicity of model structure, we focus on a VGG-style plain model and showcase that such a simple model trained with a RepOptimizer, which is referred to as RepOpt-VGG, performs on par with or better than the recent well-designed models.

Quantization

246

Paper
Code

A High-Accuracy Unsupervised Person Re-identification Method Using Auxiliary Information Mined from Datasets

1 code implementation • 6 May 2022 • Hehan Teng, Tao He, Yuchen Guo, Guiguang Ding

Combined with auxiliary information exploiting modules, our methods achieve mAP of 89. 9% on DukeMTMC, where TOC, STS and SCP all contributed considerable performance improvements.

Ranked #1 on Unsupervised Person Re-Identification on MARS

STS Unsupervised Person Re-Identification

Paper
Code

A Free Lunch to Person Re-identification: Learning from Automatically Generated Noisy Tracklets

no code implementations • 2 Apr 2022 • Hehan Teng, Tao He, Yuchen Guo, Zhenhua Guo, Guiguang Ding

Extensive experiments on MARS with various manually generated noises show the effectiveness of the proposed framework.

Person Re-Identification Pseudo Label

Paper
Add Code

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

7 code implementations • CVPR 2022 • Xiaohan Ding, Xiangyu Zhang, Yizhuang Zhou, Jungong Han, Guiguang Ding, Jian Sun

We revisit large kernel design in modern convolutional neural networks (CNNs).

Ranked #75 on Image Classification on <h2>oi</h2>

Image Classification

3,030

Paper
Code

TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification

1 code implementation • 28 Dec 2021 • Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding

In TAGPerson, we extract information from target scenes and use them to control our parameterized rendering process to generate target-aware synthetic images, which would hold a smaller gap to the real images in the target domain.

Person Re-Identification

Paper
Code

RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality

4 code implementations • CVPR 2022 • Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Jungong Han, Guiguang Ding

Our results reveal that 1) Locality Injection is a general methodology for MLP models; 2) RepMLPNet has favorable accuracy-efficiency trade-off compared to the other MLPs; 3) RepMLPNet is the first MLP that seamlessly transfer to Cityscapes semantic segmentation.

Ranked #61 on Semantic Segmentation on Cityscapes val

Image Classification Semantic Segmentation

3,030

Paper
Code

Camera Bias Regularization for Person Re-identification

no code implementations • 29 Sep 2021 • Tao He, Tongkun Xu, Weihua Chen, Yuchen Guo, Guiguang Ding, Zhenhua Guo

Due to the discrepancies between cameras caused by illumination, background, or viewpoint, the underlying difficulty for Re-ID is the camera bias problem, which leads to the large gap of within-identity features from different cameras.

Person Re-Identification

Paper
Add Code

LODE: Deep Local Deblurring and A New Benchmark

1 code implementation • 19 Sep 2021 • Zerun Wang, Liuyu Xiang, Fan Yang, Jinzhao Qian, Jie Hu, Haidong Huang, Jungong Han, Yuchen Guo, Guiguang Ding

While recent deep deblurring algorithms have achieved remarkable progress, most existing methods focus on the global deblurring problem, where the image blur mostly arises from severe camera shake.

Deblurring

Paper
Code

Emotion Recognition from Multiple Modalities: Fundamentals and Methodologies

no code implementations • 18 Aug 2021 • Sicheng Zhao, Guoli Jia, Jufeng Yang, Guiguang Ding, Kurt Keutzer

In this tutorial, we discuss several key aspects of multi-modal emotion recognition (MER).

Domain Adaptation Emotional Intelligence +2

Paper
Add Code

Manipulating Identical Filter Redundancy for Efficient Pruning on Deep and Complicated CNN

2 code implementations • 30 Jul 2021 • Xiaohan Ding, Tianxiang Hao, Jungong Han, Yuchen Guo, Guiguang Ding

The existence of redundancy in Convolutional Neural Networks (CNNs) enables us to remove some filters/channels with acceptable performance drops.

Network Pruning

Paper
Code

Affective Image Content Analysis: Two Decades Review and New Perspectives

no code implementations • 30 Jun 2021 • Sicheng Zhao, Xingxu Yao, Jufeng Yang, Guoli Jia, Guiguang Ding, Tat-Seng Chua, Björn W. Schuller, Kurt Keutzer

Images can convey rich semantics and induce various emotions in viewers.

Emotional Intelligence Emotion Recognition +1

Paper
Add Code

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

10 code implementations • 5 May 2021 • Xiaohan Ding, Chunlong Xia, Xiangyu Zhang, Xiaojie Chu, Jungong Han, Guiguang Ding

We propose RepMLP, a multi-layer-perceptron-style neural network building block for image recognition, which is composed of a series of fully-connected (FC) layers.

Ranked #754 on Image Classification on <h2>oi</h2>

Face Recognition Image Classification +1

11,005

Paper
Code

Diverse Branch Block: Building a Convolution as an Inception-like Unit

2 code implementations • CVPR 2021 • Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding

We propose a universal building block of Convolutional Neural Network (ConvNet) to improve the performance without any inference-time costs.

Image Classification object-detection +2

309

Paper
Code

Computational Emotion Analysis From Images: Recent Advances and Future Directions

no code implementations • 19 Mar 2021 • Sicheng Zhao, Quanwei Huang, YouBao Tang, Xingxu Yao, Jufeng Yang, Guiguang Ding, Björn W. Schuller

Recently, extensive research efforts have been dedicated to understanding the emotions of images.

Domain Adaptation Emotion Recognition

Paper
Add Code

Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans

2 code implementations • 14 Jan 2021 • Xin He, Shihao Wang, Xiaowen Chu, Shaohuai Shi, Jiangping Tang, Xin Liu, Chenggang Yan, Jiyong Zhang, Guiguang Ding

The experimental results show that our automatically searched models (CovidNet3D) outperform the baseline human-designed models on the three datasets with tens of times smaller model size and higher accuracy.

Benchmarking Medical Diagnosis +1

Paper
Code

RepVGG: Making VGG-style ConvNets Great Again

23 code implementations • CVPR 2021 • Xiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, Jian Sun

We present a simple but powerful architecture of convolutional neural network, which has a VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and ReLU, while the training-time model has a multi-branch topology.

Ranked #44 on Semantic Segmentation on Cityscapes val

Image Classification Semantic Segmentation

30,231

Paper
Code

Emotional Semantics-Preserved and Feature-Aligned CycleGAN for Visual Emotion Adaptation

no code implementations • 25 Nov 2020 • Sicheng Zhao, Xuanbai Chen, Xiangyu Yue, Chuang Lin, Pengfei Xu, Ravi Krishna, Jufeng Yang, Guiguang Ding, Alberto L. Sangiovanni-Vincentelli, Kurt Keutzer

First, we generate an adapted domain to align the source and target domains on the pixel-level by improving CycleGAN with a multi-scale structured cycle-consistency loss.

Emotion Classification Emotion Recognition +1

Paper
Add Code

ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting

6 code implementations • ICCV 2021 • Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding

Via training with regular SGD on the former but a novel update rule with penalty gradients on the latter, we realize structured sparsity.

309

Paper
Code

Shallow Feature Based Dense Attention Network for Crowd Counting

no code implementations • 17 Jun 2020 • Yunqi Miao, Zijia Lin, Guiguang Ding, Jungong Han

In this paper, we propose a Shallow feature based Dense Attention Network (SDANet) for crowd counting from still images, which diminishes the impact of backgrounds via involving a shallow feature based attention model, and meanwhile, captures multi-scale information via densely connecting hierarchical image features.

Crowd Counting

Paper
Add Code

Personalized Image Aesthetics Assessment via Meta-Learning With Bilevel Gradient Optimization

1 code implementation • IEEE Transactions on Cybernetics 2020 • Hancheng Zhu, Leida Li, Jinjian Wu, Sicheng Zhao, Guiguang Ding, and Guangming Shi

Typical image aesthetics assessment (IAA) is modeled for the generic aesthetics perceived by an ``average'' user.

Aesthetics Quality Assessment Meta-Learning

Paper
Code

PANDA: A Gigapixel-level Human-centric Video Dataset

no code implementations • CVPR 2020 • Xueyang Wang, Xiya Zhang, Yinheng Zhu, Yuchen Guo, Xiaoyun Yuan, Liuyu Xiang, Zerun Wang, Guiguang Ding, David J. Brady, Qionghai Dai, Lu Fang

We believe PANDA will contribute to the community of artificial intelligence and praxeology by understanding human behaviors and interactions in large-scale real-world scenes.

4k Attribute +1

Paper
Add Code

IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval

1 code implementation • CVPR 2020 • Hui Chen, Guiguang Ding, Xudong Liu, Zijia Lin, Ji Liu, Jungong Han

Existing methods leverage the attention mechanism to explore such correspondence in a fine-grained manner.

Ranked #19 on Cross-Modal Retrieval on Flickr30k

Cross-Modal Retrieval Retrieval +1

Paper
Code

Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification

1 code implementation • ECCV 2020 • Liuyu Xiang, Guiguang Ding, Jungong Han

We refer to these models as 'Experts', and the proposed LFME framework aggregates the knowledge from multiple 'Experts' to learn a unified student model.

Ranked #26 on Long-tail Learning on Places-LT

General Classification Knowledge Distillation +1

Paper
Code

Global Sparse Momentum SGD for Pruning Very Deep Neural Networks

4 code implementations • NeurIPS 2019 • Xiaohan Ding, Guiguang Ding, Xiangxin Zhou, Yuchen Guo, Jungong Han, Ji Liu

Deep Neural Network (DNN) is powerful but computationally expensive and memory intensive, thus impeding its practical usage on resource-constrained front-end devices.

Model Compression

821

Paper
Code

PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression

1 code implementation • 11 Sep 2019 • Sicheng Zhao, Zizhou Jia, Hui Chen, Leida Li, Guiguang Ding, Kurt Keutzer

By optimizing the PCR loss, PDANet can generate a polarity preserved attention map and thus improve the emotion regression performance.

Deep Attention Emotion Classification +2

Paper
Code

ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks

5 code implementations • ICCV 2019 • Xiaohan Ding, Yuchen Guo, Guiguang Ding, Jungong Han

We propose Asymmetric Convolution Block (ACB), an architecture-neutral structure as a CNN building block, which uses 1D asymmetric convolutions to strengthen the square convolution kernels.

Attribute

5,295

Paper
Code

GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition

1 code implementation • 12 Jul 2019 • Hui Chen, Zijia Lin, Guiguang Ding, JianGuang Lou, Yusen Zhang, Borje Karlsson

The dominant approaches for named entity recognition (NER) mostly adopt complex recurrent neural networks (RNN), e. g., long-short-term-memory (LSTM).

Ranked #23 on Named Entity Recognition (NER) on Ontonotes v5 (English)

Computational Efficiency named-entity-recognition +5

265

Paper
Code

Incremental Few-Shot Learning for Pedestrian Attribute Recognition

no code implementations • 2 Jun 2019 • Liuyu Xiang, Xiaoming Jin, Guiguang Ding, Jungong Han, Leida Li

Pedestrian attribute recognition has received increasing attention due to its important role in video surveillance applications.

Attribute Few-Shot Learning +1

Paper
Add Code

Adaptive Region Embedding for Text Classification

no code implementations • 28 May 2019 • Liuyu Xiang, Xiaoming Jin, Lan Yi, Guiguang Ding

Deep learning models such as convolutional neural networks and recurrent networks are widely applied in text classification.

General Classification text-classification +1

Paper
Add Code

Approximated Oracle Filter Pruning for Destructive CNN Width Optimization

1 code implementation • 12 May 2019 • Xiaohan Ding, Guiguang Ding, Yuchen Guo, Jungong Han, Chenggang Yan

It is not easy to design and run Convolutional Neural Networks (CNNs) due to: 1) finding the optimal number of filters (i. e., the width) at each layer is tricky, given an architecture; and 2) the computational intensity of CNNs impedes the deployment on computationally limited devices.

Paper
Code

Centripetal SGD for Pruning Very Deep Convolutional Networks with Complicated Structure

1 code implementation • CVPR 2019 • Xiaohan Ding, Guiguang Ding, Yuchen Guo, Jungong Han

The redundancy is widely recognized in Convolutional Neural Networks (CNNs), which enables to remove unimportant filters from convolutional layers so as to slim the network with acceptable performance drop.

Paper
Code

From Zero-shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

no code implementations • CVPR 2017 • Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han

Using the proposed Unseen Visual Data Synthesis (UVDS) algorithm, semantic attributes are effectively utilised as an intermediate clue to synthesise unseen visual features at the training stage.

General Classification Object Recognition +1

Paper
Add Code

Semantics-Preserving Hashing for Cross-View Retrieval

no code implementations • CVPR 2015 • Zijia Lin, Guiguang Ding, Mingqing Hu, Jian-Min Wang

With benefits of low storage costs and high query speeds, hashing methods are widely researched for efficiently retrieving large-scale data, which commonly contains multiple views, e. g. a news report with images, videos and texts.

Retrieval

Paper
Add Code

Transfer Joint Matching for Unsupervised Domain Adaptation

no code implementations • CVPR 2014 • Mingsheng Long, Jian-Min Wang, Guiguang Ding, Jiaguang Sun, Philip S. Yu

Visual domain adaptation, which learns an accurate classifier for a new domain using labeled images from an old domain, has shown promising value in computer vision yet still been a challenging problem.

Dimensionality Reduction Unsupervised Domain Adaptation

Paper
Add Code

Collective Matrix Factorization Hashing for Multimodal Data

no code implementations • CVPR 2014 • Guiguang Ding, Yuchen Guo, Jile Zhou

In this paper, we study the problems of learning hash functions in the context of multimodal data for cross-view similarity search.

Information Retrieval Retrieval

Paper
Add Code

Image Tag Completion via Image-Specific and Tag-Specific Linear Sparse Reconstructions

no code implementations • CVPR 2013 • Zijia Lin, Guiguang Ding, Mingqing Hu, Jian-Min Wang, Xiaojun Ye

Though widely utilized for facilitating image management, user-provided image tags are usually incomplete and insufficient to describe the whole semantic content of corresponding images, resulting in performance degradations in tag-dependent applications and thus necessitating effective tag completion methods.

Management TAG

Paper
Add Code

Transfer Sparse Coding for Robust Image Representation

no code implementations • CVPR 2013 • Mingsheng Long, Guiguang Ding, Jian-Min Wang, Jiaguang Sun, Yuchen Guo, Philip S. Yu

In this paper, we propose a Transfer Sparse Coding (TSC) approach to construct robust sparse representations for classifying cross-distribution images accurately.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.