Search Results for author: Kun Ding

Found 8 papers, 3 papers with code

PackDet: Packed Long-Head Object Detector

1 code implementation ECCV 2020 Kun Ding, Guojin He, Huxiang Gu, Zisha Zhong, Shiming Xiang, Chunhong Pan

State-of-the-art object detectors exploit multi-branch structure and predict objects at several different scales, although substantially boosted accuracy is acquired, low efficiency is inevitable as fragmented structure is hardware unfriendly.

Object

Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning

no code implementations31 Mar 2024 Kun Ding, Haojian Zhang, Qiang Yu, Ying Wang, Shiming Xiang, Chunhong Pan

The idea is realized by exploiting out-of-distribution (OOD) detection to predict whether a sample belongs to a base distribution or a novel distribution and then using the score generated by a dedicated competition based scoring function to fuse the zero-shot and few-shot classifier.

Out of Distribution (OOD) Detection

Compositional Kronecker Context Optimization for Vision-Language Models

no code implementations18 Mar 2024 Kun Ding, Xiaohui Li, Qiang Yu, Ying Wang, Haojian Zhang, Shiming Xiang

Context Optimization (CoOp) has emerged as a simple yet effective technique for adapting CLIP-like vision-language models to downstream image recognition tasks.

Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection

no code implementations4 Mar 2024 Jieren Deng, Haojian Zhang, Kun Ding, Jianhua Hu, Xingxuan Zhang, Yunkuan Wang

This paper presents Incremental Vision-Language Object Detection (IVLOD), a novel learning task designed to incrementally adapt pre-trained Vision-Language Object Detection Models (VLODMs) to various specialized domains, while simultaneously preserving their zero-shot generalization capabilities for the generalized domain.

Incremental Learning object-detection +2

PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images

1 code implementation13 Dec 2023 Tao Zhang, Kun Ding, Jinyong Wen, Yu Xiong, Zeyu Zhang, Shiming Xiang, Chunhong Pan

Self-supervised learning (SSL) for RGB images has achieved significant success, yet there is still limited research on SSL for infrared images, primarily due to three prominent challenges: 1) the lack of a suitable large-scale infrared pre-training dataset, 2) the distinctiveness of non-iconic infrared images rendering common pre-training tasks like masked image modeling (MIM) less effective, and 3) the scarcity of fine-grained textures making it particularly challenging to learn general image features.

Self-Supervised Learning

Prompt Tuning with Soft Context Sharing for Vision-Language Models

1 code implementation29 Aug 2022 Kun Ding, Ying Wang, Pengzhang Liu, Qiang Yu, Haojian Zhang, Shiming Xiang, Chunhong Pan

Inspired by the fact that modeling task relationship by multi-task learning can usually boost performance, we propose a novel method SoftCPT (Soft Context Sharing for Prompt Tuning) to tune pre-trained vision-language models on multiple target few-shot tasks jointly.

Few-Shot Learning Multi-Task Learning

AMVH: Asymmetric Multi-Valued Hashing

no code implementations CVPR 2017 Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan

(2) A multi-integer-embedding is employed for compressing the whole database, which is modeled by binary sparse representation with fixed sparsity.

kNN Hashing With Factorized Neighborhood Representation

no code implementations ICCV 2015 Kun Ding, Chunlei Huo, Bin Fan, Chunhong Pan

Hashing is very effective for many tasks in reducing the processing time and in compressing massive databases.

Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.