Search Results for author: Jianlong Chang

Found 27 papers, 8 papers with code

When Parameter-efficient Tuning Meets General-purpose Vision-language Models

1 code implementation • 16 Dec 2023 • Yihang Zhai, Haixin Wang, Jianlong Chang, Xinlong Yang, Jinan Sun, Shikun Zhang, Qi Tian

Instruction tuning has shown promising potential for developing general-purpose AI capabilities by using large-scale pre-trained models and boosts growing research to integrate multimodal information for creative applications.

Paper
Code

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

no code implementations • 14 Jun 2023 • Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Kaifeng Bi, Xiaotao Gu, Jianlong Chang, Qi Tian

In this paper, we start with a conceptual definition of AGI and briefly review how NLP solves a wide range of tasks via a chat system.

Paper
Add Code

Visual Tuning

no code implementations • 10 May 2023 • Bruce X. B. Yu, Jianlong Chang, Haixin Wang, Lingbo Liu, Shijie Wang, Zhiyu Wang, Junfan Lin, Lingxi Xie, Haojie Li, Zhouchen Lin, Qi Tian, Chang Wen Chen

With the surprising development of pre-trained visual foundation models, visual tuning jumped out of the standard modus operandi that fine-tunes the whole pre-trained model or just the fully connected layer.

Paper
Add Code

LION: Implicit Vision Prompt Tuning

no code implementations • 17 Mar 2023 • Haixin Wang, Jianlong Chang, Xiao Luo, Jinan Sun, Zhouchen Lin, Qi Tian

Despite recent competitive performance across a range of vision tasks, vision Transformers still have an issue of heavy computational costs.

Transfer Learning

Paper
Add Code

Constraint and Union for Partially-Supervised Temporal Sentence Grounding

no code implementations • 20 Feb 2023 • Chen Ju, Haicheng Wang, Jinxiang Liu, Chaofan Ma, Ya zhang, Peisen Zhao, Jianlong Chang, Qi Tian

Temporal sentence grounding aims to detect the event timestamps described by the natural language query from given untrimmed videos.

Sentence Temporal Sentence Grounding

Paper
Add Code

Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator

no code implementations • CVPR 2023 • Shijie Wang, Jianlong Chang, Haojie Li, Zhihui Wang, Wanli Ouyang, Qi Tian

PLEor could leverage pre-trained CLIP model to infer the discrepancies encompassing both pre-defined and unknown subcategories, called category-specific discrepancies, and transfer them to the backbone network trained in the close-set scenarios.

Knowledge Distillation Retrieval +1

Paper
Add Code

Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

no code implementations • CVPR 2023 • Chen Ju, Kunhao Zheng, Jinxiang Liu, Peisen Zhao, Ya zhang, Jianlong Chang, Yanfeng Wang, Qi Tian

And as a result, the dual-branch complementarity is effectively fused to promote a strong alliance.

Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization

Paper
Add Code

Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training

1 code implementation • CVPR 2023 • Junfan Lin, Jianlong Chang, Lingbo Liu, Guanbin Li, Liang Lin, Qi Tian, Chang Wen Chen

During inference, instead of changing the motion generator, our method reformulates the input text into a masked motion as the prompt for the motion generator to ``reconstruct'' the motion.

Language Modelling Zero-Shot Learning

Paper
Code

Towards a Unified View on Visual Parameter-Efficient Transfer Learning

1 code implementation • 3 Oct 2022 • Bruce X. B. Yu, Jianlong Chang, Lingbo Liu, Qi Tian, Chang Wen Chen

Towards this goal, we propose a framework with a unified view of PETL called visual-PETL (V-PETL) to investigate the effects of different PETL techniques, data scales of downstream domains, positions of trainable parameters, and other aspects affecting the trade-off.

Action Recognition Image Classification +2

Paper
Code

Prompt-Matched Semantic Segmentation

no code implementations • 22 Aug 2022 • Lingbo Liu, Jianlong Chang, Bruce X. B. Yu, Liang Lin, Qi Tian, Chang-Wen Chen

Previous methods usually fine-tuned the entire networks for each specific dataset, which will be burdensome to store massive parameters of these networks.

Representation Learning Segmentation +2

Paper
Add Code

Fine-grained Retrieval Prompt Tuning

no code implementations • 29 Jul 2022 • Shijie Wang, Jianlong Chang, Zhihui Wang, Haojie Li, Wanli Ouyang, Qi Tian

In this paper, we develop Fine-grained Retrieval Prompt Tuning (FRPT), which steers a frozen pre-trained model to perform the fine-grained retrieval task from the perspectives of sample prompting and feature adaptation.

Retrieval

Paper
Add Code

Pro-tuning: Unified Prompt Tuning for Vision Tasks

no code implementations • 28 Jul 2022 • Xing Nie, Bolin Ni, Jianlong Chang, Gaomeng Meng, Chunlei Huo, Zhaoxiang Zhang, Shiming Xiang, Qi Tian, Chunhong Pan

To this end, we propose parameter-efficient Prompt tuning (Pro-tuning) to adapt frozen vision models to various downstream vision tasks.

Adversarial Robustness Image Classification +4

Paper
Add Code

HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval

no code implementations • 24 May 2022 • Feilong Chen, Xiuyi Chen, Jiaxin Shi, Duzhen Zhang, Jianlong Chang, Qi Tian

It also achieves about +4. 9 AR on COCO and +3. 8 AR on Flickr30K than LightingDot and achieves comparable performance with the state-of-the-art (SOTA) fusion-based model METER.

Cross-Modal Retrieval Retrieval +1

Paper
Add Code

AME: Attention and Memory Enhancement in Hyper-Parameter Optimization

no code implementations • CVPR 2022 • Nuo Xu, Jianlong Chang, Xing Nie, Chunlei Huo, Shiming Xiang, Chunhong Pan

Training Deep Neural Networks (DNNs) is inherently subject to sensitive hyper-parameters and untimely feedbacks of performance evaluation.

Image Classification object-detection +2

Paper
Add Code

Deep Encryption: Protecting Pre-Trained Neural Networks with Confusion Neurons

no code implementations • 29 Sep 2021 • Mengbiao Zhao, Shixiong Xu, Jianlong Chang, Lingxi Xie, Jie Chen, Qi Tian

Having consumed huge amounts of training data and computational resource, large-scale pre-trained models are often considered key assets of AI service providers.

Position

Paper
Add Code

Differentiable Convolution Search for Point Cloud Processing

no code implementations • ICCV 2021 • Xing Nie, Yongcheng Liu, Shaohong Chen, Jianlong Chang, Chunlei Huo, Gaofeng Meng, Qi Tian, Weiming Hu, Chunhong Pan

It can work in a purely data-driven manner and thus is capable of auto-creating a group of suitable convolutions for geometric shape modeling.

Paper
Add Code

What Is Considered Complete for Visual Recognition?

no code implementations • 28 May 2021 • Lingxi Xie, Xiaopeng Zhang, Longhui Wei, Jianlong Chang, Qi Tian

This is an opinion paper.

Paper
Add Code

Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration

1 code implementation • CVPR 2021 • Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng

In the root-relative mesh recovery task, we exploit semantic relations among joints to generate a 3D mesh from the extracted 2D cues.

Position

325

Paper
Code

Can Semantic Labels Assist Self-Supervised Visual Representation Learning?

no code implementations • 17 Nov 2020 • Longhui Wei, Lingxi Xie, Jianzhong He, Jianlong Chang, Xiaopeng Zhang, Wengang Zhou, Houqiang Li, Qi Tian

Recently, contrastive learning has largely advanced the progress of unsupervised visual representation learning.

Contrastive Learning Representation Learning +1

Paper
Add Code

Weight-Sharing Neural Architecture Search: A Battle to Shrink the Optimization Gap

no code implementations • 4 Aug 2020 • Lingxi Xie, Xin Chen, Kaifeng Bi, Longhui Wei, Yuhui Xu, Zhengsu Chen, Lanfei Wang, An Xiao, Jianlong Chang, Xiaopeng Zhang, Qi Tian

Neural architecture search (NAS) has attracted increasing attentions in both academia and industry.

Neural Architecture Search

Paper
Add Code

Spatio-Temporal Graph Structure Learning for Traffic Forecasting

no code implementations • AAAI 2020 • Qi Zhang, Jianlong Chang, Gaofeng Meng, Shiming Xiang, Chunhong Pan

To address these issues, we propose a novel framework named Structure Learning Convolution (SLC) that enables to extend the traditional convolutional neural network (CNN) to graph domains and learn the graph structure for traffic forecasting.

Ranked #3 on Traffic Prediction on METR-LA

Graph structure learning Time Series +2

Paper
Add Code

Cross-Modality Paired-Images Generation for RGB-Infrared Person Re-Identification

2 code implementations • 10 Feb 2020 • Guan-An Wang, Tianzhu Zhang. Yang Yang, Jian Cheng, Jianlong Chang, Xu Liang, Zeng-Guang Hou

Second, given cross-modality unpaired-images of a person, our method can generate cross-modality paired images from exchanged images.

Person Re-Identification

498

Paper
Code

DATA: Differentiable ArchiTecture Approximation

1 code implementation • NeurIPS 2019 • Jianlong Chang, Xinbang Zhang, Yiwen Guo, Gaofeng Meng, Shiming Xiang, Chunhong Pan

Neural architecture search (NAS) is inherently subject to the gap of architectures during searching and validating.

Image Classification Language Modelling +2

Paper
Code

Differentiable Architecture Search with Ensemble Gumbel-Softmax

no code implementations • 6 May 2019 • Jianlong Chang, Xinbang Zhang, Yiwen Guo, Gaofeng Meng, Shiming Xiang, Chunhong Pan

For network architecture search (NAS), it is crucial but challenging to simultaneously guarantee both effectiveness and efficiency.

Neural Architecture Search

Paper
Add Code

Deep Discriminative Clustering Analysis

no code implementations • 5 May 2019 • Jianlong Chang, Yiwen Guo, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan

Traditional clustering methods often perform clustering with low-level indiscriminative representations and ignore relationships between patterns, resulting in slight achievements in the era of deep learning.

Clustering

Paper
Add Code

Structure-Aware Convolutional Neural Networks

1 code implementation • NeurIPS 2018 • Jianlong Chang, Jie Gu, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan

Convolutional neural networks (CNNs) are inherently subject to invariable filters that can only aggregate local inputs with the same topological structures.

Action Recognition Activity Detection +5

Paper
Code

Deep Adaptive Image Clustering

1 code implementation • ICCV 2017 • Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan

The main challenge is that the ground-truth similarities are unknown in image clustering.

Ranked #8 on Image Clustering on Tiny-ImageNet

Clustering Image Clustering

111

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.