Search Results for author: Hao Luo

Found 53 papers, 26 papers with code

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations • 1 Apr 2024 • Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.

Paper
Add Code

Text Data-Centric Image Captioning with Interactive Prompts

no code implementations • 28 Mar 2024 • Yiyu Wang, Hao Luo, Jungang Xu, Yingfei Sun, Fan Wang

Among them, the mainstream solution is to project image embeddings into the text embedding space with the assistance of consistent representations between image-text pairs from the CLIP model.

Image Captioning

Paper
Add Code

Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network

1 code implementation • 27 Feb 2024 • Zhaoyang Wang, Dongyang Li, Mingyang Zhang, Hao Luo, Maoguo Gong

Existing hyperspectral image (HSI) super-resolution (SR) methods struggle to effectively capture the complex spectral-spatial relationships and low-level details, while diffusion models represent a promising generative model known for their exceptional performance in modeling complex relations and learning high and low-level visual features.

Super-Resolution

Paper
Code

Accelerating Parallel Sampling of Diffusion Models

no code implementations • 15 Feb 2024 • Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang

Our experiments demonstrate that ParaTAA can decrease the inference steps required by common sequential sampling algorithms such as DDIM and DDPM by a factor of 4~14 times.

Image Generation

Paper
Add Code

Integrated Imaging and Communication with Reconfigurable Intelligent Surfaces

1 code implementation • 29 Jan 2024 • Hao Luo, Ahmed Alkhateeb

In particular, using the RIS as a wireless imaging device, our system constructs the scene depth map of the environment, including the mobile user.

Paper
Code

ISAC with Backscattering RFID Tags: Joint Beamforming Design

1 code implementation • 18 Jan 2024 • Hao Luo, Umut Demirhan, Ahmed Alkhateeb

Then, we study a joint beamforming design problem with the goal of minimizing the total transmit power while satisfying the tag detection and communication requirements.

TAG

Paper
Code

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

no code implementations • 14 Dec 2023 • Yabing Wang, Fan Wang, Jianfeng Dong, Hao Luo

Cross-lingual cross-modal retrieval has garnered increasing attention recently, which aims to achieve the alignment between vision and target language (V-T) without using any annotated V-T data pairs.

Cross-Lingual Transfer Cross-Modal Retrieval +4

Paper
Add Code

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations • 13 Nov 2023 • Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)

Person Search Text based Person Retrieval +2

Paper
Add Code

Deep Learning Enables Large Depth-of-Field Images for Sub-Diffraction-Limit Scanning Superlens Microscopy

no code implementations • 27 Oct 2023 • Hui Sun, Hao Luo, Feifei Wang, Qingjiu Chen, Meng Chen, Xiaoduo Wang, Haibo Yu, Guanglie Zhang, Lianqing Liu, JianPing Wang, Dapeng Wu, Wen Jung Li

Scanning electron microscopy (SEM) is indispensable in diverse applications ranging from microelectronics to food processing because it provides large depth-of-field images with a resolution beyond the optical diffraction limit.

Defect Detection Image-to-Image Translation +1

Paper
Add Code

Variational Quantum Linear Solver-based Combination Rules in Dempster–Shafer Theory

1 code implementation • journal 2023 • Hao Luo, Qianli Zhou, Zhen Li, Yong Deng

Dempster–Shafer Theory (DST), as a method of handling uncertain information, is widely used in decisionmaking and information fusion.

Paper
Code

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations • 15 Sep 2023 • Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Recently, many parameter-efficient fine-tuning (PEFT) methods have been proposed, and their experiments demonstrate that tuning only 1% of extra parameters could surpass full fine-tuning in low-data resource scenarios.

Domain Generalization Few-Shot Learning

Paper
Code

Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval

no code implementations • 11 Sep 2023 • Yabing Wang, Shuhui Wang, Hao Luo, Jianfeng Dong, Fan Wang, Meng Han, Xun Wang, Meng Wang

Therefore, we propose Dual-view Curricular Optimal Transport (DCOT) to learn with noisy correspondence in CCR.

Cross-Lingual Transfer Cross-Modal Retrieval +2

Paper
Add Code

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation From Scratch

no code implementations • 10 Sep 2023 • Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan. Z Li, Yang You

When applied to biological data, DiffAug improves performance by up to 10. 1%, with an average improvement of 5. 8%.

Contrastive Learning Data Augmentation +1

Paper
Add Code

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations • 7 Sep 2023 • Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Person Re-Identification

Paper
Add Code

Color Prompting for Data-Free Continual Unsupervised Domain Adaptive Person Re-Identification

1 code implementation • 21 Aug 2023 • Jianyang Gu, Hao Luo, Kai Wang, Wei Jiang, Yang You, Jian Zhao

In this work, we propose a Color Prompting (CoP) method for data-free continual unsupervised domain adaptive person Re-ID.

Domain Adaptive Person Re-Identification Person Re-Identification +1

Paper
Code

Revisiting Vision Transformer from the View of Path Ensemble

no code implementations • ICCV 2023 • Shuning Chang, Pichao Wang, Hao Luo, Fan Wang, Mike Zheng Shou

Therefore, we propose the path pruning and EnsembleScale skills for improvement, which cut out the underperforming paths and re-weight the ensemble components, respectively, to optimize the path combination and make the short paths focus on providing high-quality representation for subsequent paths.

Paper
Add Code

Millimeter Wave V2V Beam Tracking using Radar: Algorithms and Real-World Demonstration

1 code implementation • 3 Aug 2023 • Hao Luo, Umut Demirhan, Ahmed Alkhateeb

Utilizing radar sensing for assisting communication has attracted increasing interest thanks to its potential in dynamic environments.

Paper
Code

Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

4 code implementations • CVPR 2023 • Weihua Chen, Xianzhe Xu, Jian Jia, Hao Luo, Yaohua Wang, Fan Wang, Rong Jin, Xiuyu Sun

Unlike the existing self-supervised learning methods, prior knowledge from human images is utilized in SOLIDER to build pseudo semantic labels and import more semantic information into the learned representation.

Ranked #1 on Person Search on PRW

Human Parsing Pedestrian Attribute Recognition +6

6,005

Paper
Code

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft

1 code implementation • 19 Mar 2023 • Ziluo Ding, Hao Luo, Ke Li, Junpeng Yue, Tiejun Huang, Zongqing Lu

One of the essential missions in the AI research community is to build an autonomous embodied agent that can attain high-level performance across a wide spectrum of tasks.

Contrastive Learning Language Modelling +1

Paper
Code

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations • 14 Mar 2023 • Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

Paper
Add Code

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID

1 code implementation • CVPR 2023 • Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao

Neural Architecture Search (NAS) has been increasingly appealing to the society of object Re-Identification (ReID), for that task-specific architectures significantly improve the retrieval performance.

Ranked #8 on Vehicle Re-Identification on VehicleID Large

Image Classification Neural Architecture Search +3

Paper
Code

Model-Based Decentralized Policy Optimization

no code implementations • 16 Feb 2023 • Hao Luo, Jiechuan Jiang, Zongqing Lu

To help the policy improvement be stable and monotonic, we propose model-based decentralized policy optimization (MDPO), which incorporates a latent variable function to help construct the transition and reward function from an individual perspective.

Paper
Add Code

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

1 code implementation • 13 Jan 2023 • Jie Gui, Tuo Chen, Jing Zhang, Qiong Cao, Zhenan Sun, Hao Luo, DaCheng Tao

Deep supervised learning algorithms typically require a large volume of labeled data to achieve satisfactory performance.

Self-Supervised Learning

Paper
Code

A Survey on Transformers in Reinforcement Learning

no code implementations • 8 Jan 2023 • Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye

Transformer has been considered the dominating neural architecture in NLP and CV, mostly under supervised settings.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Good helper is around you: Attention-driven Masked Image Modeling

1 code implementation • 28 Nov 2022 • Zhengqi Liu, Jie Gui, Hao Luo

Most previous works mask patches of the image randomly, which underutilizes the semantic information that is beneficial to visual representation learning.

Representation Learning Self-Supervised Learning

Paper
Code

Reconfigurable Intelligent Surface Aided Wireless Sensing for Scene Depth Estimation

1 code implementation • 15 Nov 2022 • Abdelrahman Taha, Hao Luo, Ahmed Alkhateeb

In this paper, we propose to employ RIS-aided wireless sensing systems for scene depth estimation.

Depth Estimation

Paper
Code

VTC-LFC: Vision Transformer Compression with Low-Frequency Components

1 code implementation • NIPS 2022 • Zhenyu Wang, Hao Luo, Pichao Wang, Feng Ding, Fan Wang, Hao Li

Although Vision transformers (ViTs) have recently dominated many vision tasks, deploying ViT models on resource-limited devices remains a challenging problem.

Paper
Code

MimCo: Masked Image Modeling Pre-training with Contrastive Teacher

no code implementations • 7 Sep 2022 • Qiang Zhou, Chaohui Yu, Hao Luo, Zhibin Wang, Hao Li

Specifically, MimCo takes a pre-trained contrastive learning model as the teacher model and is pre-trained with two types of learning targets: patch-level and image-level reconstruction losses.

Contrastive Learning Self-Supervised Learning

Paper
Add Code

Dynamic Gradient Reactivation for Backward Compatible Person Re-identification

no code implementations • 12 Jul 2022 • Xiao Pan, Hao Luo, Weihua Chen, Fan Wang, Hao Li, Wei Jiang, Jianming Zhang, Jianyang Gu, Peike Li

To address this issue, we propose the Ranking-based Backward Compatible Learning (RBCL), which directly optimizes the ranking metric between new features and old features.

Person Re-Identification Retrieval

Paper
Add Code

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

2 code implementations • 23 Nov 2021 • Hao Luo, Pichao Wang, Yi Xu, Feng Ding, Yanxin Zhou, Fan Wang, Hao Li, Rong Jin

We first investigate self-supervised learning (SSL) methods with Vision Transformer (ViT) pretrained on unlabelled person images (the LUPerson dataset), and empirically find it significantly surpasses ImageNet supervised pre-training models on ReID tasks.

Ranked #1 on Unsupervised Person Re-Identification on Market-1501 (using extra training data)

Self-Supervised Learning Unsupervised Domain Adaptation +1

215

Paper
Code

Scaled ReLU Matters for Training Vision Transformers

no code implementations • 8 Sep 2021 • Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin

In this paper, we further investigate this problem and extend the above conclusion: only early convolutions do not help for stable training, but the scaled ReLU operation in the \textit{convolutional stem} (\textit{conv-stem}) matters.

Paper
Add Code

An Empirical Study of Vehicle Re-Identification on the AI City Challenge

1 code implementation • 20 May 2021 • Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, Fan Wang, Hao Li

We mainly focus on four points, i. e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.

Re-Ranking Retrieval +1

115

Paper
Code

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones

1 code implementation • 14 May 2021 • Chong Liu, Yuqi Zhang, Hao Luo, Jiasheng Tang, Weihua Chen, Xianzhe Xu, Fan Wang, Hao Li, Yi-Dong Shen

Multi-Target Multi-Camera Tracking has a wide range of applications and is the basis for many advanced inferences and predictions.

Clustering Vehicle Re-Identification

120

Paper
Code

Accelerated differential inclusion for convex optimization

no code implementations • 11 Mar 2021 • Hao Luo

This paper introduces a second-order differential inclusion for unconstrained convex optimization.

Optimization and Control 37M15, 34E10, 90C25

Paper
Add Code

A primal-dual flow for affine constrained convex optimization

no code implementations • 11 Mar 2021 • Hao Luo

We introduce a novel primal-dual flow for affine constrained convex optimization problems.

Optimization and Control 37M99, 37N40, 65K05, 90C25

Paper
Add Code

TransReID: Transformer-based Object Re-Identification

4 code implementations • ICCV 2021 • Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, Wei Jiang

Extracting robust feature representation is one of the key challenges in object re-identification (ReID).

Ranked #1 on Person Re-Identification on Market-1501-C

Object Person Re-Identification +1

750

Paper
Code

1st Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification

1 code implementation • 25 Dec 2020 • Jianyang Gu, Hao Luo, Weihua Chen, Yiqi Jiang, Yuqi Zhang, Shuting He, Fan Wang, Hao Li, Wei Jiang

Considering the large gap between the source domain and target domain, we focused on solving two biases that influenced the performance on domain adaptive pedestrian Re-ID and proposed a two-stage training procedure.

Domain Adaptation Pseudo Label

Paper
Code

Counterfactual-based minority oversampling for imbalanced classification

no code implementations • 21 Aug 2020 • Hao Luo, Li Liu

A key challenge of oversampling in imbalanced classification is that the generation of new minority samples often neglects the usage of majority classes, resulting in most new minority sampling spreading the whole minority space.

Classification counterfactual +2

Paper
Add Code

Structure-Aware Network for Lane Marker Extraction with Dynamic Vision Sensor

no code implementations • 14 Aug 2020 • Wensheng Cheng, Hao Luo, Wen Yang, Lei Yu, Wei Li

We then propose a structure-aware network for lane marker extraction in DVS images.

Autonomous Driving Semantic Segmentation

Paper
Add Code

Multi-Domain Learning and Identity Mining for Vehicle Re-Identification

2 code implementations • 22 Apr 2020 • Shuting He, Hao Luo, Weihua Chen, Miao Zhang, Yuqi Zhang, Fan Wang, Hao Li, Wei Jiang

Our solution is based on a strong baseline with bag of tricks (BoT-BS) proposed in person ReID.

Clustering Re-Ranking +1

2,200

Paper
Code

Cross-Spectrum Dual-Subspace Pairing for RGB-infrared Cross-Modality Person Re-Identification

no code implementations • 29 Feb 2020 • Xing Fan, Hao Luo, Chi Zhang, Wei Jiang

Another challenge of RGB-infrared ReID is that the intra-person (images from the same person) discrepancy is often larger than the inter-person (images from different persons) discrepancy, so a dual-subspace pairing strategy is proposed to alleviate this problem.

Cross-Modality Person Re-identification Image Generation +1

Paper
Add Code

Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion

1 code implementation • 22 Jan 2020 • Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang

In this paper, we extend the CDVAE-VC framework by incorporating the concept of adversarial learning, in order to further increase the degree of disentanglement, thereby improving the quality and similarity of converted speech.

Disentanglement Voice Conversion

Paper
Code

Stripe-based and Attribute-aware Network: A Two-Branch Deep Model for Vehicle Re-identification

no code implementations • 12 Oct 2019 • Jingjing Qian, Wei Jiang, Hao Luo, Hongyan Yu

Vehicle re-identification (Re-ID) has been attracting increasing interest in the field of computer vision due to the growing utilization of surveillance cameras in public security.

Attribute Vehicle Re-Identification

Paper
Add Code

Object Detection in Video with Spatial-temporal Context Aggregation

no code implementations • 11 Jul 2019 • Hao Luo, Lichao Huang, Han Shen, Yuan Li, Chang Huang, Xinggang Wang

Without any bells and whistles, our method obtains 80. 3\% mAP on the ImageNet VID dataset, which is superior over the previous state-of-the-arts.

Object object-detection +1

Paper
Add Code

A Strong Baseline and Batch Normalization Neck for Deep Person Re-identification

3 code implementations • 19 Jun 2019 • Hao Luo, Wei Jiang, Youzhi Gu, Fuxu Liu, Xingyu Liao, Shenqi Lai, Jianyang Gu

The present study collects and evaluates these effective training tricks in person ReID.

Ranked #44 on Person Re-Identification on DukeMTMC-reID

Person Re-Identification

2,200

Paper
Code

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

9 code implementations • 17 Mar 2019 • Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, Wei Jiang

In the literature, some effective training tricks are briefly appeared in several papers or source codes.

Ranked #2 on Person Re-Identification on UAV-Human

Person Re-Identification

5,248

Paper
Code

STNReID : Deep Convolutional Networks with Pairwise Spatial Transformer Networks for Partial Person Re-identification

no code implementations • 17 Mar 2019 • Hao Luo, Xing Fan, Chi Zhang, Wei Jiang

Competition (or confrontation) is observed between the STN module and the ReID module, and two-stage training is applied to acquire a strong STNReID for partial ReID.

Person Re-Identification

Paper
Add Code

A Novel Self-Intersection Penalty Term for Statistical Body Shape Models and Its Applications in 3D Pose Estimation

no code implementations • 24 Jan 2019 • Zaiqiang Wu, Wei Jiang, Hao Luo, Lin Cheng

To calculate the partial derivatives with respect to the coordinates of the vertices, we employed detection rays to divide vertices of statistical body shape models into different groups depending on whether the vertex is in the region of self-intersection.

3D Pose Estimation 3D Reconstruction

Paper
Add Code

Detect or Track: Towards Cost-Effective Video Object Detection/Tracking

no code implementations • 13 Nov 2018 • Hao Luo, Wenxuan Xie, Xinggang Wang, Wen-Jun Zeng

Trackers are in general more efficient than detectors but bear the risk of drifting.

Object object-detection +1

Paper
Add Code

SCPNet: Spatial-Channel Parallelism Network for Joint Holistic and Partial Person Re-Identification

no code implementations • 16 Oct 2018 • Xing Fan, Hao Luo, Xuan Zhang, Lingxiao He, Chi Zhang, Wei Jiang

Holistic person re-identification (ReID) has received extensive study in the past few years and achieves impressive progress.

Person Re-Identification

Paper
Add Code

SphereReID: Deep Hypersphere Manifold Embedding for Person Re-Identification

2 code implementations • 2 Jul 2018 • Xing Fan, Wei Jiang, Hao Luo, Mengjuan Fei

In this paper, we use a modified softmax function, termed Sphere Softmax, to solve the classification problem and learn a hypersphere manifold embedding simultaneously.

Person Re-Identification Re-Ranking

3,947

Paper
Code