Search Results for author: Hao Luo

Found 53 papers, 26 papers with code

Uncovering the Text Embedding in Text-to-Image Diffusion Models

no code implementations1 Apr 2024 Hu Yu, Hao Luo, Fan Wang, Feng Zhao

The correspondence between input text and the generated image exhibits opacity, wherein minor textual modifications can induce substantial deviations in the generated image.

Text Data-Centric Image Captioning with Interactive Prompts

no code implementations28 Mar 2024 Yiyu Wang, Hao Luo, Jungang Xu, Yingfei Sun, Fan Wang

Among them, the mainstream solution is to project image embeddings into the text embedding space with the assistance of consistent representations between image-text pairs from the CLIP model.

Image Captioning

Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network

1 code implementation27 Feb 2024 Zhaoyang Wang, Dongyang Li, Mingyang Zhang, Hao Luo, Maoguo Gong

Existing hyperspectral image (HSI) super-resolution (SR) methods struggle to effectively capture the complex spectral-spatial relationships and low-level details, while diffusion models represent a promising generative model known for their exceptional performance in modeling complex relations and learning high and low-level visual features.

Super-Resolution

Accelerating Parallel Sampling of Diffusion Models

no code implementations15 Feb 2024 Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang

Our experiments demonstrate that ParaTAA can decrease the inference steps required by common sequential sampling algorithms such as DDIM and DDPM by a factor of 4~14 times.

Image Generation

Integrated Imaging and Communication with Reconfigurable Intelligent Surfaces

1 code implementation29 Jan 2024 Hao Luo, Ahmed Alkhateeb

In particular, using the RIS as a wireless imaging device, our system constructs the scene depth map of the environment, including the mobile user.

ISAC with Backscattering RFID Tags: Joint Beamforming Design

1 code implementation18 Jan 2024 Hao Luo, Umut Demirhan, Ahmed Alkhateeb

Then, we study a joint beamforming design problem with the goal of minimizing the total transmit power while satisfying the tag detection and communication requirements.

TAG

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

no code implementations14 Dec 2023 Yabing Wang, Fan Wang, Jianfeng Dong, Hao Luo

Cross-lingual cross-modal retrieval has garnered increasing attention recently, which aims to achieve the alignment between vision and target language (V-T) without using any annotated V-T data pairs.

Cross-Lingual Transfer Cross-Modal Retrieval +4

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations13 Nov 2023 Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)

Person Search Text based Person Retrieval +2

Deep Learning Enables Large Depth-of-Field Images for Sub-Diffraction-Limit Scanning Superlens Microscopy

no code implementations27 Oct 2023 Hui Sun, Hao Luo, Feifei Wang, Qingjiu Chen, Meng Chen, Xiaoduo Wang, Haibo Yu, Guanglie Zhang, Lianqing Liu, JianPing Wang, Dapeng Wu, Wen Jung Li

Scanning electron microscopy (SEM) is indispensable in diverse applications ranging from microelectronics to food processing because it provides large depth-of-field images with a resolution beyond the optical diffraction limit.

Defect Detection Image-to-Image Translation +1

Variational Quantum Linear Solver-based Combination Rules in Dempster–Shafer Theory

1 code implementation journal 2023 Hao Luo, Qianli Zhou, Zhen Li, Yong Deng

Dempster–Shafer Theory (DST), as a method of handling uncertain information, is widely used in decisionmaking and information fusion.

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations15 Sep 2023 Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Recently, many parameter-efficient fine-tuning (PEFT) methods have been proposed, and their experiments demonstrate that tuning only 1% of extra parameters could surpass full fine-tuning in low-data resource scenarios.

Domain Generalization Few-Shot Learning

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations7 Sep 2023 Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Person Re-Identification

Revisiting Vision Transformer from the View of Path Ensemble

no code implementations ICCV 2023 Shuning Chang, Pichao Wang, Hao Luo, Fan Wang, Mike Zheng Shou

Therefore, we propose the path pruning and EnsembleScale skills for improvement, which cut out the underperforming paths and re-weight the ensemble components, respectively, to optimize the path combination and make the short paths focus on providing high-quality representation for subsequent paths.

Millimeter Wave V2V Beam Tracking using Radar: Algorithms and Real-World Demonstration

1 code implementation3 Aug 2023 Hao Luo, Umut Demirhan, Ahmed Alkhateeb

Utilizing radar sensing for assisting communication has attracted increasing interest thanks to its potential in dynamic environments.

Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

4 code implementations CVPR 2023 Weihua Chen, Xianzhe Xu, Jian Jia, Hao Luo, Yaohua Wang, Fan Wang, Rong Jin, Xiuyu Sun

Unlike the existing self-supervised learning methods, prior knowledge from human images is utilized in SOLIDER to build pseudo semantic labels and import more semantic information into the learned representation.

Human Parsing Pedestrian Attribute Recognition +6

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft

1 code implementation19 Mar 2023 Ziluo Ding, Hao Luo, Ke Li, Junpeng Yue, Tiejun Huang, Zongqing Lu

One of the essential missions in the AI research community is to build an autonomous embodied agent that can attain high-level performance across a wide spectrum of tasks.

Contrastive Learning Language Modelling +1

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations14 Mar 2023 Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID

1 code implementation CVPR 2023 Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao

Neural Architecture Search (NAS) has been increasingly appealing to the society of object Re-Identification (ReID), for that task-specific architectures significantly improve the retrieval performance.

Image Classification Neural Architecture Search +3

Model-Based Decentralized Policy Optimization

no code implementations16 Feb 2023 Hao Luo, Jiechuan Jiang, Zongqing Lu

To help the policy improvement be stable and monotonic, we propose model-based decentralized policy optimization (MDPO), which incorporates a latent variable function to help construct the transition and reward function from an individual perspective.

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

1 code implementation13 Jan 2023 Jie Gui, Tuo Chen, Jing Zhang, Qiong Cao, Zhenan Sun, Hao Luo, DaCheng Tao

Deep supervised learning algorithms typically require a large volume of labeled data to achieve satisfactory performance.

Self-Supervised Learning

A Survey on Transformers in Reinforcement Learning

no code implementations8 Jan 2023 Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye

Transformer has been considered the dominating neural architecture in NLP and CV, mostly under supervised settings.

reinforcement-learning Reinforcement Learning (RL)

Good helper is around you: Attention-driven Masked Image Modeling

1 code implementation28 Nov 2022 Zhengqi Liu, Jie Gui, Hao Luo

Most previous works mask patches of the image randomly, which underutilizes the semantic information that is beneficial to visual representation learning.

Representation Learning Self-Supervised Learning

Reconfigurable Intelligent Surface Aided Wireless Sensing for Scene Depth Estimation

1 code implementation15 Nov 2022 Abdelrahman Taha, Hao Luo, Ahmed Alkhateeb

In this paper, we propose to employ RIS-aided wireless sensing systems for scene depth estimation.

Depth Estimation

VTC-LFC: Vision Transformer Compression with Low-Frequency Components

1 code implementation NIPS 2022 Zhenyu Wang, Hao Luo, Pichao Wang, Feng Ding, Fan Wang, Hao Li

Although Vision transformers (ViTs) have recently dominated many vision tasks, deploying ViT models on resource-limited devices remains a challenging problem.

MimCo: Masked Image Modeling Pre-training with Contrastive Teacher

no code implementations7 Sep 2022 Qiang Zhou, Chaohui Yu, Hao Luo, Zhibin Wang, Hao Li

Specifically, MimCo takes a pre-trained contrastive learning model as the teacher model and is pre-trained with two types of learning targets: patch-level and image-level reconstruction losses.

Contrastive Learning Self-Supervised Learning

Dynamic Gradient Reactivation for Backward Compatible Person Re-identification

no code implementations12 Jul 2022 Xiao Pan, Hao Luo, Weihua Chen, Fan Wang, Hao Li, Wei Jiang, Jianming Zhang, Jianyang Gu, Peike Li

To address this issue, we propose the Ranking-based Backward Compatible Learning (RBCL), which directly optimizes the ranking metric between new features and old features.

Person Re-Identification Retrieval

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

2 code implementations23 Nov 2021 Hao Luo, Pichao Wang, Yi Xu, Feng Ding, Yanxin Zhou, Fan Wang, Hao Li, Rong Jin

We first investigate self-supervised learning (SSL) methods with Vision Transformer (ViT) pretrained on unlabelled person images (the LUPerson dataset), and empirically find it significantly surpasses ImageNet supervised pre-training models on ReID tasks.

 Ranked #1 on Unsupervised Person Re-Identification on Market-1501 (using extra training data)

Self-Supervised Learning Unsupervised Domain Adaptation +1

Scaled ReLU Matters for Training Vision Transformers

no code implementations8 Sep 2021 Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin

In this paper, we further investigate this problem and extend the above conclusion: only early convolutions do not help for stable training, but the scaled ReLU operation in the \textit{convolutional stem} (\textit{conv-stem}) matters.

An Empirical Study of Vehicle Re-Identification on the AI City Challenge

1 code implementation20 May 2021 Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, Fan Wang, Hao Li

We mainly focus on four points, i. e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.

Re-Ranking Retrieval +1

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones

1 code implementation14 May 2021 Chong Liu, Yuqi Zhang, Hao Luo, Jiasheng Tang, Weihua Chen, Xianzhe Xu, Fan Wang, Hao Li, Yi-Dong Shen

Multi-Target Multi-Camera Tracking has a wide range of applications and is the basis for many advanced inferences and predictions.

Clustering Vehicle Re-Identification

Accelerated differential inclusion for convex optimization

no code implementations11 Mar 2021 Hao Luo

This paper introduces a second-order differential inclusion for unconstrained convex optimization.

Optimization and Control 37M15, 34E10, 90C25

A primal-dual flow for affine constrained convex optimization

no code implementations11 Mar 2021 Hao Luo

We introduce a novel primal-dual flow for affine constrained convex optimization problems.

Optimization and Control 37M99, 37N40, 65K05, 90C25

1st Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification

1 code implementation25 Dec 2020 Jianyang Gu, Hao Luo, Weihua Chen, Yiqi Jiang, Yuqi Zhang, Shuting He, Fan Wang, Hao Li, Wei Jiang

Considering the large gap between the source domain and target domain, we focused on solving two biases that influenced the performance on domain adaptive pedestrian Re-ID and proposed a two-stage training procedure.

Domain Adaptation Pseudo Label

Counterfactual-based minority oversampling for imbalanced classification

no code implementations21 Aug 2020 Hao Luo, Li Liu

A key challenge of oversampling in imbalanced classification is that the generation of new minority samples often neglects the usage of majority classes, resulting in most new minority sampling spreading the whole minority space.

Classification counterfactual +2

Cross-Spectrum Dual-Subspace Pairing for RGB-infrared Cross-Modality Person Re-Identification

no code implementations29 Feb 2020 Xing Fan, Hao Luo, Chi Zhang, Wei Jiang

Another challenge of RGB-infrared ReID is that the intra-person (images from the same person) discrepancy is often larger than the inter-person (images from different persons) discrepancy, so a dual-subspace pairing strategy is proposed to alleviate this problem.

Cross-Modality Person Re-identification Image Generation +1

Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion

1 code implementation22 Jan 2020 Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang

In this paper, we extend the CDVAE-VC framework by incorporating the concept of adversarial learning, in order to further increase the degree of disentanglement, thereby improving the quality and similarity of converted speech.

Disentanglement Voice Conversion

Stripe-based and Attribute-aware Network: A Two-Branch Deep Model for Vehicle Re-identification

no code implementations12 Oct 2019 Jingjing Qian, Wei Jiang, Hao Luo, Hongyan Yu

Vehicle re-identification (Re-ID) has been attracting increasing interest in the field of computer vision due to the growing utilization of surveillance cameras in public security.

Attribute Vehicle Re-Identification

Object Detection in Video with Spatial-temporal Context Aggregation

no code implementations11 Jul 2019 Hao Luo, Lichao Huang, Han Shen, Yuan Li, Chang Huang, Xinggang Wang

Without any bells and whistles, our method obtains 80. 3\% mAP on the ImageNet VID dataset, which is superior over the previous state-of-the-arts.

Object object-detection +1

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

9 code implementations17 Mar 2019 Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, Wei Jiang

In the literature, some effective training tricks are briefly appeared in several papers or source codes.

Person Re-Identification

STNReID : Deep Convolutional Networks with Pairwise Spatial Transformer Networks for Partial Person Re-identification

no code implementations17 Mar 2019 Hao Luo, Xing Fan, Chi Zhang, Wei Jiang

Competition (or confrontation) is observed between the STN module and the ReID module, and two-stage training is applied to acquire a strong STNReID for partial ReID.

Person Re-Identification

A Novel Self-Intersection Penalty Term for Statistical Body Shape Models and Its Applications in 3D Pose Estimation

no code implementations24 Jan 2019 Zaiqiang Wu, Wei Jiang, Hao Luo, Lin Cheng

To calculate the partial derivatives with respect to the coordinates of the vertices, we employed detection rays to divide vertices of statistical body shape models into different groups depending on whether the vertex is in the region of self-intersection.

3D Pose Estimation 3D Reconstruction

SCPNet: Spatial-Channel Parallelism Network for Joint Holistic and Partial Person Re-Identification

no code implementations16 Oct 2018 Xing Fan, Hao Luo, Xuan Zhang, Lingxiao He, Chi Zhang, Wei Jiang

Holistic person re-identification (ReID) has received extensive study in the past few years and achieves impressive progress.

Person Re-Identification

SphereReID: Deep Hypersphere Manifold Embedding for Person Re-Identification

2 code implementations2 Jul 2018 Xing Fan, Wei Jiang, Hao Luo, Mengjuan Fei

In this paper, we use a modified softmax function, termed Sphere Softmax, to solve the classification problem and learn a hypersphere manifold embedding simultaneously.

Person Re-Identification Re-Ranking

AlignedReID: Surpassing Human-Level Performance in Person Re-Identification

15 code implementations22 Nov 2017 Xuan Zhang, Hao Luo, Xing Fan, Weilai Xiang, Yixiao Sun, Qiqi Xiao, Wei Jiang, Chi Zhang, Jian Sun

In this paper, we propose a novel method called AlignedReID that extracts a global feature which is jointly learned with local features.

Person Re-Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.