Search Results for author: Hai Huang

Found 34 papers, 16 papers with code

Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations

no code implementations4 Jul 2025 Hai Huang, Yan Xia, Sashuai Zhou, Hanting Wang, Shulei Wang, Zhou Zhao

Domain Generalization (DG) aims to enhance model robustness in unseen or distributionally shifted target domains through training exclusively on source domains.

Disentanglement Domain Generalization

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

1 code implementation30 May 2025 Hanting Wang, Tao Jin, Wang Lin, Shulei Wang, Hai Huang, Shengpeng Ji, Zhou Zhao

The main challenge is that standard generative models are typically designed for a diffusion process that starts from pure noise, while restoration tasks begin with a low-quality image, resulting in a mismatch in the state distributions between the two processes.

Image Restoration

Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning

no code implementations26 Mar 2025 Sashuai Zhou, Hai Huang, Yan Xia

Multi-modal models excel in cross-modal tasks but are computationally expensive due to their billions of parameters.

Mixture-of-Experts parameter-efficient fine-tuning

Query-Based and Unnoticeable Graph Injection Attack from Neighborhood Perspective

no code implementations4 Feb 2025 Chang Liu, Hai Huang, Yujie Xing, Xingquan Zuo

Various attack methods have been proposed to explore the vulnerabilities of GNNs, ranging from Graph Modification Attacks (GMA) to the more practical and flexible Graph Injection Attacks (GIA).

Compositional Concept-Based Neuron-Level Interpretability for Deep Reinforcement Learning

no code implementations2 Feb 2025 Zeyu Jiang, Hai Huang, Xingquan Zuo

In this work, we propose a novel concept-based interpretability method that provides fine-grained explanations of DRL models at the neuron level.

continuous-control Continuous Control +3

Semantic Residual for Multimodal Unified Discrete Representation

no code implementations26 Dec 2024 Hai Huang, Shulei Wang, Yan Xia

Recent research in the domain of multimodal unified representations predominantly employs codebook as representation forms, utilizing Vector Quantization(VQ) for quantization, yet there has been insufficient exploration of other quantization representation forms.

Disentanglement Quantization +1

Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step

2 code implementations19 Oct 2024 Mingyuan Zhou, Huangjie Zheng, Yi Gu, Zhendong Wang, Hai Huang

SiDA utilizes the encoder from the generator's score network as a discriminator, allowing it to distinguish between real images and those generated by SiD.

Conditional Image Generation GPU +1

ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws

no code implementations13 Oct 2024 Hai Huang, Randall Balestriero

We identify three core limitations to LoRA for finetuning--a setting that employs limited amount of data and training steps.

Large Language Model

ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling

no code implementations25 Jun 2024 Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao

Generative retrieval, which has demonstrated effectiveness in text-to-text retrieval, utilizes a sequence-to-sequence model to directly generate candidate identifiers based on natural language queries.

Cross-Modal Retrieval Natural Language Queries +2

ADBA:Approximation Decision Boundary Approach for Black-Box Adversarial Attacks

1 code implementation7 Jun 2024 Feiyang Wang, Xingquan Zuo, Hai Huang, Gang Chen

Many machine learning models are susceptible to adversarial attacks, with decision-based black-box attacks representing the most critical threat in real-world applications.

Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

2 code implementations3 Jun 2024 Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang

Specifically, its data-free distillation of Stable Diffusion 1. 5 achieves a record low FID of 8. 15 on the COCO-2014 validation set, with a CLIP score of 0. 304 at an LSG scale of 1. 5, and an FID of 9. 56 with a CLIP score of 0. 313 at an LSG scale of 2.

Text to Image Generation Text-to-Image Generation

ControlSpeech: Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control

1 code implementation3 Jun 2024 Shengpeng Ji, Qian Chen, Wen Wang, Jialong Zuo, Minghui Fang, Ziyue Jiang, Hai Huang, Zehan Wang, Xize Cheng, Siqi Zheng, Zhou Zhao

In this paper, we present ControlSpeech, a text-to-speech (TTS) system capable of fully cloning the speaker's voice and enabling arbitrary control and adjustment of speaking style.

Speech Synthesis text-to-speech +1

Less is More: on the Over-Globalizing Problem in Graph Transformers

1 code implementation2 May 2024 Yujie Xing, Xiao Wang, Yibo Li, Hai Huang, Chuan Shi

Then we propose a novel Bi-Level Global Graph Transformer with Collaborative Training (CoBFormer), including the inter-cluster and intra-cluster Transformers, to prevent the over-globalizing problem while keeping the ability to extract valuable information from distant nodes.

GCC: Generative Calibration Clustering

no code implementations14 Apr 2024 Haifeng Xia, Hai Huang, Zhengming Ding

Deep clustering as an important branch of unsupervised representation learning focuses on embedding semantically similar samples into the identical feature space.

Clustering Contrastive Learning +4

Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

2 code implementations5 Apr 2024 Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

This achievement not only redefines the benchmarks for efficiency and effectiveness in diffusion distillation but also in the broader field of diffusion-based generation.

Image Generation

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

1 code implementation12 Feb 2024 Yueqin Yin, Zhendong Wang, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan Zhou

In the field of large language models (LLMs), aligning models with the diverse preferences of users is a critical challenge.

Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion

1 code implementation6 Nov 2023 Hao Zhou, Tiancheng Shen, Xu Yang, Hai Huang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang

We benchmarked the proposed evaluation metrics on 12 open-vocabulary methods of three segmentation tasks.

Segmentation

Prompt Backdoors in Visual Prompt Learning

no code implementations11 Oct 2023 Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

Specifically, the VPPTaaS provider optimizes a visual prompt given downstream data, and downstream users can use this prompt together with the large pre-trained model for prediction.

Backdoor Attack Prompt Learning

Composite Backdoor Attacks Against Large Language Models

1 code implementation11 Oct 2023 Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

Such a Composite Backdoor Attack (CBA) is shown to be stealthier than implanting the same multiple trigger keys in only a single component.

Backdoor Attack

Parallel Computing Based Solution for Reliability-Constrained Distribution Network Planning

no code implementations9 Mar 2023 Yaqi Sun, Wenchuan Wu, Yi Lin, Hai Huang, Hao Chen

The main goal of distribution network (DN) expansion planning is essentially to achieve minimal investment constrained with specified reliability requirements.

On the Privacy Risks of Cell-Based NAS Architectures

1 code implementation4 Sep 2022 Hai Huang, Zhikun Zhang, Yun Shen, Michael Backes, Qi Li, Yang Zhang

Existing studies on neural architecture search (NAS) mainly focus on efficiently and effectively searching for network architectures with better performance.

Neural Architecture Search

Sliding Sequential CVAE with Time Variant Socially-aware Rethinking for Trajectory Prediction

no code implementations28 Oct 2021 Hao Zhou, Dongchun Ren, Xu Yang, Mingyu Fan, Hai Huang

First, with the continuation of time, the prediction error at each time step increases significantly, causing the final displacement error to be impossible to ignore.

Autonomous Driving Pedestrian Trajectory Prediction +4

Data Poisoning Attacks to Deep Learning Based Recommender Systems

no code implementations7 Jan 2021 Hai Huang, Jiaming Mu, Neil Zhenqiang Gong, Qi Li, Bin Liu, Mingwei Xu

Specifically, we formulate our attack as an optimization problem, such that the injected ratings would maximize the number of normal users to whom the target items are recommended.

Data Poisoning Deep Learning +1

Assimilating Soil Moisture Retrieved from Sentinel-1 and Sentinel-2 Data into WOFOST Model to Improve Winter Wheat Yield Estimation

no code implementations Remote Sensing 2019 Wen Zhuo, Jianxi Huang, Li Li, Xiaodong Zhang, Hongyuan Ma, Xinran Gao, Hai Huang, Baodong Xu, Xiangming Xiao

The aim of this study is to improve the accuracy for winter wheat yield estimation by assimilating time series soil moisture images, which are retrieved by a water cloud model using SAR and optical data as input, into the crop model.

A GPU-accelerated package for simulation of flow in nanoporous source rocks with many-body dissipative particle dynamics

2 code implementations25 Mar 2019 Yidong Xia, Ansel Blumers, Zhen Li, Lixiang Luo, Yu-Hang Tang, Joshua Kane, Hai Huang, Matthew Andrew, Milind Deo, Jan Goral

Lastly, we demonstrate, through a flow simulation in realistic shale pores, that the CPU counterpart requires 840 Power9 cores to rival the performance delivered by our package with four V100 GPUs on ORNL's Summit architecture.

Computational Physics

Cannot find the paper you are looking for? You can Submit a new open access paper.