no code implementations • 4 Jul 2025 • Hai Huang, Yan Xia, Sashuai Zhou, Hanting Wang, Shulei Wang, Zhou Zhao
Domain Generalization (DG) aims to enhance model robustness in unseen or distributionally shifted target domains through training exclusively on source domains.
1 code implementation • 13 Jun 2025 • Pradyut Sekhsaria, Marcel Mateos Salles, Hai Huang, Randall Balestriero
Second, for light SSTI, the reliance on spurious tokens is proportional to the LoRA rank.
1 code implementation • 30 May 2025 • Hanting Wang, Tao Jin, Wang Lin, Shulei Wang, Hai Huang, Shengpeng Ji, Zhou Zhao
The main challenge is that standard generative models are typically designed for a diffusion process that starts from pure noise, while restoration tasks begin with a low-quality image, resulting in a mismatch in the state distributions between the two processes.
no code implementations • 16 Apr 2025 • Jialei Song, Xingquan Zuo, Feiyang Wang, Hai Huang, Tianle Zhang
The average computation time of RDI is only 1/30 of the evaluation method based on the PGD attack.
no code implementations • 26 Mar 2025 • Sashuai Zhou, Hai Huang, Yan Xia
Multi-modal models excel in cross-modal tasks but are computationally expensive due to their billions of parameters.
no code implementations • CVPR 2025 • Shulei Wang, Wang Lin, Hai Huang, Hanting Wang, Sihang Cai, WenKang Han, Tao Jin, Jingyuan Chen, Jiacheng Sun, Jieming Zhu, Zhou Zhao
We introduce a novel, training-free approach for enhancing alignment in Transformer-based Text-Guided Diffusion Models (TGDMs).
no code implementations • 4 Feb 2025 • Chang Liu, Hai Huang, Yujie Xing, Xingquan Zuo
Various attack methods have been proposed to explore the vulnerabilities of GNNs, ranging from Graph Modification Attacks (GMA) to the more practical and flexible Graph Injection Attacks (GIA).
no code implementations • 2 Feb 2025 • Zeyu Jiang, Hai Huang, Xingquan Zuo
In this work, we propose a novel concept-based interpretability method that provides fine-grained explanations of DRL models at the neuron level.
no code implementations • 26 Dec 2024 • Hai Huang, Shulei Wang, Yan Xia
Recent research in the domain of multimodal unified representations predominantly employs codebook as representation forms, utilizing Vector Quantization(VQ) for quantization, yet there has been insufficient exploration of other quantization representation forms.
1 code implementation • 21 Dec 2024 • Ziming Guo, Chao Ma, Yinggang Sun, Tiancheng Zhao, Guangyao Wang, Hai Huang
Recent advancements in large language models (LLMs) have significantly advanced text-to-SQL systems.
Ranked #1 on
MMSQL performance
on MMSQL
no code implementations • 5 Nov 2024 • Lu Wang-Nöth, Philipp Heiler, Hai Huang, Daniel Lichtenstern, Alexandra Reichenbach, Luis Flacke, Linus Maisch, Helmut Mayer
Our work addresses the need for effective data utilization in biological data collection, offering a systematic and dynamic quantitative approach.
2 code implementations • 19 Oct 2024 • Mingyuan Zhou, Huangjie Zheng, Yi Gu, Zhendong Wang, Hai Huang
SiDA utilizes the encoder from the generator's score network as a discriminator, allowing it to distinguish between real images and those generated by SiD.
Ranked #1 on
Image Generation
on FFHQ 64x64
no code implementations • 13 Oct 2024 • Hai Huang, Randall Balestriero
We identify three core limitations to LoRA for finetuning--a setting that employs limited amount of data and training steps.
no code implementations • Applied Energy 2024 • Haoyu Chen, Hai Huang, Yong Zheng, Bing Yang
To improve the granularity of load decomposition within integrated energy system (IES), a novel multiple load forecasting approach is proposed.
no code implementations • 25 Jun 2024 • Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao
Generative retrieval, which has demonstrated effectiveness in text-to-text retrieval, utilizes a sequence-to-sequence model to directly generate candidate identifiers based on natural language queries.
1 code implementation • 7 Jun 2024 • Feiyang Wang, Xingquan Zuo, Hai Huang, Gang Chen
Many machine learning models are susceptible to adversarial attacks, with decision-based black-box attacks representing the most critical threat in real-world applications.
2 code implementations • 3 Jun 2024 • Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang
Specifically, its data-free distillation of Stable Diffusion 1. 5 achieves a record low FID of 8. 15 on the COCO-2014 validation set, with a CLIP score of 0. 304 at an LSG scale of 1. 5, and an FID of 9. 56 with a CLIP score of 0. 313 at an LSG scale of 2.
1 code implementation • 3 Jun 2024 • Shengpeng Ji, Qian Chen, Wen Wang, Jialong Zuo, Minghui Fang, Ziyue Jiang, Hai Huang, Zehan Wang, Xize Cheng, Siqi Zheng, Zhou Zhao
In this paper, we present ControlSpeech, a text-to-speech (TTS) system capable of fully cloning the speaker's voice and enabling arbitrary control and adjustment of speaking style.
1 code implementation • 2 May 2024 • Yujie Xing, Xiao Wang, Yibo Li, Hai Huang, Chuan Shi
Then we propose a novel Bi-Level Global Graph Transformer with Collaborative Training (CoBFormer), including the inter-cluster and intra-cluster Transformers, to prevent the over-globalizing problem while keeping the ability to extract valuable information from distant nodes.
no code implementations • 14 Apr 2024 • Haifeng Xia, Hai Huang, Zhengming Ding
Deep clustering as an important branch of unsupervised representation learning focuses on embedding semantically similar samples into the identical feature space.
2 code implementations • 5 Apr 2024 • Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang
This achievement not only redefines the benchmarks for efficiency and effectiveness in diffusion distillation but also in the broader field of diffusion-based generation.
Ranked #2 on
Image Generation
on AFHQ-v2 64x64
no code implementations • 8 Mar 2024 • Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Minghui Fang, Jieming Zhu, Zhenhua Dong, Sashuai Zhou, Zhou Zhao
To enhance the interpretability of multimodal unified representations, many studies have focused on discrete unified representations.
1 code implementation • 19 Feb 2024 • Shengpeng Ji, Minghui Fang, Jialong Zuo, Ziyue Jiang, Dingdong Wang, Hanting Wang, Hai Huang, Zhou Zhao
Furthermore, we also validate the efficiency of the Language-Codec on downstream speech language models.
1 code implementation • 12 Feb 2024 • Yueqin Yin, Zhendong Wang, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan Zhou
In the field of large language models (LLMs), aligning models with the diverse preferences of users is a critical challenge.
1 code implementation • 6 Nov 2023 • Hao Zhou, Tiancheng Shen, Xu Yang, Hai Huang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang
We benchmarked the proposed evaluation metrics on 12 open-vocabulary methods of three segmentation tasks.
1 code implementation • 24 Oct 2023 • Jay Zhangjie Wu, Xiuyu Li, Difei Gao, Zhen Dong, Jinbin Bai, Aishani Singh, Xiaoyu Xiang, Youzeng Li, Zuwei Huang, Yuanxi Sun, Rui He, Feng Hu, Junhua Hu, Hai Huang, Hanyu Zhu, Xu Cheng, Jie Tang, Mike Zheng Shou, Kurt Keutzer, Forrest Iandola
In this paper we present a retrospective on the competition and describe the winning method.
no code implementations • 11 Oct 2023 • Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang
Specifically, the VPPTaaS provider optimizes a visual prompt given downstream data, and downstream users can use this prompt together with the large pre-trained model for prediction.
1 code implementation • 11 Oct 2023 • Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang
Such a Composite Backdoor Attack (CBA) is shown to be stealthier than implanting the same multiple trigger keys in only a single component.
no code implementations • 9 Mar 2023 • Yaqi Sun, Wenchuan Wu, Yi Lin, Hai Huang, Hao Chen
The main goal of distribution network (DN) expansion planning is essentially to achieve minimal investment constrained with specified reliability requirements.
1 code implementation • 4 Sep 2022 • Hai Huang, Zhikun Zhang, Yun Shen, Michael Backes, Qi Li, Yang Zhang
Existing studies on neural architecture search (NAS) mainly focus on efficiently and effectively searching for network architectures with better performance.
no code implementations • 28 Oct 2021 • Hao Zhou, Dongchun Ren, Xu Yang, Mingyu Fan, Hai Huang
First, with the continuation of time, the prediction error at each time step increases significantly, causing the final displacement error to be impossible to ignore.
no code implementations • 7 Jan 2021 • Hai Huang, Jiaming Mu, Neil Zhenqiang Gong, Qi Li, Bin Liu, Mingwei Xu
Specifically, we formulate our attack as an optimization problem, such that the injected ratings would maximize the number of normal users to whom the target items are recommended.
no code implementations • Remote Sensing 2019 • Wen Zhuo, Jianxi Huang, Li Li, Xiaodong Zhang, Hongyuan Ma, Xinran Gao, Hai Huang, Baodong Xu, Xiangming Xiao
The aim of this study is to improve the accuracy for winter wheat yield estimation by assimilating time series soil moisture images, which are retrieved by a water cloud model using SAR and optical data as input, into the crop model.
2 code implementations • 25 Mar 2019 • Yidong Xia, Ansel Blumers, Zhen Li, Lixiang Luo, Yu-Hang Tang, Joshua Kane, Hai Huang, Matthew Andrew, Milind Deo, Jan Goral
Lastly, we demonstrate, through a flow simulation in realistic shale pores, that the CPU counterpart requires 840 Power9 cores to rival the performance delivered by our package with four V100 GPUs on ORNL's Summit architecture.
Computational Physics