Search Results for author: Leyang Hu

Found 5 papers, 5 papers with code

Curvature Tuning: Provable Training-free Model Steering From a Single Parameter

1 code implementation11 Feb 2025 Leyang Hu, Randall Balestriero

Additionally, we apply CT to ReLU-based Swin-T/S, improving their generalization on nine downstream datasets by 2. 43\%/3. 33\%.

Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization

1 code implementation4 Dec 2024 Peiyan Zhang, Haibo Jin, Leyang Hu, Xinnuo Li, Liying Kang, Man Luo, Yangqiu Song, Haohan Wang

However, relying solely on such feedback can be limited when the adjustments made in response to this feedback are either too small or fluctuate irregularly, potentially slowing down or even stalling the optimization process.

Prompt Engineering

DROJ: A Prompt-Driven Attack against Large Language Models

1 code implementation14 Nov 2024 Leyang Hu, Boran Wang

Here, we introduce a novel approach, Directed Rrepresentation Optimization Jailbreak (DROJ), which optimizes jailbreak prompts at the embedding level to shift the hidden representations of harmful queries towards directions that are more likely to elicit affirmative responses from the model.

JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models

1 code implementation26 Jun 2024 Haibo Jin, Leyang Hu, Xinuo Li, Peiyan Zhang, Chonghan Chen, Jun Zhuang, Haohan Wang

The rapid evolution of artificial intelligence (AI) through developments in Large Language Models (LLMs) and Vision-Language Models (VLMs) has brought significant advancements across various technological domains.

LLM Jailbreak Survey

Robustar: Interactive Toolbox Supporting Precise Data Annotation for Robust Vision Learning

1 code implementation18 Jul 2022 Chonghan Chen, Haohan Wang, Leyang Hu, Yuhao Zhang, Shuguang Lyu, Jingcheng Wu, Xinnuo Li, Linjing Sun, Eric P. Xing

We introduce the initial release of our software Robustar, which aims to improve the robustness of vision classification machine learning models through a data-driven perspective.

BIG-bench Machine Learning image-classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.