Search Results for author: Ali Naseh

Found 12 papers, 2 papers with code

LLM Misalignment via Adversarial RLHF Platforms

no code implementations • 4 Mar 2025 • Erfan Entezami, Ali Naseh

Given the growing adoption of RLHF and open-source RLHF frameworks, we investigate the trustworthiness of these systems and their potential impact on the behavior of LLMs.

OverThink: Slowdown Attacks on Reasoning LLMs

1 code implementation • 4 Feb 2025 • Abhinav Kumar, Jaechul Roh, Ali Naseh, Marzena Karpinska, Mohit Iyyer, Amir Houmansadr, Eugene Bagdasarian

We evaluated our attack across closed-weights (OpenAI o1, o1-mini, o3-mini) and open-weights (DeepSeek R1) reasoning models on the FreshQA and SQuAD datasets.

RAG

Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation

no code implementations • 1 Feb 2025 • Ali Naseh, Yuefeng Peng, Anshuman Suri, Harsh Chaudhari, Alina Oprea, Amir Houmansadr

Retrieval-Augmented Generation (RAG) enables Large Language Models (LLMs) to generate grounded responses by leveraging external knowledge databases without altering model parameters.

RAG, Retrieval

Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection

no code implementations • 20 Jan 2025 • Ali Naseh, Niloofar Mireshghallah

Recent work shows membership inference attacks (MIAs) on large language models (LLMs) produce inconclusive results, partly due to difficulties in creating non-member datasets without temporal shifts.

Memorization, Text Detection

Backdooring Bias into Text-to-Image Models

no code implementations • 21 Jun 2024 • Ali Naseh, Jaechul Roh, Eugene Bagdasaryan, Amir Houmansadr

Furthermore, we show how the current state-of-the-art generative models make this attack both cheap and feasible for any adversary, with costs ranging between $12-$18.

Backdoor Attack, Text-to-Image Generation

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images

no code implementations • 21 Apr 2024 • Ali Naseh, Katherine Thai, Mohit Iyyer, Amir Houmansadr

With the digital imagery landscape rapidly evolving, image stocks and AI-generated image marketplaces have become central to visual media.

Descriptive

Diffence: Fencing Membership Privacy With Diffusion Models

no code implementations • 7 Dec 2023 • Yuefeng Peng, Ali Naseh, Amir Houmansadr

A unique feature of DIFFENCE is that it works on input samples only, without modifying the training or inference phase of the target model.

Understanding (Un)Intended Memorization in Text-to-Image Generative Models

no code implementations • 6 Dec 2023 • Ali Naseh, Jaechul Roh, Amir Houmansadr

Multimodal machine learning, especially text-to-image models like Stable Diffusion and DALL-E 3, has gained significance for transforming text into detailed images.

Image Generation, Memorization

Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication

no code implementations • 6 Dec 2023 • Ali Naseh, Jaechul Roh, Amir Houmansadr

Diffusion-based models, such as the Stable Diffusion model, have revolutionized text-to-image synthesis with their ability to produce high-quality, high-resolution images.

Image Generation, Memorization

Stealing the Decoding Algorithms of Language Models

1 code implementation • 8 Mar 2023 • Ali Naseh, Kalpesh Krishna, Mohit Iyyer, Amir Houmansadr

A key component of generating text from modern language models (LMs) is the selection and tuning of decoding algorithms.

Text Generation

On Free Energy for Deformed JT Gravity

no code implementations • 5 Oct 2020 • Mohsen Alishahiha, Amin Faraji Astaneh, Ghadir Jafari, Ali Naseh, Behrad Taghavi

In this paper, we study a particular deformation of Jackiw-Teitelboim gravity recently considered by Maxfield and Turiaci and, independently, by Witten.

High Energy Physics - Theory, General Relativity and Quantum Cosmology

On the first law of holographic complexity

no code implementations • 22 Dec 2019 • S. Sedigheh Hashemi, Ghadir Jafari, Ali Naseh

In this paper, we examine the proposed first law of holographic complexity through studying different perturbations around various spacetime backgrounds.

High Energy Physics - Theory, General Relativity and Quantum Cosmology
