no code implementations • 4 Mar 2025 • Erfan Entezami, Ali Naseh
Given the growing adoption of RLHF and open-source RLHF frameworks, we investigate the trustworthiness of these systems and their potential impact on behavior of LLMs.
1 code implementation • 4 Feb 2025 • Abhinav Kumar, Jaechul Roh, Ali Naseh, Marzena Karpinska, Mohit Iyyer, Amir Houmansadr, Eugene Bagdasarian
We evaluated our attack across closed-(OpenAI o1, o1-mini, o3-mini) and open-(DeepSeek R1) weights reasoning models on the FreshQA and SQuAD datasets.
no code implementations • 1 Feb 2025 • Ali Naseh, Yuefeng Peng, Anshuman Suri, Harsh Chaudhari, Alina Oprea, Amir Houmansadr
Retrieval-Augmented Generation (RAG) enables Large Language Models (LLMs) to generate grounded responses by leveraging external knowledge databases without altering model parameters.
no code implementations • 20 Jan 2025 • Ali Naseh, Niloofar Mireshghallah
Recent work shows membership inference attacks (MIAs) on large language models (LLMs) produce inconclusive results, partly due to difficulties in creating non-member datasets without temporal shifts.
no code implementations • 21 Jun 2024 • Ali Naseh, Jaechul Roh, Eugene Bagdasaryan, Amir Houmansadr
Furthermore, we show how the current state-of-the-art generative models make this attack both cheap and feasible for any adversary, with costs ranging between $12-$18.
no code implementations • 21 Apr 2024 • Ali Naseh, Katherine Thai, Mohit Iyyer, Amir Houmansadr
With the digital imagery landscape rapidly evolving, image stocks and AI-generated image marketplaces have become central to visual media.
no code implementations • 7 Dec 2023 • Yuefeng Peng, Ali Naseh, Amir Houmansadr
A unique feature of DIFFENCE is that it works on input samples only, without modifying the training or inference phase of the target model.
no code implementations • 6 Dec 2023 • Ali Naseh, Jaechul Roh, Amir Houmansadr
Multimodal machine learning, especially text-to-image models like Stable Diffusion and DALL-E 3, has gained significance for transforming text into detailed images.
no code implementations • 6 Dec 2023 • Ali Naseh, Jaechul Roh, Amir Houmansadr
Diffusion-based models, such as the Stable Diffusion model, have revolutionized text-to-image synthesis with their ability to produce high-quality, high-resolution images.
1 code implementation • 8 Mar 2023 • Ali Naseh, Kalpesh Krishna, Mohit Iyyer, Amir Houmansadr
A key component of generating text from modern language models (LM) is the selection and tuning of decoding algorithms.
no code implementations • 5 Oct 2020 • Mohsen Alishahiha, Amin Faraji Astaneh, Ghadir Jafari, Ali Naseh, Behrad Taghavi
In this paper, we study a particular deformation of the Jackiw-Teitelboim gravity recently considered by Maxfield, Turiaci and independently by Witten.
High Energy Physics - Theory General Relativity and Quantum Cosmology
no code implementations • 22 Dec 2019 • S. Sedigheh Hashemi, Ghadir Jafari, Ali Naseh
In this paper, we examine the proposed first law of holographic complexity through studying different perturbations around various spacetime backgrounds.
High Energy Physics - Theory General Relativity and Quantum Cosmology