Search Results for author: Gopala Anumanchipalli

Found 8 papers, 2 papers with code

A Unified Framework for Model Editing

2 code implementations • 21 Mar 2024 • Akshat Gupta, Dev Sajnani, Gopala Anumanchipalli

We introduce a unifying framework that brings two leading "locate-and-edit" model editing techniques -- ROME and MEMIT -- under a single conceptual umbrella, optimizing for the same goal, which we call the preservation-memorization objective.

Memorization • Model Editing
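
The preservation-memorization view admits closed-form weight updates. Below is a minimal numpy sketch of the two solutions as we read them from the abstract: ROME enforces the new fact as a hard equality constraint, while MEMIT folds a batch of facts into a least-squares term. Variable names and the weighting lam are our assumptions, and the sketch omits how keys and values are actually extracted from the transformer.

    import numpy as np

    def rome_update(W0, C0, k_e, v_e):
        # Preserve W0 on the old keys (C0 = K0 @ K0.T summarizes them) while
        # memorizing the new fact exactly: the constraint W @ k_e = v_e
        # yields a rank-one update.
        resid = v_e - W0 @ k_e              # what the edit must add on k_e
        u = np.linalg.solve(C0, k_e)        # C0^{-1} k_e
        return W0 + np.outer(resid, u) / (k_e @ u)

    def memit_update(W0, C0, K_E, V_E, lam=1.0):
        # Same preservation term, but memorization of a batch of edits
        # (columns of K_E, V_E) enters as a least-squares objective.
        resid = V_E - W0 @ K_E
        return W0 + resid @ K_E.T @ np.linalg.inv(lam * C0 + K_E @ K_E.T)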

Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing

1 code implementation • 11 Mar 2024 • Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli

With this paper, we provide a more stable implementation of ROME, which we call r-ROME, and show that model collapse is no longer observed when making large-scale sequential edits with r-ROME, while further improving the generalization and locality of model editing compared to the original implementation of ROME.

Model Editing
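
For context, the collapse appears in the sequential regime, where each edit is applied on top of the weights left by the previous one. A minimal sketch of that loop, reusing rome_update from the sketch above; the drift metric is our own illustrative proxy, not the paper's:

    import numpy as np

    def sequential_edits(W0, C0, edits):
        # Apply edits one after another; edit t sees the weights left by
        # edit t-1. This is the regime where the original ROME implementation
        # was observed to collapse and r-ROME is reported to stay stable.
        W = W0.copy()
        for k_e, v_e in edits:
            W = rome_update(W, C0, k_e, v_e)
            drift = np.linalg.norm(W - W0) / np.linalg.norm(W0)
            print(f"relative weight drift: {drift:.3f}")  # a blow-up signals collapse
        return W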

Identifying Multiple Personalities in Large Language Models with External Evaluation

no code implementations • 22 Feb 2024 • Xiaoyang Song, Yuta Adachi, Jessie Feng, Mouwei Lin, Linhao Yu, Frank Li, Akshat Gupta, Gopala Anumanchipalli, Simerjot Kaur

In this paper, we investigate LLM personalities using an alternative measurement method, which we refer to as the external evaluation method: instead of prompting LLMs with Likert-scale multiple-choice questions, we evaluate LLMs' personalities by analyzing their responses to open-ended situational questions with an external machine learning model.

Multiple-choice
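
A sketch of the external evaluation pipeline as described: collect free-form answers to situational prompts and score them with a separate classifier. Here ask_llm stands in for the model under test, the situational prompts are invented examples, and "personality-classifier" is a hypothetical model id, not a real checkpoint.

    from transformers import pipeline

    SITUATIONS = [
        "Your team just missed a major deadline. What do you do next?",
        "A stranger at a party invites you to join a large group game. How do you respond?",
    ]

    def external_personality_eval(ask_llm, classifier_id="personality-classifier"):
        # Score open-ended answers with an external model instead of asking
        # the LLM to rate itself on a Likert scale.
        clf = pipeline("text-classification", model=classifier_id)
        results = []
        for question in SITUATIONS:
            answer = ask_llm(question)        # free-form response under test
            results.append(clf(answer)[0])    # e.g. {'label': ..., 'score': ...}
        return results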

Towards Hierarchical Spoken Language Dysfluency Modeling

no code implementations • 18 Jan 2024 • Jiachen Lian, Gopala Anumanchipalli

Speech dysfluency modeling is the bottleneck for both speech therapy and language learning.

Model Editing at Scale leads to Gradual and Catastrophic Forgetting

no code implementations • 15 Jan 2024 • Akshat Gupta, Anurag Rao, Gopala Anumanchipalli

With this in mind, we evaluate current model editing methods at scale, focusing on two state-of-the-art methods: ROME and MEMIT.

Model Editing • Specificity
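
The evaluation protocol suggested by the abstract can be sketched as a loop that interleaves sequential edits with periodic benchmark checks: a slow decay in the resulting curve would be gradual forgetting, a sudden drop catastrophic forgetting. apply_edit and downstream_accuracy are placeholders for a concrete editor (ROME or MEMIT) and a held-out benchmark.

    def editing_at_scale(model, edits, apply_edit, downstream_accuracy, every=100):
        # Track downstream performance as the number of sequential edits grows.
        curve = []
        for i, edit in enumerate(edits, start=1):
            model = apply_edit(model, edit)
            if i % every == 0:
                curve.append((i, downstream_accuracy(model)))
        return curve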

Self-Assessment Tests are Unreliable Measures of LLM Personality

no code implementations • 15 Sep 2023 • Akshat Gupta, Xiaoyang Song, Gopala Anumanchipalli

These simple tests, done on ChatGPT and three Llama2 models of different sizes, show that self-assessment personality tests created for humans are unreliable measures of personality in LLMs.

Multiple-choice
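
One simple unreliability probe consistent with the abstract: administer the same Likert-style item several times with the answer options shuffled, and check whether the model's choice survives the reordering. ask_llm again stands in for ChatGPT or a Llama2 model; the exact prompts used in the paper may differ.

    import random

    OPTIONS = ["strongly disagree", "disagree", "neutral", "agree", "strongly agree"]

    def order_sensitivity(ask_llm, statement, trials=5):
        # A trait score that flips with option order reflects the prompt,
        # not a stable personality of the model.
        answers = set()
        for _ in range(trials):
            opts = random.sample(OPTIONS, k=len(OPTIONS))  # random option order
            prompt = f'"{statement}" Choose one option: {", ".join(opts)}.'
            answers.add(ask_llm(prompt).strip().lower())
        return answers  # more than one distinct answer => order-sensitive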
