Search Results for author: Anirudh Ajith

Found 5 papers, 4 papers with code

LitSearch: A Retrieval Benchmark for Scientific Literature Search

1 code implementation10 Jul 2024 Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao

LitSearch is constructed using a combination of (1) questions generated by GPT-4 based on paragraphs containing inline citations from research papers and (2) questions manually written by authors about their recently published papers.

Reranking Retrieval

Downstream Trade-offs of a Family of Text Watermarks

1 code implementation16 Nov 2023 Anirudh Ajith, Sameer Singh, Danish Pruthi

In this work, we evaluate the performance of LLMs watermarked using three different strategies over a diverse suite of tasks including those cast as k-class classification (CLS), multiple choice question answering (MCQ), short-form generation (e. g., open-ended question answering) and long-form generation (e. g., translation) tasks.

Form Language Modelling +7

Detecting Pretraining Data from Large Language Models

1 code implementation25 Oct 2023 Weijia Shi, Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins, Danqi Chen, Luke Zettlemoyer

Min-K% Prob can be applied without any knowledge about the pretraining corpus or any additional training, departing from previous detection methods that require training a reference model on data that is similar to the pretraining data.

Machine Unlearning

InstructEval: Systematic Evaluation of Instruction Selection Methods

no code implementations1 Jul 2023 Anirudh Ajith, Chris Pan, Mengzhou Xia, Ameet Deshpande, Karthik Narasimhan

In-context learning (ICL) performs tasks by prompting a large language model (LLM) using an instruction and a small set of annotated examples called demonstrations.

Benchmarking In-Context Learning +2

Adapting Language Models to Compress Contexts

1 code implementation24 May 2023 Alexis Chevalier, Alexander Wettig, Anirudh Ajith, Danqi Chen

Transformer-based language models (LMs) are powerful and widely-applicable tools, but their usefulness is constrained by a finite context window and the expensive computational cost of processing long text documents.

In-Context Learning Language Modeling +4

Cannot find the paper you are looking for? You can Submit a new open access paper.