Search Results for author: Sanchit Ahuja

Found 8 papers, 3 papers with code

HYPHEN: Hyperbolic Hawkes Attention For Text Streams

1 code implementation ACL 2022 Shivam Agarwal, Ramit Sawhney, Sanchit Ahuja, Ritesh Soun, Sudheer Chava

Analyzing the temporal sequence of texts from sources such as social media, news, and parliamentary debates is a challenging problem as it exhibits time-varying scale-free properties and fine-grained timing irregularities.

Stock Price Prediction

Contamination Report for Multilingual Benchmarks

no code implementations 21 Oct 2024 Sanchit Ahuja, Varun Gumma, Sunayana Sitaram

Benchmark contamination refers to the presence of test datasets in Large Language Model (LLM) pre-training or post-training data.

Language Modeling Language Modelling +1

Scaling Laws for Multilingual Language Models

no code implementations 15 Oct 2024 Yifei He, Alon Benhaim, Barun Patra, Praneetha Vaddamanu, Sanchit Ahuja, Parul Chopra, Vishrav Chaudhary, Han Zhao, Xia Song

We propose a novel scaling law for general-purpose decoder-only language models (LMs) trained on multilingual data, tackling the problem of balancing languages during multilingual pretraining.

Cross-Lingual Transfer

sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting

no code implementations 13 Jul 2024 Sanchit Ahuja, Kumar Tanmay, Hardik Hansrajbhai Chauhan, Barun Patra, Kriti Aggarwal, Luciano del Corro, Arindam Mitra, Tejas Indulal Dhamecha, Ahmed Awadallah, Monojit Choudhary, Vishrav Chaudhary, Sunayana Sitaram

In order to address this, we introduce a novel recipe for creating a multilingual synthetic instruction tuning dataset, sPhinX, which is created by selectively translating instruction response pairs from English into 50 languages.

Machine Translation Question Answering +1

DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures

no code implementations 23 Feb 2024 Agrima Seth, Sanchit Ahuja, Kalika Bali, Sunayana Sitaram

Generative models are increasingly being used in various applications, such as text generation, commonsense reasoning, and question-answering.

Question Answering Text Generation

MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

no code implementations 13 Nov 2023 Sanchit Ahuja, Divyanshu Aggarwal, Varun Gumma, Ishaan Watts, Ashutosh Sathe, Millicent Ochieng, Rishav Hada, Prachi Jain, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

We also perform a study on data contamination and find that several models are likely to be contaminated with multilingual evaluation benchmarks, necessitating approaches to detect and handle contamination while assessing the multilingual performance of LLMs.

Benchmarking
