Search Results for author: Vinija Jain

Found 39 papers, 12 papers with code

Multilingual State Space Models for Structured Question Answering in Indic Languages

1 code implementation1 Feb 2025 Arpita Vats, Rahul Raja, Mrinal Mathur, Vinija Jain, Aman Chadha

The diversity and complexity of Indic languages present unique challenges for natural language processing (NLP) tasks, particularly in the domain of question answering (QA). To address these challenges, this paper explores the application of State Space Models (SSMs), to build efficient and contextually aware QA systems tailored for Indic languages.

Answer Generation Diversity +2

IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding

no code implementations27 Jan 2025 Sankalp KJ, Ashutosh Kumar, Laxmaan Balaji, Nikunj Kotecha, Vinija Jain, Aman Chadha, Sreyoshi Bhaduri

Known by more than 1. 5 billion people in the Indian subcontinent, Indic languages present unique challenges and opportunities for natural language processing (NLP) research due to their rich cultural heritage, linguistic diversity, and complex structures.

Benchmarking Diversity +2

On the Feasibility of Vision-Language Models for Time-Series Classification

1 code implementation23 Dec 2024 Vinay Prithyani, Mohsin Mohammed, Richa Gadgil, Ricardo Buitrago, Vinija Jain, Aman Chadha

We develop a novel approach that incorporates graphical data representations as images in conjunction with numerical data.

Time Series Time Series Classification

SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval

no code implementations19 Dec 2024 Aakash Mahalingam, Vinesh Kumar Gande, Aman Chadha, Vinija Jain, Divya Chaudhary

Notably, on the Italian Cuisine dataset, SKETCH achieved an answer relevancy of 0. 94 and a context precision of 0. 99, representing the highest performance across all evaluated metrics.

Knowledge Graphs RAG +3

Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types

1 code implementation14 Sep 2024 Neelabh Sinha, Vinija Jain, Aman Chadha

Visual Question-Answering (VQA) has become key to user experience, particularly after improved generalization capabilities of Vision-Language Models (VLMs).

Language Modeling Language Modelling +3

Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles

1 code implementation18 Jun 2024 Devichand Budagam, Ashutosh Kumar, Mahsa Khoshnoodi, Sankalp KJ, Vinija Jain, Aman Chadha

It assesses the complexity of tasks with the Hierarchical Prompting Index (HPI), which demonstrates the cognitive competencies of LLMs across diverse datasets and offers insights into the cognitive demands that datasets place on different LLMs.

Arithmetic Reasoning Code Generation +10

Investigating Annotator Bias in Large Language Models for Hate Speech Detection

3 code implementations17 Jun 2024 Amit Das, Zheng Zhang, Najib Hasan, Souvika Sarkar, Fatemeh Jamshidi, Tathagata Bhattacharya, Mostafa Rahgouy, Nilanjana Raychawdhary, Dongji Feng, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

This paper serves as a crucial resource, guiding researchers and practitioners in harnessing the potential of LLMs for data annotation, thereby fostering advancements in this critical field.

Descriptive Hate Speech Detection

Are Small Language Models Ready to Compete with Large Language Models for Practical Applications?

1 code implementation17 Jun 2024 Neelabh Sinha, Vinija Jain, Aman Chadha

The rapid rise of Language Models (LMs) has expanded their use in several applications.

Decoding the Diversity: A Review of the Indic AI Research Landscape

no code implementations13 Jun 2024 Sankalp KJ, Vinija Jain, Sreyoshi Bhaduri, Tamoghna Roy, Aman Chadha

This work aims to serve as a valuable resource for researchers and practitioners working in the field of NLP, particularly those focused on Indic languages, and contributes to the development of more accurate and efficient LLM applications for these languages.

Benchmarking Diversity +2

How Culturally Aware are Vision-Language Models?

no code implementations24 May 2024 Olena Burda-Lassen, Aman Chadha, Shashank Goswami, Vinija Jain

Our research compares the performance of four popular vision-language models (GPT-4V, Gemini Pro Vision, LLaVA, and OpenFlamingo) in identifying culturally specific information in such images and creating accurate and culturally sensitive image captions.

Image Captioning

A Comprehensive Survey of Hallucination in Large Language, Image, Video and Audio Foundation Models

no code implementations15 May 2024 Pranab Sahoo, Prabhash Meharia, Akash Ghosh, Sriparna Saha, Vinija Jain, Aman Chadha

The rapid advancement of foundation models (FMs) across language, image, audio, and video domains has shown remarkable capabilities in diverse tasks.

Hallucination

A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models

1 code implementation15 May 2024 Mahsa Khoshnoodi, Vinija Jain, Mingye Gao, Malavika Srikanth, Aman Chadha

Despite the crucial importance of accelerating text generation in large language models (LLMs) for efficiently producing content, the sequential nature of this process often leads to high inference latency, posing challenges for real-time applications.

Survey Text Generation

Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications

no code implementations21 Apr 2024 Charith Chandra Sai Balne, Sreyoshi Bhaduri, Tamoghna Roy, Vinija Jain, Aman Chadha

The rise of deep learning has marked significant progress in fields such as computer vision, natural language processing, and medical imaging, primarily through the adaptation of pre-trained models for specific tasks.

Computational Efficiency Model Optimization +3

Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation

no code implementations25 Mar 2024 Sanyam Lakhanpal, Shivang Chopra, Vinija Jain, Aman Chadha, Man Luo

We introduce a benchmark, LenCom-Eval, specifically designed for testing models' capability in generating images with Lengthy and Complex visual text.

Optical Character Recognition (OCR) Text-to-Image Generation

OffensiveLang: A Community Based Implicit Offensive Language Dataset

1 code implementation4 Mar 2024 Amit Das, Mostafa Rahgouy, Dongji Feng, Zheng Zhang, Tathagata Bhattacharya, Nilanjana Raychawdhary, Fatemeh Jamshidi, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

Firstly, the existing datasets primarily rely on the collection of texts containing explicit offensive keywords, making it challenging to capture implicitly offensive contents that are devoid of these keywords.

Language Modelling Large Language Model +1

Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models

no code implementations3 Mar 2024 Arijit Ghosh Chowdhury, Md Mofijul Islam, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha

Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text.

Data Poisoning

Cause and Effect: Can Large Language Models Truly Understand Causality?

no code implementations28 Feb 2024 Swagata Ashwani, Kshiteesh Hegde, Nishith Reddy Mannuru, Mayank Jindal, Dushyant Singh Sengar, Krishna Chaitanya Rao Kathala, Dishant Banga, Vinija Jain, Aman Chadha

The knowledge from ConceptNet enhances the performance of multiple causal reasoning tasks such as causal discovery, causal identification and counterfactual reasoning.

Causal Discovery Causal Identification +2

Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

no code implementations16 Feb 2024 Smriti Singh, Shuvam Keshari, Vinija Jain, Aman Chadha

Socioeconomic bias in society exacerbates disparities, influencing access to opportunities and resources based on individuals' economic and social backgrounds.

Exploring the Impact of Large Language Models on Recommender Systems: An Extensive Review

no code implementations11 Feb 2024 Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha

The paper underscores the significance of Large Language Models (LLMs) in reshaping recommender systems, attributing their value to unique reasoning abilities absent in traditional recommenders.

Decision Making Recommendation Systems

A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications

no code implementations5 Feb 2024 Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Vinija Jain, Samrat Mondal, Aman Chadha

This approach leverages task-specific instructions, known as prompts, to enhance model efficacy without modifying the core model parameters.

Prompt Engineering Question Answering

The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

no code implementations15 Jan 2024 Saurav Pawar, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Aman Chadha, Amitava Das

The advent of Large Language Models (LLMs) represents a notable breakthrough in Natural Language Processing (NLP), contributing to substantial progress in both text comprehension and generation.

Reading Comprehension Survey

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

1 code implementation2 Jan 2024 S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Anku Rani, Vipula Rawte, Aman Chadha, Amitava Das

As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate generating content that appears factual but is ungrounded.

Financial Analysis Hallucination +3

Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models

no code implementations12 Dec 2023 Ibtihel Amara, Vinija Jain, Aman Chadha

We tackle the challenging issue of aggressive fine-tuning encountered during the process of transfer learning of pre-trained language models (PLMs) with limited labeled downstream data.

Transfer Learning

SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

no code implementations1 Dec 2023 Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence.

Deception Detection Multi-Task Learning

On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models

1 code implementation11 Oct 2023 Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

Through our analysis, we find that LLMs' ability to identify sentence analogies is positively correlated with their ability to encode syntactic and semantic structures of sentences.

Language Modeling Language Modelling +1

Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

no code implementations8 Oct 2023 Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

Given this cynosural spotlight on generative AI, AI-generated text detection (AGTD) has emerged as a topic that has already received immediate attention in research, with some initial methods having been proposed, soon followed by emergence of techniques to bypass detection.

Text Detection

Few-shot Multimodal Multitask Multilingual Learning

no code implementations19 Feb 2023 Aman Chadha, Vinija Jain

While few-shot learning as a transfer learning paradigm has gained significant traction for scenarios with limited data, it has primarily been explored in the context of building unimodal and unilingual models.

Few-Shot Learning In-Context Learning +10

iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability

no code implementations25 Jun 2021 Aman Chadha, Vinija Jain

We demonstrate the effectiveness of iReason using a two-pronged comparative analysis with language representation learning models (BERT, GPT-2) as well as current state-of-the-art multimodal causality models.

Bias Detection Question Answering +4

Cannot find the paper you are looking for? You can Submit a new open access paper.