Search Results for author: Soumen Chakrabarti

Found 48 papers, 26 papers with code

A Data Bootstrapping Recipe for Low-Resource Multilingual Relation Classification

no code implementations CoNLL (EMNLP) 2021 Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

Data collection is challenging for Indian languages, because they are syntactically and morphologically diverse, as well as different from resource-rich languages like English.

Classification Relation +1

Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs

no code implementations8 Mar 2024 Arijit Nag, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

As means to reduce the number of tokens processed by the LLM, we consider code-mixing, translation, and transliteration of LRLs to HRLs.

Transliteration

How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

1 code implementation28 Feb 2024 Subhabrata Dutta, Joykirat Singh, Soumen Chakrabarti, Tanmoy Chakraborty

Despite superior reasoning prowess demonstrated by Large Language Models (LLMs) with Chain-of-Thought (CoT) prompting, a lack of understanding prevails around the internal mechanisms of the models that facilitate CoT generation.

Answer Generation

Graph Regularized Encoder Training for Extreme Classification

no code implementations28 Feb 2024 Anshul Mittal, Shikhar Mohan, Deepak Saini, Suchith C. Prabhu, Jain jiao, Sumeet Agarwal, Soumen Chakrabarti, Purushottam Kar, Manik Varma

The paper notices that in these settings, it is much more effective to use graph data to regularize encoder training than to implement a GCN.

Classification TAG

Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning

1 code implementation9 Dec 2023 Subhabrata Dutta, Joykirat Singh, Ishan Pandey, Sunny Manchanda, Soumen Chakrabarti, Tanmoy Chakraborty

In this paper, we start with the hypothesis that much smaller LMs, which are weak at multi-step reasoning, can achieve reasonable arithmetic reasoning if arithmetic word problems are posed as a formalize-then-solve task.

Ranked #11 on Math Word Problem Solving on SVAMP (using extra training data)

Arithmetic Reasoning Math Word Problem Solving

CoRE-CoG: Conversational Recommendation of Entities using Constrained Generation

no code implementations14 Nov 2023 Harshvardhan Srivastava, Kanav Pruthi, Soumen Chakrabarti, Mausam

End-to-end conversational recommendation systems (CRS) generate responses by leveraging both dialog history and a knowledge base (KB).

Recommendation Systems

CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL

1 code implementation2 Nov 2023 Mayank Kothyari, Dhruva Dhingra, Sunita Sarawagi, Soumen Chakrabarti

Standard dense retrieval techniques are inadequate for schema subsetting of a large structured database, where the correct semantics of retrieval demands that we rank sets of schema elements rather than individual elements.

Hallucination Retrieval +1

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

1 code implementation21 Oct 2023 Gurusha Juneja, Subhabrata Dutta, Soumen Chakrabarti, Sunny Manchanda, Tanmoy Chakraborty

Additionally, we show that DaSLaM is not limited by the solver's capabilities as a function of scale; e. g., solver LMs with diverse sizes give significant performance improvement with our solver-agnostic decomposition technique.

Ranked #6 on Overall - Test on JEEBench (using extra training data)

Overall - Test Problem Decomposition

Structured Case-based Reasoning for Inference-time Adaptation of Text-to-SQL parsers

no code implementations10 Jan 2023 Abhijeet Awasthi, Soumen Chakrabarti, Sunita Sarawagi

To the best of our knowledge, we are the first to attempt inference-time adaptation of Text-to-SQL models, and harness trainable structured similarity between subqueries.

Semantic Parsing Text-To-SQL

mOKB6: A Multilingual Open Knowledge Base Completion Benchmark

1 code implementation13 Nov 2022 Shubham Mittal, Keshav Kolluru, Soumen Chakrabarti, Mausam

Automated completion of open knowledge bases (Open KBs), which are constructed from triples of the form (subject phrase, relation phrase, object phrase), obtained via open information extraction (Open IE) system, are useful for discovering novel facts that may not be directly present in the text.

coreference-resolution Knowledge Base Completion +1

Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection

no code implementations20 Oct 2022 Abir De, Soumen Chakrabarti

We do not draw the concave function from a restricted family, but rather learn from data using a highly expressive neural network that implements a differentiable quadrature procedure.

Maximum Common Subgraph Guided Graph Retrieval: Late and Early Interaction Networks

no code implementations20 Oct 2022 Indradyumna Roy, Soumen Chakrabarti, Abir De

A common consideration for scoring similarity is the maximum common subgraph (MCS) between the query and corpus graphs, usually counting the number of common edges (i. e., MCES).

Retrieval

TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs

no code implementations12 Oct 2022 Aditya Sharma, Apoorv Saxena, Chitrank Gupta, Seyed Mehran Kazemi, Partha Talukdar, Soumen Chakrabarti

Recent years have witnessed much interest in temporal reasoning over knowledge graphs (KG) for complex question answering (QA), but there remains a substantial gap in human capabilities.

Knowledge Graphs Question Answering

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

2 code implementations3 Jan 2022 Subhabrata Dutta, Samiya Caur, Soumen Chakrabarti, Tanmoy Chakraborty

Detecting and labeling stance in social media text is strongly motivated by hate speech detection, poll prediction, engagement forecasting, and concerted propaganda detection.

Hate Speech Detection Propaganda detection +1

Multi-Row, Multi-Span Distant Supervision For Table+Text Question

no code implementations14 Dec 2021 Vishwajeet Kumar, Yash Gupta, Saneem Chemmengath, Jaydeep Sen, Soumen Chakrabarti, Samarth Bharadwaj, Feifei Pan

Question answering (QA) over tables and linked text, also called TextTableQA, has witnessed significant research in recent years, as tables are often found embedded in documents along with related text.

Question Answering Reading Comprehension

A Data Bootstrapping Recipe for Low Resource Multilingual Relation Classification

no code implementations18 Oct 2021 Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

Relation classification (sometimes called 'extraction') requires trustworthy datasets for fine-tuning large language models, as well as for evaluation.

Classification Relation +1

Topic Transferable Table Question Answering

1 code implementation EMNLP 2021 Saneem Ahmed Chemmengath, Vishwajeet Kumar, Samarth Bharadwaj, Jaydeep Sen, Mustafa Canim, Soumen Chakrabarti, Alfio Gliozzo, Karthik Sankaranarayanan

Weakly-supervised table question-answering(TableQA) models have achieved state-of-art performance by using pre-trained BERT transformer to jointly encoding a question and a table to produce structured query for the question.

Question Answering Question Generation +1

Incomplete Gamma Integrals for Deep Cascade Prediction using Content, Network, and Exogenous Signals

1 code implementation13 Jun 2021 Subhabrata Dutta, Shravika Mittal, Dipankar Das, Soumen Chakrabarti, Tanmoy Chakraborty

Second, there is a measurable positive correlation between the novelty of the root content (with respect to a streaming external corpus) and the relative size of the resulting cascade.

Question Answering Over Temporal Knowledge Graphs

2 code implementations ACL 2021 Apoorv Saxena, Soumen Chakrabarti, Partha Talukdar

Temporal Knowledge Graphs (Temporal KGs) extend regular Knowledge Graphs by providing temporal scopes (start and end times) on each edge in the KG.

Knowledge Graphs Question Answering

OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction

1 code implementation EMNLP 2020 Keshav Kolluru, Vaibhav Adlakha, Samarth Aggarwal, Mausam, Soumen Chakrabarti

This IGL based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12. 3 pts improvement in F1 over previous analyzers.

Open Information Extraction

IMoJIE: Iterative Memory-Based Joint Open Information Extraction

1 code implementation ACL 2020 Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, Mausam, Soumen Chakrabarti

While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task.

Open Information Extraction Sentence

Knowledge Base Completion: Baseline strikes back (Again)

1 code implementation2 May 2020 Prachi Jain, Sushant Rathi, Mausam, Soumen Chakrabarti

Most existing methods train with a small number of negative samples for each positive instance in these datasets to save computational costs.

Knowledge Base Completion Knowledge Base Population +2

Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset

no code implementations3 Nov 2019 Sahana Ramnath, Amrita Saha, Soumen Chakrabarti, Mitesh M. Khapra

With the prolification of multimodal interaction in various domains, recently there has been much interest in text based image retrieval in the computer vision community.

Graph Matching Image Retrieval +2

Differentially Private Link Prediction With Protected Connections

no code implementations20 Jul 2019 Abir De, Soumen Chakrabarti

Link prediction (LP) algorithms propose to each node a ranked list of nodes that are currently non-neighbors, as the most likely candidates for future linkage.

Learning-To-Rank Link Prediction

A Deep Generative Model for Code-Switched Text

1 code implementation21 Jun 2019 Bidisha Samanta, Sharmila Reddy, Hussain Jagirdar, Niloy Ganguly, Soumen Chakrabarti

Code-switching, the interleaving of two or more languages within a sentence or discourse is pervasive in multilingual societies.

Sentence

Topic Sensitive Attention on Generic Corpora Corrects Sense Bias in Pretrained Embeddings

1 code implementation ACL 2019 Vihari Piratla, Sunita Sarawagi, Soumen Chakrabarti

Given a small corpus $\mathcal D_T$ pertaining to a limited set of focused topics, our goal is to train embeddings that accurately capture the sense of words in the topic in spite of the limited size of $\mathcal D_T$.

GIRNet: Interleaved Multi-Task Recurrent State Sequence Models

1 code implementation28 Nov 2018 Divam Gupta, Tanmoy Chakraborty, Soumen Chakrabarti

A primary instance is also submitted to each auxiliary RNN, but their state sequences are gated and merged into a novel composite state sequence tailored to the primary inference task.

Part-Of-Speech Tagging Position +2

Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision

1 code implementation ACL 2018 Prachi Jain, Pankaj Kumar, {Mausam}, Soumen Chakrabarti

State-of-the-art knowledge base completion (KBC) models predict a score for every known or unknown fact via a latent factorization over entity and relation embeddings.

Entity Typing Knowledge Base Completion +6

Discriminative Link Prediction using Local Links, Node Features and Community Structure

no code implementations17 Oct 2013 Abir De, Niloy Ganguly, Soumen Chakrabarti

Apart from the new predictor, another contribution is a rigorous protocol for benchmarking and reporting LP algorithms, which reveals the regions of strengths and weaknesses of all the predictors studied here, and establishes the new proposal as the most robust.

Benchmarking Clustering +1

Cannot find the paper you are looking for? You can Submit a new open access paper.