Search Results for author: Subhabrata Dutta

Found 16 papers, 11 papers with code

Problem Solving Through Human-AI Preference-Based Cooperation

no code implementations14 Aug 2024 Subhabrata Dutta, Timo Kaufmann, Goran Glavaš, Ivan Habernal, Kristian Kersting, Frauke Kreuter, Mira Mezini, Iryna Gurevych, Eyke Hüllermeier, Hinrich Schuetze

While there is a widespread belief that artificial general intelligence (AGI) -- or even superhuman AI -- is imminent, complex problems in expert domains are far from being solved.

Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks

1 code implementation17 May 2024 Anwoy Chatterjee, Eshaan Tanwar, Subhabrata Dutta, Tanmoy Chakraborty

We design a cross-task prompting setup with three LLMs and show that LLMs achieve significant performance improvements despite no examples from the target task in the context.

In-Context Learning

$\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning

no code implementations2 Apr 2024 Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty

The solver model generates the solution to the subproblems that are then checked by the verifier module; depending upon the feedback from the verifier, the reasoning context is constructed using the subproblems and the solutions.

Math

How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

1 code implementation28 Feb 2024 Subhabrata Dutta, Joykirat Singh, Soumen Chakrabarti, Tanmoy Chakraborty

Despite superior reasoning prowess demonstrated by Large Language Models (LLMs) with Chain-of-Thought (CoT) prompting, a lack of understanding prevails around the internal mechanisms of the models that facilitate CoT generation.

Answer Generation

Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning

1 code implementation9 Dec 2023 Subhabrata Dutta, Joykirat Singh, Ishan Pandey, Sunny Manchanda, Soumen Chakrabarti, Tanmoy Chakraborty

In this paper, we start with the hypothesis that much smaller LMs, which are weak at multi-step reasoning, can achieve reasonable arithmetic reasoning if arithmetic word problems are posed as a formalize-then-solve task.

Ranked #12 on Math Word Problem Solving on SVAMP (using extra training data)

Arithmetic Reasoning Math Word Problem Solving

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

1 code implementation21 Oct 2023 Gurusha Juneja, Subhabrata Dutta, Soumen Chakrabarti, Sunny Manchanda, Tanmoy Chakraborty

Additionally, we show that DaSLaM is not limited by the solver's capabilities as a function of scale; e. g., solver LMs with diverse sizes give significant performance improvement with our solver-agnostic decomposition technique.

Ranked #6 on Overall - Test on JEEBench (using extra training data)

Overall - Test Problem Decomposition

Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment

1 code implementation10 May 2023 Eshaan Tanwar, Subhabrata Dutta, Manish Borthakur, Tanmoy Chakraborty

In-context learning (ICL) unfolds as large language models become capable of inferring test labels conditioned on a few labeled samples without any gradient update.

In-Context Learning text-classification +1

Hatemongers ride on echo chambers to escalate hate speech diffusion

1 code implementation5 Feb 2023 Vasu Goel, Dhruv Sahnan, Subhabrata Dutta, Anil Bandhakavi, Tanmoy Chakraborty

We analyze more than 32 million posts from over 6. 8 million users across three popular online social networks to investigate the interrelations between hateful behavior, information dissemination, and polarised organization mediated by echo chambers.

Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining?

1 code implementation ACL 2022 Subhabrata Dutta, Jeevesh Juneja, Dipankar Das, Tanmoy Chakraborty

Identifying argument components from unstructured texts and predicting the relationships expressed among them are two primary steps of argument mining.

Argument Mining Language Modelling +2

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

2 code implementations3 Jan 2022 Subhabrata Dutta, Samiya Caur, Soumen Chakrabarti, Tanmoy Chakraborty

Detecting and labeling stance in social media text is strongly motivated by hate speech detection, poll prediction, engagement forecasting, and concerted propaganda detection.

Hate Speech Detection Propaganda detection +1

Incomplete Gamma Integrals for Deep Cascade Prediction using Content, Network, and Exogenous Signals

1 code implementation13 Jun 2021 Subhabrata Dutta, Shravika Mittal, Dipankar Das, Soumen Chakrabarti, Tanmoy Chakraborty

Second, there is a measurable positive correlation between the novelty of the root content (with respect to a streaming external corpus) and the relative size of the resulting cascade.

Hate is the New Infodemic: A Topic-aware Modeling of Hate Speech Diffusion on Twitter

1 code implementation9 Oct 2020 Sarah Masud, Subhabrata Dutta, Sakshi Makkar, Chhavi Jain, Vikram Goyal, Amitava Das, Tanmoy Chakraborty

Meanwhile, to predict the retweet dynamics on Twitter, we propose RETINA, a novel neural architecture that incorporates exogenous influence using scaled dot-product attention.

World Knowledge

Modeling Engagement Dynamics of Online Discussions using Relativistic Gravitational Theory

no code implementations10 Aug 2019 Subhabrata Dutta, Dipankar Das, Tanmoy Chakraborty

Unlike previous studies which model a discussion in a static manner, in the present study, we model it as a time-varying process and solve two inter-related problems -- predict which user groups will get engaged with an ongoing discussion, and forecast the growth rate of a discussion in terms of the number of comments.

Normalyzing Numeronyms -- A NLP approach

no code implementations31 Jul 2019 Avishek Garain, Sainik Kumar Mahata, Subhabrata Dutta

This paper presents a method to apply Natural Language Processing for normalizing numeronyms to make them understandable by humans.

How did the discussion go: Discourse act classification in social media conversations

no code implementations7 Aug 2018 Subhabrata Dutta, Tanmoy Chakraborty, Dipankar Das

Our proposed model outperformed the previous one in terms of domain independence; without using platform-dependent structural features, our hierarchical LSTM with word relevance attention mechanism achieved F1-scores of 71\% and 66\% respectively to predict discourse roles of comments in Reddit and Facebook discussions.

General Classification Sentence +1

Cannot find the paper you are looking for? You can Submit a new open access paper.