Search Results for author: Soham Dan

Found 30 papers, 6 papers with code

Goodness-of-Fit Tests for Inhomogeneous Random Graphs

no code implementations • ICML 2020 • Soham Dan, Bhaswar B. Bhattacharya

Hypothesis testing of random networks is an emerging area of modern research, especially in the high-dimensional regime, where the number of samples is smaller or comparable to the size of the graph.

Two-sample testing

Paper
Add Code

Few-Shot Novel Concept Learning for Semantic Parsing

no code implementations • Findings (EMNLP) 2021 • Soham Dan, Osbert Bastani, Dan Roth

This way the concept learning problem is naturally a program synthesis problem and our algorithm learns from a few examples to synthesize a program representing the novel concept.

Novel Concepts Program Synthesis +1

Paper
Add Code

Compositional Data and Task Augmentation for Instruction Following

no code implementations • Findings (EMNLP) 2021 • Soham Dan, Xinran Han, Dan Roth

Executing natural language instructions in a physically grounded domain requires a model that understands both spatial concepts such as “left of” and “above”, and the compositional language used to identify landmarks and articulate instructions relative to them.

Instruction Following

Paper
Add Code

On the Effects of Transformer Size on In- and Out-of-Domain Calibration

no code implementations • Findings (EMNLP) 2021 • Soham Dan, Dan Roth

To reduce the cost of training such large models, prior work has developed smaller, more compact models which achieves a significant speedup in training time while maintaining competitive accuracy to the original model on downstream tasks.

Paper
Add Code

On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

no code implementations • 15 Apr 2024 • Mauricio Gruppi, Soham Dan, Keerthiram Murugesan, Subhajit Chaudhury

Moreover, we describe the occurrence of semantic degeneration as a consequence of inappropriate fine-tuning of language models in text-based reinforcement learning (TBRL).

reinforcement-learning

Paper
Add Code

Larimar: Large Language Models with Episodic Memory Control

no code implementations • 18 Mar 2024 • Payel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk, Sarath Swaminathan, Sihui Dai, Aurélie Lozano, Georgios Kollias, Vijil Chenthamarakshan, Jiří, Navrátil, Soham Dan, Pin-Yu Chen

Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today.

Paper
Add Code

Language Guided Exploration for RL Agents in Text Environments

no code implementations • 5 Mar 2024 • Hitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan

Real-world sequential decision making is characterized by sparse rewards and large decision spaces, posing significant difficulty for experiential learning systems like $\textit{tabula rasa}$ reinforcement learning (RL) agents.

Decision Making Language Modelling +2

Paper
Add Code

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models

no code implementations • 28 Feb 2024 • Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie Lozano, Payel Das, Georgios Kollias

Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks.

Machine Translation Natural Language Inference

Paper
Add Code

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

1 code implementation • 23 Feb 2024 • Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras

There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks.

Benchmarking slot-filling +2

Paper
Code

On the generalization capacity of neural networks during generic multimodal reasoning

1 code implementation • 26 Jan 2024 • Takuya Ito, Soham Dan, Mattia Rigotti, James Kozloski, Murray Campbell

On the other hand, neither of these architectural features led to productive generalization, suggesting fundamental limitations of existing architectures for specific types of multimodal generalization.

Multimodal Reasoning Systematic Generalization

Paper
Code

On the Calibration of Multilingual Question Answering LLMs

no code implementations • 15 Nov 2023 • Yahan Yang, Soham Dan, Dan Roth, Insup Lee

We also conduct several ablation experiments to study the effect of language distances, language corpus size, and model size on calibration, and how multilingual models compare with their monolingual counterparts for diverse tasks and languages.

Cross-Lingual Transfer Data Augmentation +4

Paper
Add Code

Formally Specifying the High-Level Behavior of LLM-Based Agents

no code implementations • 12 Oct 2023 • Maxwell Crouse, Ibrahim Abdelaziz, Ramon Astudillo, Kinjal Basu, Soham Dan, Sadhana Kumaravel, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Luis Lastras

We demonstrate how the proposed framework can be used to implement recent LLM-based agents (e. g., ReACT), and show how the flexibility of our approach can be leveraged to define a new agent with more complex behavior, the Plan-Act-Summarize-Solve (PASS) agent.

Question Answering

Paper
Add Code

Compositional Program Generation for Few-Shot Systematic Generalization

1 code implementation • 28 Sep 2023 • Tim Klinger, Luke Liu, Soham Dan, Maxwell Crouse, Parikshit Ram, Alexander Gray

Compositional generalization is a key ability of humans that enables us to learn new concepts from only a handful examples.

Systematic Generalization

Paper
Code

MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

no code implementations • 18 Jun 2023 • Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

In this work, we propose a new evaluation scheme to model human judgments in 7 NLP tasks, based on the fine-grained mismatches between a pair of texts.

Sentence

Paper
Add Code

Generalized Planning in PDDL Domains with Pretrained Large Language Models

1 code implementation • 18 May 2023 • Tom Silver, Soham Dan, Kavitha Srinivas, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Michael Katz

We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently produces plans for other tasks in the domain.

Paper
Code

In and Out-of-Domain Text Adversarial Robustness via Label Smoothing

no code implementations • 20 Dec 2022 • Yahan Yang, Soham Dan, Dan Roth, Insup Lee

Recently it has been shown that state-of-the-art NLP models are vulnerable to adversarial attacks, where the predictions of a model can be drastically altered by slight modifications to the input (such as synonym substitutions).

Adversarial Robustness

Paper
Add Code

One Arrow, Two Kills: An Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits

no code implementations • 26 Oct 2022 • Pierre Gaillard, Aadirupa Saha, Soham Dan

We address the problem of \emph{`Internal Regret'} in \emph{Sleeping Bandits} in the fully adversarial setup, as well as draw connections between different existing notions of sleeping regrets in the multiarmed bandits (MAB) literature and consequently analyze the implications: Our first contribution is to propose the new notion of \emph{Internal Regret} for sleeping MAB.

Paper
Add Code

Human-guided Collaborative Problem Solving: A Natural Language based Framework

no code implementations • 19 Jul 2022 • Harsha Kokel, Mayukh Das, Rakibul Islam, Julia Bonn, Jon Cai, Soham Dan, Anjali Narayan-Chen, Prashant Jayannavar, Janardhan Rao Doppa, Julia Hockenmaier, Sriraam Natarajan, Martha Palmer, Dan Roth

We consider the problem of human-machine collaborative problem solving as a planning task coupled with natural language communication.

Paper
Add Code

Cross-modal Map Learning for Vision and Language Navigation

1 code implementation • CVPR 2022 • Georgios Georgakis, Karl Schmeckpeper, Karan Wanchoo, Soham Dan, Eleni Miltsakaki, Dan Roth, Kostas Daniilidis

We consider the problem of Vision-and-Language Navigation (VLN).

Vision and Language Navigation

Paper
Code

Understanding Robust Generalization in Learning Regular Languages

no code implementations • 20 Feb 2022 • Soham Dan, Osbert Bastani, Dan Roth

Currently, deep neural networks struggle to generalize robustly to such shifts in the data distribution.

Paper
Add Code

Dynamics of Local Elasticity During Training of Neural Nets

1 code implementation • 1 Nov 2021 • Soham Dan, Anirbit Mukherjee, Avirup Das, Phanideep Gampa

On various state-of-the-art neural network training on SVHN, CIFAR-10 and CIFAR-100 we demonstrate how our new proposal of $S_{\rm rel}$, as opposed to the original definition, much more sharply detects the property of the weight updates preferring to make prediction changes within the same class as the sampled data.

regression

Paper
Code

Generalization in Instruction Following Systems

no code implementations • NAACL 2021 • Soham Dan, Michael Zhou, Dan Roth

Understanding and executing natural language instructions in a grounded domain is one of the hallmarks of artificial intelligence.

Data Augmentation Instruction Following

Paper
Add Code

A Locally Linear Procedure for Word Translation

no code implementations • COLING 2020 • Soham Dan, Hagai Taitelbaum, Jacob Goldberger

We propose a natural extension of the PA algorithm that uses multiple orthogonal translation matrices to model the mapping and derive an algorithm to learn these multiple matrices.

Translation Word Embeddings +2

Paper
Add Code

Understanding Spatial Relations through Multiple Modalities

no code implementations • LREC 2020 • Soham Dan, Hangfeng He, Dan Roth

Recognizing spatial relations and reasoning about them is essential in multiple applications including navigation, direction giving and human-computer interaction in general.

Common Sense Reasoning Implicit Relations

Paper
Add Code

From Spatial Relations to Spatial Configurations

no code implementations • LREC 2020 • Soham Dan, Parisa Kordjamshidi, Julia Bonn, Archna Bhatia, Jon Cai, Martha Palmer, Dan Roth

To exhibit the applicability of our representation scheme, we annotate text taken from diverse datasets and show how we extend the capabilities of existing spatial representation languages with the fine-grained decomposition of semantics and blend it seamlessly with AMRs of sentences and discourse representations as a whole.

Natural Language Understanding

Paper
Add Code

Learning from Noisy Similar and Dissimilar Data

no code implementations • 3 Feb 2020 • Soham Dan, Han Bao, Masashi Sugiyama

We perform a detailed investigation of this problem under two realistic noise models and propose two algorithms to learn from noisy S-D data.

Paper
Add Code

Variance Reduced Stochastic Proximal Algorithm for AUC Maximization

no code implementations • 8 Nov 2019 • Soham Dan, Dushyant Sahoo

To combat this issue, several variance reduced methods have been proposed with faster convergence guarantees than vanilla stochastic gradient descent.

Paper
Add Code

AppTechMiner: Mining Applications and Techniques from Scientific Articles

no code implementations • 10 Sep 2017 • Mayank Singh, Soham Dan, Sanyam Agarwal, Pawan Goyal, Animesh Mukherjee

We also categorize individual research articles based on their application areas and the techniques proposed/improved in the article.

Information Retrieval Retrieval

Paper
Add Code

Towards Problem Solving Agents that Communicate and Learn

no code implementations • WS 2017 • Anjali Narayan-Chen, Colin Graber, Mayukh Das, Md. Rakibul Islam, Soham Dan, Sriraam Natarajan, Janardhan Rao Doppa, Julia Hockenmaier, Martha Palmer, Dan Roth

Agents that communicate back and forth with humans to help them execute non-linguistic tasks are a long sought goal of AI.

Semantic Parsing

Paper
Add Code

Which techniques does your application use?: An information extraction framework for scientific articles

no code implementations • 23 Aug 2016 • Soham Dan, Sanyam Agarwal, Mayank Singh, Pawan Goyal, Animesh Mukherjee

Every field of research consists of multiple application areas with various techniques routinely used to solve problems in these wide range of application areas.

Language Modelling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.