Search Results for author: Satwik Bhattamishra

Found 12 papers, 8 papers with code

MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations

1 code implementation 18 Oct 2023 Arkil Patel, Satwik Bhattamishra, Siva Reddy, Dzmitry Bahdanau

Additionally, our analysis uncovers the semantic predispositions in LLMs and reveals the impact of recency bias when information is presented in long contexts.

In-Context Learning, Semantic Parsing, +1

Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions

no code implementations 4 Oct 2023 Satwik Bhattamishra, Arkil Patel, Phil Blunsom, Varun Kanade

In this work, we take a step towards answering these questions by demonstrating the following: (a) On a test-bed with a variety of Boolean function classes, we find that Transformers can nearly match the optimal learning algorithm for 'simpler' tasks, while their performance deteriorates on more 'complex' tasks.

In-Context Learning
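
As a rough illustration of the kind of discrete-function test-bed described above (the function class, prompt format, and helper names here are assumptions for exposition, not the paper's exact setup), a random conjunction over a few input bits can be sampled and serialized into a prompt of labelled in-context examples:

```python
import random

def sample_conjunction(n_bits: int, k: int):
    """Sample a random conjunction over k of n_bits variables (possibly negated)."""
    idx = random.sample(range(n_bits), k)
    signs = [random.choice([0, 1]) for _ in idx]  # 1 means the literal is negated
    def f(x):
        return int(all(x[i] != s for i, s in zip(idx, signs)))
    return f

def make_prompt(f, n_bits: int, n_examples: int):
    """Serialize labelled examples of f into a text prompt for in-context learning."""
    lines = []
    for _ in range(n_examples):
        x = [random.randint(0, 1) for _ in range(n_bits)]
        lines.append(f"input: {' '.join(map(str, x))} label: {f(x)}")
    query = [random.randint(0, 1) for _ in range(n_bits)]
    lines.append(f"input: {' '.join(map(str, query))} label:")
    return "\n".join(lines), f(query)

random.seed(0)
f = sample_conjunction(n_bits=8, k=3)
prompt, target = make_prompt(f, n_bits=8, n_examples=16)
print(prompt)
print("# expected label:", target)
```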

Structural Transfer Learning in NL-to-Bash Semantic Parsers

no code implementations 31 Jul 2023 Kyle Duffy, Satwik Bhattamishra, Phil Blunsom

Large-scale pre-training has driven progress in many fields of natural language processing, though little is understood about the design of pre-training datasets.

Machine Translation, Semantic Parsing, +2

DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization

no code implementations 20 Jun 2023 Amey Agrawal, Sameer Reddy, Satwik Bhattamishra, Venkata Prabhakara Sarath Nookala, Vidushi Vashishth, Kexin Rong, Alexey Tumanov

With the increase in the scale of Deep Learning (DL) training workloads in terms of compute resources and time consumption, the likelihood of encountering in-training failures rises substantially, leading to lost work and resource wastage.

Model Compression, Quantization, +1
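
DynaQuant's scheme adapts precision dynamically across parameters; as a much simpler sketch of the underlying idea only (uniform per-tensor int8 quantization, not the paper's actual method), a PyTorch checkpoint can be compressed and approximately restored as follows:

```python
import torch

def quantize_checkpoint(state_dict, num_bits: int = 8):
    """Uniformly quantize each float tensor in a checkpoint to `num_bits` integers,
    storing a per-tensor scale so the weights can be approximately reconstructed."""
    qmax = 2 ** (num_bits - 1) - 1
    compressed = {}
    for name, tensor in state_dict.items():
        if not tensor.is_floating_point():
            compressed[name] = (tensor, None)  # leave non-float buffers untouched
            continue
        scale = tensor.abs().max().clamp(min=1e-12) / qmax
        q = torch.round(tensor / scale).clamp(-qmax - 1, qmax).to(torch.int8)
        compressed[name] = (q, scale)
    return compressed

def dequantize_checkpoint(compressed):
    """Reconstruct an approximate float checkpoint from the quantized tensors."""
    return {name: (q if scale is None else q.float() * scale)
            for name, (q, scale) in compressed.items()}

# Example: round-trip a small model's checkpoint and measure the reconstruction error.
model = torch.nn.Linear(16, 4)
compressed = quantize_checkpoint(model.state_dict())
restored = dequantize_checkpoint(compressed)
err = max((model.state_dict()[k] - restored[k]).abs().max().item() for k in restored)
print(f"max absolute reconstruction error: {err:.6f}")
```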

Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions

1 code implementation 22 Nov 2022 Satwik Bhattamishra, Arkil Patel, Varun Kanade, Phil Blunsom

(ii) When trained on Boolean functions, both Transformers and LSTMs prioritize learning functions of low sensitivity, with Transformers ultimately converging to functions of lower sensitivity.
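
Sensitivity here is the standard Boolean-function measure: for an input, the number of single-bit flips that change the function's output, averaged over all inputs. A brute-force computation (illustrative only; it enumerates all 2^n inputs, so it is practical only for small n) makes the quantity concrete:

```python
from itertools import product

def average_sensitivity(f, n_bits: int) -> float:
    """Average, over all 2^n inputs, of the number of coordinates whose flip changes f."""
    total = 0
    for x in product([0, 1], repeat=n_bits):
        x = list(x)
        for i in range(n_bits):
            flipped = x.copy()
            flipped[i] ^= 1
            if f(x) != f(flipped):
                total += 1
    return total / 2 ** n_bits

# Parity changes under every bit flip; a dictator function depends on a single bit.
parity = lambda x: sum(x) % 2
dictator = lambda x: x[0]
print(average_sensitivity(parity, 6))    # 6.0 -> high sensitivity
print(average_sensitivity(dictator, 6))  # 1.0 -> low sensitivity
```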

Revisiting the Compositional Generalization Abilities of Neural Sequence Models

1 code implementation ACL 2022 Arkil Patel, Satwik Bhattamishra, Phil Blunsom, Navin Goyal

Compositional generalization is a fundamental trait in humans, allowing us to effortlessly combine known phrases to form novel sentences.

Are NLP Models really able to Solve Simple Math Word Problems?

3 code implementations NAACL 2021 Arkil Patel, Satwik Bhattamishra, Navin Goyal

Since existing solvers achieve high performance on benchmark datasets for elementary-level MWPs containing one-unknown arithmetic word problems, such problems are often considered "solved", with the bulk of research attention moving to more complex MWPs.

Math, Math Word Problem Solving, +1

On the Practical Ability of Recurrent Neural Networks to Recognize Hierarchical Languages

1 code implementation COLING 2020 Satwik Bhattamishra, Kabir Ahuja, Navin Goyal

We find that while recurrent models generalize nearly perfectly if the lengths of the training and test strings are from the same range, they perform poorly if the test strings are longer.
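
The hierarchical languages studied are Dyck-style bracket languages. As a sketch of the kind of length-split evaluation described (the generator and the particular length ranges are illustrative assumptions, not the paper's exact setup), well-nested Dyck-1 strings can be sampled and partitioned into shorter training lengths and longer test lengths:

```python
import random

def sample_dyck1(n_pairs: int) -> str:
    """Sample a random well-nested Dyck-1 string with n_pairs bracket pairs."""
    s, opens_left, depth = [], n_pairs, 0
    while opens_left > 0 or depth > 0:
        # Must open if nothing is currently open; must close if no opens remain.
        if depth == 0 or (opens_left > 0 and random.random() < 0.5):
            s.append("(")
            opens_left -= 1
            depth += 1
        else:
            s.append(")")
            depth -= 1
    return "".join(s)

random.seed(0)
train = [sample_dyck1(random.randint(1, 25)) for _ in range(1000)]  # lengths 2..50
test = [sample_dyck1(random.randint(26, 50)) for _ in range(200)]   # lengths 52..100
print(max(len(s) for s in train), min(len(s) for s in test))
```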

On the Ability and Limitations of Transformers to Recognize Formal Languages

1 code implementation EMNLP 2020 Satwik Bhattamishra, Kabir Ahuja, Navin Goyal

Our analysis also provides insights into the role of the self-attention mechanism in modeling certain behaviors and the influence of positional encoding schemes on the learning and generalization abilities of the model.
