Search Results for author: Ankit Anand

Found 16 papers, 4 papers with code

Code as Reward: Empowering Reinforcement Learning with VLMs

no code implementations • 7 Feb 2024 • David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Pre-trained Vision-Language Models (VLMs) are able to understand visual concepts, describe and decompose complex tasks into sub-tasks, and provide feedback on task completion.

Code Generation reinforcement-learning +1

Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection

no code implementations • 26 Jan 2024 • Abdullateef I. Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M. Carroll, Estera Crisan, William Bradford, Lauren Walter, Eaton Ellen, Sue S. Feldman, John D. Osborne

Conclusions: We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers.

Multi-Task Learning Negation

GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning

no code implementations • 19 Dec 2023 • Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

In this paper, we evaluate the reasoning capabilities of VLMs along various axes through the lens of geometry problems.

Mathematical Reasoning

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

no code implementations • 6 Nov 2023 • Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner

This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erdős, which aims to find graphs with a given size (number of nodes) that maximize the number of edges without having 3- or 4-cycles.

Decision Making Graph Generation

AutoMix: Automatically Mixing Language Models

1 code implementation • 19 Oct 2023 • Aman Madaan, Pranjal Aggarwal, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Mausam, Manaal Faruqui

Large language models (LLMs) are now available from cloud API providers in various sizes and configurations.

Policy composition in reinforcement learning via multi-objective policy optimization

no code implementations • 29 Aug 2023 • Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin Riedmiller, Abbas Abdolmaleki, Doina Precup

In two domains with continuous observation and action spaces, our agents successfully compose teacher policies in sequence and in parallel, and are also able to further extend the policies of the teachers in order to solve the task.

reinforcement-learning

Accelerating exploration and representation learning with offline pre-training

no code implementations • 31 Mar 2023 • Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

In this work, we follow the hypothesis that exploration and representation learning can be improved by separately learning two different models from a single offline dataset.

Decision Making NetHack +2

Proving Theorems using Incremental Learning and Hindsight Experience Replay

no code implementations • 20 Dec 2021 • Eser Aygün, Laurent Orseau, Ankit Anand, Xavier Glorot, Vlad Firoiu, Lei M. Zhang, Doina Precup, Shibl Mourad

Traditional automated theorem provers for first-order logic depend on speed-optimized search and many handcrafted heuristics that are designed to work best over a wide range of domains.

Automated Theorem Proving Incremental Learning

Training a First-Order Theorem Prover from Synthetic Data

no code implementations • 5 Mar 2021 • Vlad Firoiu, Eser Aygun, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup, Shibl Mourad

A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models.

Automated Theorem Proving BIG-bench Machine Learning

Learning Compositional Structures for Deep Learning: Why Routing-by-agreement is Necessary

no code implementations • 4 Oct 2020 • Sai Raam Venkatraman, Ankit Anand, S. Balasubramanian, R. Raghunatha Sarma

We present a formal grammar description of convolutional neural networks and capsule networks that shows how capsule networks can enforce such parse-tree structures, while CNNs do not.

Learning to Prove from Synthetic Theorems

no code implementations • 19 Jun 2020 • Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup, Shibl Mourad

A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models.

Automated Theorem Proving

AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition

no code implementations • LREC 2020 • Afroz Ahamad, Ankit Anand, Pranesh Bhargava

In this work, we first spell out the key requirements for creating a well-curated database of speech samples in non-native accents for training and testing robust ASR systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Block-Value Symmetries in Probabilistic Graphical Models

1 code implementation • 2 Jul 2018 • Gagan Madan, Ankit Anand, Mausam, Parag Singla

These orbits are represented compactly using permutations over variables, and variable-value (VV) pairs, but they can miss several state symmetries in a domain.

Non-Count Symmetries in Boolean & Multi-Valued Prob. Graphical Models

1 code implementation • 27 Jul 2017 • Ankit Anand, Ritesh Noothigattu, Parag Singla, Mausam

Moreover, algorithms for lifted inference in multi-valued domains also compute a multi-valued extension of count symmetries only.

Coarse-to-Fine Lifted MAP Inference in Computer Vision

1 code implementation • 22 Jul 2017 • Haroun Habeeb, Ankit Anand, Mausam, Parag Singla

We demonstrate the performance of C2F inference by developing lifted versions of two near state-of-the-art CV algorithms for stereo vision and interactive image segmentation.

Image Segmentation Semantic Segmentation

Contextual Symmetries in Probabilistic Graphical Models

no code implementations • 30 Jun 2016 • Ankit Anand, Aditya Grover, Mausam, Parag Singla

We extend previous work on exploiting symmetries in the MCMC framework to the case of contextual symmetries.
