Search Results for author: Ankit Anand

Found 16 papers, 4 papers with code

Code as Reward: Empowering Reinforcement Learning with VLMs

no code implementations7 Feb 2024 David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Pre-trained Vision-Language Models (VLMs) are able to understand visual concepts, describe and decompose complex tasks into sub-tasks, and provide feedback on task completion.

Code Generation reinforcement-learning +1

GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning

no code implementations19 Dec 2023 Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

In this paper, we evaluate the reasoning capabilities of VLMs along various axes through the lens of geometry problems.

Mathematical Reasoning

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

no code implementations6 Nov 2023 Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner

This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erd\H{o}s, which aims to find graphs with a given size (number of nodes) that maximize the number of edges without having 3- or 4-cycles.

Decision Making Graph Generation

Policy composition in reinforcement learning via multi-objective policy optimization

no code implementations29 Aug 2023 Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin Riedmiller, Abbas Abdolmaleki, Doina Precup

In two domains with continuous observation and action spaces, our agents successfully compose teacher policies in sequence and in parallel, and are also able to further extend the policies of the teachers in order to solve the task.

reinforcement-learning

Accelerating exploration and representation learning with offline pre-training

no code implementations31 Mar 2023 Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

In this work, we follow the hypothesis that exploration and representation learning can be improved by separately learning two different models from a single offline dataset.

Decision Making NetHack +2

Proving Theorems using Incremental Learning and Hindsight Experience Replay

no code implementations20 Dec 2021 Eser Aygün, Laurent Orseau, Ankit Anand, Xavier Glorot, Vlad Firoiu, Lei M. Zhang, Doina Precup, Shibl Mourad

Traditional automated theorem provers for first-order logic depend on speed-optimized search and many handcrafted heuristics that are designed to work best over a wide range of domains.

Automated Theorem Proving Incremental Learning

Training a First-Order Theorem Prover from Synthetic Data

no code implementations5 Mar 2021 Vlad Firoiu, Eser Aygun, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup, Shibl Mourad

A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models.

Automated Theorem Proving BIG-bench Machine Learning

Learning Compositional Structures for Deep Learning: Why Routing-by-agreement is Necessary

no code implementations4 Oct 2020 Sai Raam Venkatraman, Ankit Anand, S. Balasubramanian, R. Raghunatha Sarma

We present a formal grammar description of convolutional neural networks and capsule networks that shows how capsule networks can enforce such parse-tree structures, while CNNs do not.

Learning to Prove from Synthetic Theorems

no code implementations19 Jun 2020 Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup, Shibl Mourad

A major challenge in applying machine learning to automated theorem proving is the scarcity of training data, which is a key ingredient in training successful deep learning models.

Automated Theorem Proving

AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition

no code implementations LREC 2020 Afroz Ahamad, Ankit Anand, Pranesh Bhargava

In this work, we first spell out the key requirements for creating a well-curated database of speech samples in non-native accents for training and testing robust ASR systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Block-Value Symmetries in Probabilistic Graphical Models

1 code implementation2 Jul 2018 Gagan Madan, Ankit Anand, Mausam, Parag Singla

These orbits are represented compactly using permutations over variables, and variable-value (VV) pairs, but they can miss several state symmetries in a domain.

Non-Count Symmetries in Boolean & Multi-Valued Prob. Graphical Models

1 code implementation27 Jul 2017 Ankit Anand, Ritesh Noothigattu, Parag Singla, Mausam

Moreover, algorithms for lifted inference in multi-valued domains also compute a multi-valued extension of count symmetries only.

Coarse-to-Fine Lifted MAP Inference in Computer Vision

1 code implementation22 Jul 2017 Haroun Habeeb, Ankit Anand, Mausam, Parag Singla

We demonstrate the performance of C2F inference by developing lifted versions of two near state-of-the-art CV algorithms for stereo vision and interactive image segmentation.

Image Segmentation Semantic Segmentation

Contextual Symmetries in Probabilistic Graphical Models

no code implementations30 Jun 2016 Ankit Anand, Aditya Grover, Mausam, Parag Singla

We extend previous work on exploiting symmetries in the MCMC framework to the case of contextual symmetries.

Cannot find the paper you are looking for? You can Submit a new open access paper.