Search Results for author: Athul Paul Jacob

Found 13 papers, 4 papers with code

Mode Regularized Generative Adversarial Networks

no code implementations • 7 Dec 2016 • Tong Che, Yan-ran Li, Athul Paul Jacob, Yoshua Bengio, Wenjie Li

Although Generative Adversarial Networks achieve state-of-the-art results on a variety of generative tasks, they are regarded as highly unstable and prone to miss modes.

Paper
Add Code

Boundary-Seeking Generative Adversarial Networks

6 code implementations • 27 Feb 2017 • R. Devon Hjelm, Athul Paul Jacob, Tong Che, Adam Trischler, Kyunghyun Cho, Yoshua Bengio

We introduce a method for training GANs with discrete data that uses the estimated difference measure from the discriminator to compute importance weights for generated samples, thus providing a policy gradient for training the generator.

Scene Understanding Text Generation

15,711

Paper
Code

Boundary Seeking GANs

no code implementations • ICLR 2018 • R. Devon Hjelm, Athul Paul Jacob, Adam Trischler, Gerry Che, Kyunghyun Cho, Yoshua Bengio

Scene Understanding Text Generation

Paper
Add Code

Straight to the Tree: Constituency Parsing with Neural Syntactic Distance

2 code implementations • ACL 2018 • Yikang Shen, Zhouhan Lin, Athul Paul Jacob, Alessandro Sordoni, Aaron Courville, Yoshua Bengio

In this work, we propose a novel constituency parsing scheme.

Constituency Parsing Position +1

Paper
Code

Learning Hierarchical Structures On-The-Fly with a Recurrent-Recursive Model for Sequences

no code implementations • WS 2018 • Athul Paul Jacob, Zhouhan Lin, Aless Sordoni, ro, Yoshua Bengio

We propose a hierarchical model for sequential data that learns a tree on-the-fly, i. e. while reading the sequence.

Language Modelling Math +3

Paper
Add Code

Multitasking Inhibits Semantic Drift

no code implementations • NAACL 2021 • Athul Paul Jacob, Mike Lewis, Jacob Andreas

When intelligent agents communicate to accomplish shared goals, how do these goals shape the agents' language?

Paper
Add Code

Modeling Strong and Human-Like Gameplay with KL-Regularized Search

no code implementations • 14 Dec 2021 • Athul Paul Jacob, David J. Wu, Gabriele Farina, Adam Lerer, Hengyuan Hu, Anton Bakhtin, Jacob Andreas, Noam Brown

We consider the task of building strong but human-like policies in multi-agent decision-making problems, given examples of human behavior.

Imitation Learning

Paper
Add Code

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

1 code implementation • 11 Oct 2022 • Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown

We then show that DiL-piKL can be extended into a self-play reinforcement learning algorithm we call RL-DiL-piKL that provides a model of human play while simultaneously training an agent that responds well to this human model.

reinforcement-learning Reinforcement Learning (RL)

1,238

Paper
Code

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

1 code implementation • Science 2022 • Anton Bakhtin, Noam Brown, Emily Dinan, Gabriele Farina, Colin Flaherty, Daniel Fried, Andrew Goff, Jonathan Gray, Hengyan Hu, Athul Paul Jacob, Mojtaba Komeili, Karthik Konath, Minae Kwon, Adam Lerer, Mike Lewis, Alexander H. Miller, Sash Mitts, Aditya Renduchintala, Stephen Roller, Dirk Rowe, Weiyan Shi, Joe Spisak, Alexander Wei, David Wu, Hugh Zhang, Markus Zijlstra

Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge.

1,238

Paper
Code

AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies

no code implementations • 22 Nov 2022 • Weiyan Shi, Emily Dinan, Adi Renduchintala, Daniel Fried, Athul Paul Jacob, Zhou Yu, Mike Lewis

Existing approaches built separate classifiers to detect nonsense in dialogues.

Paper
Add Code

The Consensus Game: Language Model Generation via Equilibrium Search

no code implementations • 13 Oct 2023 • Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas

When applied to question answering and other text generation tasks, language models (LMs) may be queried generatively (by sampling answers from their output distribution) or discriminatively (by using them to score or rank a set of candidate outputs).

Language Modelling Question Answering +2

Paper
Add Code

Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

no code implementations • 16 Nov 2023 • Athul Paul Jacob, Gabriele Farina, Jacob Andreas

We present a model of pragmatic language understanding, where utterances are produced and understood by searching for regularized equilibria of signaling games.

Implicatures

Paper
Add Code

Modeling Boundedly Rational Agents with Latent Inference Budgets

no code implementations • 7 Dec 2023 • Athul Paul Jacob, Abhishek Gupta, Jacob Andreas

We study the problem of modeling a population of agents pursuing unknown goals subject to unknown computational constraints.

Decision Making Decision Making Under Uncertainty

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.