Search Results for author: Adam Trischler

Found 57 papers, 32 papers with code

Boundary-Seeking Generative Adversarial Networks

6 code implementations • 27 Feb 2017 • R. Devon Hjelm, Athul Paul Jacob, Tong Che, Adam Trischler, Kyunghyun Cho, Yoshua Bengio

We introduce a method for training GANs with discrete data that uses the estimated difference measure from the discriminator to compute importance weights for generated samples, thus providing a policy gradient for training the generator.

Scene Understanding Text Generation

15,676

Paper
Code

Twin Networks: Matching the Future for Sequence Generation

2 code implementations • ICLR 2018 • Dmitriy Serdyuk, Nan Rosemary Ke, Alessandro Sordoni, Adam Trischler, Chris Pal, Yoshua Bengio

We propose a simple technique for encouraging generative RNNs to plan ahead.

Caption Generation speech-recognition +1

2,351

Paper
Code

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

4 code implementations • ICLR 2018 • Sandeep Subramanian, Adam Trischler, Yoshua Bengio, Christopher J. Pal

In this work, we present a simple, effective multi-task learning framework for sentence representations that combines the inductive biases of diverse training objectives in a single model.

Ranked #1 on Semantic Textual Similarity on SentEval

Multi-Task Learning Natural Language Inference +2

2,279

Paper
Code

Learning deep representations by mutual information estimation and maximization

9 code implementations • ICLR 2019 • R. Devon Hjelm, Alex Fedorov, Samuel Lavoie-Marchildon, Karan Grewal, Phil Bachman, Adam Trischler, Yoshua Bengio

In this work, we perform unsupervised learning of representations by maximizing mutual information between an input and the output of a deep neural network encoder.

General Classification Mutual Information Estimation +1

791

Paper
Code

Iterative Alternating Neural Attention for Machine Reading

1 code implementation • 7 Jun 2016 • Alessandro Sordoni, Philip Bachman, Adam Trischler, Yoshua Bengio

We propose a novel neural attention architecture to tackle machine comprehension tasks, such as answering Cloze-style queries with respect to a document.

Ranked #3 on Question Answering on Children's Book Test (Accuracy-NE metric)

Question Answering Reading Comprehension

436

Paper
Code

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

1 code implementation • 8 Oct 2020 • Mohit Shridhar, Xingdi Yuan, Marc-Alexandre Côté, Yonatan Bisk, Adam Trischler, Matthew Hausknecht

ALFWorld enables the creation of a new BUTLER agent whose abstract knowledge, learned in TextWorld, corresponds directly to concrete, visually grounded actions.

Natural Language Visual Grounding Scene Understanding

249

Paper
Code

Machine Comprehension by Text-to-Text Neural Question Generation

4 code implementations • WS 2017 • Xingdi Yuan, Tong Wang, Caglar Gulcehre, Alessandro Sordoni, Philip Bachman, Sandeep Subramanian, Saizheng Zhang, Adam Trischler

We propose a recurrent neural model that generates natural-language questions from documents, conditioned on answers.

Question Answering Question Generation +4

216

Paper
Code

One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

1 code implementation • ACL 2020 • Xingdi Yuan, Tong Wang, Rui Meng, Khushboo Thaker, Peter Brusilovsky, Daqing He, Adam Trischler

With both previous and new evaluation metrics, our model outperforms strong baselines on all datasets.

Keyphrase Generation

213

Paper
Code

Does Order Matter? An Empirical Study on Generating Multiple Keyphrases as a Sequence

1 code implementation • 9 Sep 2019 • Rui Meng, Xingdi Yuan, Tong Wang, Peter Brusilovsky, Adam Trischler, Daqing He

Recently, concatenating multiple keyphrases as a target sequence has been proposed as a new learning paradigm for keyphrase generation.

Keyphrase Generation

213

Paper
Code

An Empirical Study on Neural Keyphrase Generation

1 code implementation • NAACL 2021 • Rui Meng, Xingdi Yuan, Tong Wang, Sanqiang Zhao, Adam Trischler, Daqing He

Recent years have seen a flourishing of neural keyphrase generation (KPG) works, including the release of several large-scale datasets and a host of new models to tackle them.

Keyphrase Generation

213

Paper
Code

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models

1 code implementation • NeurIPS 2017 • Francis Dutil, Caglar Gulcehre, Adam Trischler, Yoshua Bengio

We investigate the integration of a planning mechanism into sequence-to-sequence models using attention.

Question Generation Question-Generation +2

169

Paper
Code

Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder

1 code implementation • 13 Jun 2017 • Caglar Gulcehre, Francis Dutil, Adam Trischler, Yoshua Bengio

We investigate the integration of a planning mechanism into an encoder-decoder architecture with an explicit alignment for character-level machine translation.

Machine Translation Translation

169

Paper
Code

An Empirical Study of Example Forgetting during Deep Neural Network Learning

3 code implementations • ICLR 2019 • Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon

Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks.

General Classification

165

Paper
Code

Joint Prompt Optimization of Stacked LLMs using Variational Inference

1 code implementation • NeurIPS 2023 • Alessandro Sordoni, Xingdi Yuan, Marc-Alexandre Côté, Matheus Pereira, Adam Trischler, Ziang Xiao, Arian Hosseini, Friederike Niedtner, Nicolas Le Roux

Thus, they can be seen as stochastic language layers in a language network, where the learnable parameters are the natural language prompts at each layer.

Natural Language Understanding Variational Inference

Paper
Code

NewsQA: A Machine Comprehension Dataset

2 code implementations • WS 2017 • Adam Trischler, Tong Wang, Xingdi Yuan, Justin Harris, Alessandro Sordoni, Philip Bachman, Kaheer Suleman

We present NewsQA, a challenging machine comprehension dataset of over 100, 000 human-generated question-answer pairs.

Natural Language Inference Reading Comprehension

Paper
Code

Exploring and Predicting Transferability across NLP Tasks

1 code implementation • EMNLP 2020 • Tu Vu, Tong Wang, Tsendsuren Munkhdalai, Alessandro Sordoni, Adam Trischler, Andrew Mattarella-Micke, Subhransu Maji, Mohit Iyyer

We also develop task embeddings that can be used to predict the most transferable source tasks for a given target task, and we validate their effectiveness in experiments controlled for source and target data size.

Language Modelling Part-Of-Speech Tagging +4

Paper
Code

Interactive Language Learning by Question Answering

1 code implementation • IJCNLP 2019 • Xingdi Yuan, Marc-Alexandre Cote, Jie Fu, Zhouhan Lin, Christopher Pal, Yoshua Bengio, Adam Trischler

In QAit, an agent must interact with a partially observable text-based environment to gather information required to answer questions.

Machine Reading Comprehension Question Answering

Paper
Code

FigureQA: An Annotated Figure Dataset for Visual Reasoning

1 code implementation • ICLR 2018 • Samira Ebrahimi Kahou, Vincent Michalski, Adam Atkinson, Akos Kadar, Adam Trischler, Yoshua Bengio

To resolve, such questions often require reference to multiple plot elements and synthesis of information distributed spatially throughout a figure.

Ranked #3 on Visual Question Answering (VQA) on FigureQA - test 1

BIG-bench Machine Learning Chart Question Answering +2

Paper
Code

Learning Dynamic Belief Graphs to Generalize on Text-Based Games

1 code implementation • NeurIPS 2020 • Ashutosh Adhikari, Xingdi Yuan, Marc-Alexandre Côté, Mikuláš Zelinka, Marc-Antoine Rondeau, Romain Laroche, Pascal Poupart, Jian Tang, Adam Trischler, William L. Hamilton

Playing text-based games requires skills in processing natural language and sequential decision making.

Decision Making Knowledge Graphs +2

Paper
Code

Counting to Explore and Generalize in Text-based Games

2 code implementations • 29 Jun 2018 • Xingdi Yuan, Marc-Alexandre Côté, Alessandro Sordoni, Romain Laroche, Remi Tachet des Combes, Matthew Hausknecht, Adam Trischler

We propose a recurrent RL agent with an episodic exploration mechanism that helps discovering good policies in text-based game environments.

text-based games

Paper
Code

A Parallel-Hierarchical Model for Machine Comprehension on Sparse Data

1 code implementation • ACL 2016 • Adam Trischler, Zheng Ye, Xingdi Yuan, Jing He, Phillip Bachman, Kaheer Suleman

The parallel hierarchy enables our model to compare the passage, question, and answer from a variety of trainable perspectives, as opposed to using a manually designed, rigid feature set.

Ranked #1 on Question Answering on MCTest-160

Question Answering Reading Comprehension +1

Paper
Code

Interactive Machine Comprehension with Information Seeking Agents

1 code implementation • ACL 2020 • Xingdi Yuan, Jie Fu, Marc-Alexandre Cote, Yi Tay, Christopher Pal, Adam Trischler

Existing machine reading comprehension (MRC) models do not scale effectively to real-world applications like web-level information retrieval and question answering (QA).

Decision Making Information Retrieval +3

Paper
Code

Role-Wise Data Augmentation for Knowledge Distillation

1 code implementation • ICLR 2020 • Jie Fu, Xue Geng, Zhijian Duan, Bohan Zhuang, Xingdi Yuan, Adam Trischler, Jie Lin, Chris Pal, Hao Dong

To our knowledge, existing methods overlook the fact that although the student absorbs extra knowledge from the teacher, both models share the same input data -- and this data is the only medium by which the teacher's knowledge can be demonstrated.

Data Augmentation Knowledge Distillation

Paper
Code

Think Before You Act: Decision Transformers with Internal Working Memory

1 code implementation • 24 May 2023 • Jikun Kang, Romain Laroche, Xindi Yuan, Adam Trischler, Xue Liu, Jie Fu

We argue that this inefficiency stems from the forgetting phenomenon, in which a model memorizes its behaviors in parameters throughout training.

Atari Games Decision Making +2

Paper
Code

Building Dynamic Knowledge Graphs from Text-based Games

1 code implementation • 21 Oct 2019 • Mikuláš Zelinka, Xingdi Yuan, Marc-Alexandre Côté, Romain Laroche, Adam Trischler

We are interested in learning how to update Knowledge Graphs (KG) from text.

Knowledge Graphs text-based games

Paper
Code

Modeling Event Plausibility with Consistent Conceptual Abstraction

1 code implementation • NAACL 2021 • Ian Porada, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung

Understanding natural language requires common sense, one aspect of which is the ability to discern the plausibility of events.

Common Sense Reasoning

Paper
Code

The Knowref Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora Resolution

1 code implementation • ACL 2019 • Ali Emami, Paul Trichelair, Adam Trischler, Kaheer Suleman, Hannes Schulz, Jackie Chi Kit Cheung

To explain this performance gap, we show empirically that state-of-the art models often fail to capture context, instead relying on the gender or number of candidate antecedents to make a decision.

Common Sense Reasoning coreference-resolution +2

Paper
Code

How Reasonable are Common-Sense Reasoning Tasks: A Case-Study on the Winograd Schema Challenge and SWAG

1 code implementation • IJCNLP 2019 • Paul Trichelair, Ali Emami, Adam Trischler, Kaheer Suleman, Jackie Chi Kit Cheung

Recent studies have significantly improved the state-of-the-art on common-sense reasoning (CSR) benchmarks like the Winograd Schema Challenge (WSC) and SWAG.

Ranked #36 on Coreference Resolution on Winograd Schema Challenge

Common Sense Reasoning Coreference Resolution

Paper
Code

On the Systematicity of Probing Contextualized Word Representations: The Case of Hypernymy in BERT

1 code implementation • Joint Conference on Lexical and Computational Semantics 2020 • Abhilasha Ravichander, Eduard Hovy, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung

In particular, we demonstrate through a simple consistency probe that the ability to correctly retrieve hypernyms in cloze tasks, as used in prior work, does not correspond to systematic knowledge in BERT.

Paper
Code

TextWorld: A Learning Environment for Text-based Games

1 code implementation • 29 Jun 2018 • Marc-Alexandre Côté, Ákos Kádár, Xingdi Yuan, Ben Kybartas, Tavian Barnes, Emery Fine, James Moore, Ruo Yu Tao, Matthew Hausknecht, Layla El Asri, Mahmoud Adada, Wendy Tay, Adam Trischler

We introduce TextWorld, a sandbox learning environment for the training and evaluation of RL agents on text-based games.

text-based games Transfer Learning

Paper
Code

The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems

1 code implementation • 15 Dec 2022 • Akshatha Arodi, Martin Pömsl, Kaheer Suleman, Adam Trischler, Alexandra Olteanu, Jackie Chi Kit Cheung

In this work, we propose a test suite of coreference resolution subtasks that require reasoning over multiple facts.

coreference-resolution Natural Language Understanding

Paper
Code

Focused Hierarchical RNNs for Conditional Sequence Processing

no code implementations • ICML 2018 • Nan Rosemary Ke, Konrad Zolna, Alessandro Sordoni, Zhouhan Lin, Adam Trischler, Yoshua Bengio, Joelle Pineau, Laurent Charlin, Chris Pal

We evaluate this method on several types of tasks with different attributes.

Ranked #3 on Open-Domain Question Answering on SearchQA (Unigram Acc metric)

Open-Domain Question Answering Policy Gradient Methods

Paper
Add Code

Neural Models for Key Phrase Detection and Question Generation

no code implementations • 14 Jun 2017 • Sandeep Subramanian, Tong Wang, Xingdi Yuan, Saizheng Zhang, Yoshua Bengio, Adam Trischler

We propose a two-stage neural model to tackle question generation from documents.

Question Answering Question Generation +2

Paper
Add Code

Rapid Adaptation with Conditionally Shifted Neurons

no code implementations • ICML 2018 • Tsendsuren Munkhdalai, Xingdi Yuan, Soroush Mehri, Adam Trischler

We describe a mechanism by which artificial neural networks can learn rapid adaptation - the ability to adapt on the fly, with little data, to new tasks - that we call conditionally shifted neurons.

Ranked #11 on Few-Shot Image Classification on OMNIGLOT - 1-Shot, 20-way

Few-Shot Image Classification

Paper
Add Code

Variational Bi-LSTMs

no code implementations • ICLR 2018 • Samira Shabanian, Devansh Arpit, Adam Trischler, Yoshua Bengio

Bidirectional LSTMs (Bi-LSTMs) on the other hand model sequences along both forward and backward directions and are generally known to perform better at such tasks because they capture a richer representation of the data.

Paper
Add Code

Learning Algorithms for Active Learning

no code implementations • ICML 2017 • Philip Bachman, Alessandro Sordoni, Adam Trischler

We introduce a model that learns active learning algorithms via metalearning.

Active Learning

Paper
Add Code

A Joint Model for Question Answering and Question Generation

no code implementations • 5 Jun 2017 • Tong Wang, Xingdi Yuan, Adam Trischler

We propose a generative machine comprehension model that learns jointly to ask and answer questions based on documents.

Question Answering Question Generation +2

Paper
Add Code

Towards Information-Seeking Agents

no code implementations • 8 Dec 2016 • Philip Bachman, Alessandro Sordoni, Adam Trischler

We develop a general problem setting for training and testing the ability of agents to gather information efficiently.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Natural Language Comprehension with the EpiReader

no code implementations • EMNLP 2016 • Adam Trischler, Zheng Ye, Xingdi Yuan, Kaheer Suleman

We present the EpiReader, a novel model for machine comprehension of text.

Ranked #7 on Question Answering on Children's Book Test

Question Answering Reading Comprehension

Paper
Add Code

Synthesis of recurrent neural networks for dynamical system simulation

no code implementations • 17 Dec 2015 • Adam Trischler, Gabriele MT D'Eleuterio

We review several of the most widely used techniques for training recurrent neural networks to approximate dynamical systems, then describe a novel algorithm for this task.

Paper
Add Code

Metalearning with Hebbian Fast Weights

no code implementations • 12 Jul 2018 • Tsendsuren Munkhdalai, Adam Trischler

We unify recent neural approaches to one-shot learning with older ideas of associative memory in a model for metalearning.

One-Shot Learning

Paper
Add Code

A Knowledge Hunting Framework for Common Sense Reasoning

no code implementations • EMNLP 2018 • Ali Emami, Noelia De La Cruz, Adam Trischler, Kaheer Suleman, Jackie Chi Kit Cheung

We introduce an automatic system that achieves state-of-the-art results on the Winograd Schema Challenge (WSC), a common sense reasoning task that requires diverse, complex forms of inference and knowledge.

Ranked #65 on Coreference Resolution on Winograd Schema Challenge

Common Sense Reasoning Coreference Resolution

Paper
Add Code

Building Dynamic Knowledge Graphs from Text using Machine Reading Comprehension

no code implementations • ICLR 2019 • Rajarshi Das, Tsendsuren Munkhdalai, Xingdi Yuan, Adam Trischler, Andrew McCallum

We harness and extend a recently proposed machine reading comprehension (MRC) model to query for entity states, since these states are generally communicated in spans of text and MRC models perform well in extracting entity-centric spans.

Ranked #3 on Procedural Text Understanding on ProPara

Knowledge Graphs Machine Reading Comprehension +2

Paper
Add Code

A Generalized Knowledge Hunting Framework for the Winograd Schema Challenge

no code implementations • NAACL 2018 • Ali Emami, Adam Trischler, Kaheer Suleman, Jackie Chi Kit Cheung

We introduce an automatic system that performs well on two common-sense reasoning tasks, the Winograd Schema Challenge (WSC) and the Choice of Plausible Alternatives (COPA).

Common Sense Reasoning Coreference Resolution +1

Paper
Add Code

Neural Models for Key Phrase Extraction and Question Generation

no code implementations • WS 2018 • S Subramanian, eep, Tong Wang, Xingdi Yuan, Saizheng Zhang, Adam Trischler, Yoshua Bengio

We propose a two-stage neural model to tackle question generation from documents.

Question Answering Question Generation +2

Paper
Add Code

Plan, Attend, Generate: Character-Level Neural Machine Translation with Planning

no code implementations • WS 2017 • Caglar Gulcehre, Francis Dutil, Adam Trischler, Yoshua Bengio

We investigate the integration of a planning mechanism into an encoder-decoder architecture with attention.

Hierarchical Reinforcement Learning Machine Translation +3

Paper
Add Code

Towards Text Generation with Adversarially Learned Neural Outlines

no code implementations • NeurIPS 2018 • Sandeep Subramanian, Sai Rajeswar Mudumba, Alessandro Sordoni, Adam Trischler, Aaron C. Courville, Chris Pal

We generate outlines with an adversarial model trained to approximate the distribution of sentences in a latent space induced by general-purpose sentence encoders.

Sentence Text Generation

Paper
Add Code

Boundary Seeking GANs

no code implementations • ICLR 2018 • R. Devon Hjelm, Athul Paul Jacob, Adam Trischler, Gerry Che, Kyunghyun Cho, Yoshua Bengio

Scene Understanding Text Generation

Paper
Add Code

A Study of State Aliasing in Structured Prediction with RNNs

no code implementations • ICLR Workshop drlStructPred 2019 • Layla El Asri, Adam Trischler

We show through extensive experiments and analysis that, when trained with policy gradient, recurrent neural networks often fail to learn a state representation that leads to an optimal policy in settings where the same action should be taken at different states.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Metalearned Neural Memory

1 code implementation • NeurIPS 2019 • Tsendsuren Munkhdalai, Alessandro Sordoni, Tong Wang, Adam Trischler

We augment recurrent neural networks with an external memory mechanism that builds upon recent progress in metalearning.

Question Answering reinforcement-learning +1

Paper
Code

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

no code implementations • EMNLP 2020 • Tao Shen, Yi Mao, Pengcheng He, Guodong Long, Adam Trischler, Weizhu Chen

In contrast to existing paradigms, our approach uses knowledge graphs implicitly, only during pre-training, to inject language models with structured knowledge via learning from raw text.

Entity Linking Knowledge Base Completion +5

Paper
Add Code

BUTLER: Building Understanding in TextWorld via Language for Embodied Reasoning

no code implementations • ICLR 2021 • Mohit Shridhar, Xingdi Yuan, Marc-Alexandre Cote, Yonatan Bisk, Adam Trischler, Matthew Hausknecht

ALFWorld enables the creation of a new BUTLER agent whose abstract knowledge, learned in TextWorld, corresponds directly to concrete, visually grounded actions.

Scene Understanding

Paper
Add Code

An Analysis of Dataset Overlap on Winograd-Style Tasks

no code implementations • COLING 2020 • Ali Emami, Adam Trischler, Kaheer Suleman, Jackie Chi Kit Cheung

The Winograd Schema Challenge (WSC) and variants inspired by it have become important benchmarks for common-sense reasoning (CSR).

Common Sense Reasoning

Paper
Add Code

ADEPT: An Adjective-Dependent Plausibility Task

no code implementations • ACL 2021 • Ali Emami, Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung

A false contract is more likely to be rejected than a contract is, yet a false key is less likely than a key to open doors.

Common Sense Reasoning Natural Language Understanding +1

Paper
Add Code

Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications

no code implementations • NAACL 2022 • Kaitlyn Zhou, Su Lin Blodgett, Adam Trischler, Hal Daumé III, Kaheer Suleman, Alexandra Olteanu

There are many ways to express similar things in text, which makes evaluating natural language generation (NLG) systems difficult.

nlg evaluation Text Generation

Paper
Add Code

Investigating Failures to Generalize for Coreference Resolution Models

no code implementations • 16 Mar 2023 • Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung

We investigate the extent to which errors of current coreference resolution models are associated with existing differences in operationalization across datasets (OntoNotes, PreCo, and Winogrande).

coreference-resolution

Paper
Add Code

Responsible AI Considerations in Text Summarization Research: A Review of Current Practices

no code implementations • 18 Nov 2023 • Yu Lu Liu, Meng Cao, Su Lin Blodgett, Jackie Chi Kit Cheung, Alexandra Olteanu, Adam Trischler

We focus on how, which, and when responsible AI issues are covered, which relevant stakeholders are considered, and mismatches between stated and realized research goals.

Text Summarization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.