no code implementations • 3 Nov 2022 • Gábor Melis
That some purely recurrent models are hard to optimize and run inefficiently on today's hardware does not necessarily make them bad models of language.
no code implementations • 26 Sep 2022 • Gábor Melis
In practice, with a finite number of optimization steps and a learning rate that cannot be annealed to zero, Tail Averaging can get much closer to a local minimum point of the training loss than either the individual iterates or the Polyak average.
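The contrast is easy to state in code. Below is a minimal NumPy sketch of plain tail averaging against the full Polyak average; the function names and the `tail_fraction` parameter are illustrative choices, not the paper's API.

```python
import numpy as np

def polyak_average(iterates):
    # Polyak-Ruppert averaging: the mean of all optimizer iterates,
    # including the early ones that are still far from the optimum.
    return np.mean(iterates, axis=0)

def tail_average(iterates, tail_fraction=0.1):
    # Average only the last `tail_fraction` of the iterates, discarding
    # the early transient so the estimate hugs the local minimum more
    # closely when the learning rate is never annealed to zero.
    start = int(len(iterates) * (1.0 - tail_fraction))
    return np.mean(iterates[start:], axis=0)
```

With a constant learning rate, the early iterates bias the full average away from the minimum; dropping them is the whole trick.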
no code implementations • 1 Dec 2020 • Gábor Melis, András György, Phil Blunsom
A common failure mode of density models trained as variational autoencoders is to model the data without relying on their latent variables, rendering these variables useless.
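As standard background (not the paper's specific contribution), the failure mode is visible directly in the evidence lower bound that such models maximize:

```latex
\mathcal{L}(x) \;=\; \mathbb{E}_{q_\phi(z \mid x)}\!\bigl[\log p_\theta(x \mid z)\bigr]
\;-\; \mathrm{KL}\bigl(q_\phi(z \mid x) \,\|\, p(z)\bigr)
```

A sufficiently powerful decoder can drive the reconstruction term up while setting the approximate posterior equal to the prior for every input, so the KL term vanishes and the latent variable carries no information about the data.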
no code implementations • CODI 2021 • Elman Mansimov, Gábor Melis, Lei Yu
Neural machine translation (NMT) has arguably achieved human-level parity when trained and evaluated at the sentence level.
1 code implementation • 20 Sep 2019 • Chris Dyer, Gábor Melis, Phil Blunsom
A series of recent papers has used a parsing algorithm due to Shen et al. (2018) to recover phrase-structure trees based on proxies for "syntactic depth."
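For reference, the algorithm under analysis is a greedy top-down splitter over per-gap scores. A minimal sketch, with illustrative variable names:

```python
def greedy_tree(words, depths):
    # Recover an unlabeled binary tree from "syntactic depth" scores,
    # one score per gap between adjacent words (len(depths) ==
    # len(words) - 1), by always splitting at the largest score first.
    if len(words) == 1:
        return words[0]
    split = max(range(len(depths)), key=lambda i: depths[i])
    left = greedy_tree(words[:split + 1], depths[:split])
    right = greedy_tree(words[split + 1:], depths[split + 1:])
    return (left, right)

# greedy_tree(["the", "cat", "sat"], [0.2, 0.9])
# => (("the", "cat"), "sat")
```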
3 code implementations • ICLR 2020 • Gábor Melis, Tomáš Kočiský, Phil Blunsom
Many advances in Natural Language Processing have been based upon more expressive models for how inputs interact with the context in which they occur.
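This entry appears to be the Mogrifier LSTM (Melis et al., ICLR 2020); assuming so, the core idea is to let the input and the previous hidden state gate each other a few times before entering the LSTM cell proper. A minimal NumPy sketch:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mogrify(x, h_prev, Q, R, rounds=5):
    # Alternately modulate the input x by the previous state h_prev and
    # vice versa, before both enter an ordinary LSTM cell. Q and R are
    # lists of learned projection matrices (shapes (d_x, d_h) and
    # (d_h, d_x)); the factor 2 keeps the expected scale of the gated
    # vectors unchanged.
    for i in range(1, rounds + 1):
        if i % 2 == 1:
            x = 2 * sigmoid(Q[i // 2] @ h_prev) * x
        else:
            h_prev = 2 * sigmoid(R[i // 2 - 1] @ x) * h_prev
    return x, h_prev
```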
1 code implementation • NAACL 2019 • Yoon Kim, Alexander M. Rush, Lei Yu, Adhiguna Kuncoro, Chris Dyer, Gábor Melis
On language modeling, unsupervised RNNGs perform as well as their supervised counterparts on benchmarks in English and Chinese.
Ranked #8 on Constituency Grammar Induction on Penn Treebank (Max F1 (WSJ) metric)
1 code implementation • 4 Jul 2018 • Tiago Ramalho, Tomáš Kočiský, Frederic Besse, S. M. Ali Eslami, Gábor Melis, Fabio Viola, Phil Blunsom, Karl Moritz Hermann
Natural language processing has made significant inroads into learning the semantics of words through distributional approaches; however, representations learnt via these methods fail to capture certain kinds of information implicit in the real world.
1 code implementation • ICLR 2019 • Gábor Melis, Charles Blundell, Tomáš Kočiský, Karl Moritz Hermann, Chris Dyer, Phil Blunsom
We show that dropout training is best understood as performing MAP estimation concurrently for a family of conditional models whose objectives are themselves lower bounded by the original dropout objective.
Ranked #24 on Language Modelling on Penn Treebank (Word Level)
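One instance of such a bound follows from Jensen's inequality alone: the usual dropout objective (an expectation of log-likelihoods over masks m) lower-bounds the log-likelihood of the mixture-over-masks model. This is a standard illustration of the kind of bound the abstract refers to, not the paper's full construction:

```latex
\underbrace{\mathbb{E}_{m \sim p(m)}\bigl[\log p(y \mid x, \theta \odot m)\bigr]}_{\text{dropout objective}}
\;\le\;
\log \mathbb{E}_{m \sim p(m)}\bigl[p(y \mid x, \theta \odot m)\bigr]
```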
2 code implementations • TACL 2018 • Tomáš Kočiský, Jonathan Schwarz, Phil Blunsom, Chris Dyer, Karl Moritz Hermann, Gábor Melis, Edward Grefenstette
Reading comprehension (RC), in contrast to information retrieval, requires integrating information and reasoning about events, entities, and their relations across a full document.
Ranked #9 on Question Answering on NarrativeQA (BLEU-1 metric)
1 code implementation • ICLR 2018 • Gábor Melis, Chris Dyer, Phil Blunsom
Ongoing innovations in recurrent neural network architectures have provided a steady influx of apparently state-of-the-art results on language modelling benchmarks.
Ranked #32 on Language Modelling on WikiText-2
no code implementations • EMNLP 2016 • Tomáš Kočiský, Gábor Melis, Edward Grefenstette, Chris Dyer, Wang Ling, Phil Blunsom, Karl Moritz Hermann
We present a novel semi-supervised approach for sequence transduction and apply it to semantic parsing.