1 code implementation • ACL 2022 • Pedro Henrique Martins, Zita Marinho, Andre Martins
Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length.
1 code implementation • 24 May 2022 • Pedro Henrique Martins, Zita Marinho, André F. T. Martins
Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples.
1 code implementation • SpaNLP (ACL) 2022 • Pedro Henrique Martins, Zita Marinho, André F. T. Martins
On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving examples from an in-domain datastore (Khandelwal et al., 2021).
no code implementations • 8 Dec 2021 • Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram Friesen, Feryal Behbahani, Tom Schaul, André Barreto, Simon Osindero
Unlike prior work which estimates uncertainty by training an ensemble of many models and/or value functions, this approach requires only the single model and value function which are already being learned in most model-based reinforcement learning algorithms.
Model-based Reinforcement Learning Rolling Shutter Correction
no code implementations • NeurIPS 2021 • Gregory Farquhar, Kate Baumli, Zita Marinho, Angelos Filos, Matteo Hessel, Hado van Hasselt, David Silver
Learned models of the environment provide reinforcement learning (RL) agents with flexible ways of making predictions about the environment.
1 code implementation • 1 Sep 2021 • Pedro Henrique Martins, Zita Marinho, André F. T. Martins
Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length.
Ranked #1 on Dialogue Generation on CMU-DoG
1 code implementation • 12 May 2021 • Ruben Cardoso, Zita Marinho, Afonso Mendes, Sebastião Miranda
Information retrieval tools are crucial in order to navigate and provide meaningful recommendations for articles and treatments.
no code implementations • SEMEVAL 2020 • Susan Wang, Zita Marinho
Our best model, an average ensemble of four different Bert models, achieved 11th place out of 82 participants with a macro F1 score of 0. 91344 in the English SubTask A.
1 code implementation • EMNLP 2020 • Pedro Henrique Martins, Zita Marinho, André F. T. Martins
Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance.
1 code implementation • 13 Feb 2020 • Pedro Henrique Martins, Vlad Niculae, Zita Marinho, André Martins
Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA).
no code implementations • ACL 2019 • Pedro Henrique Martins, Zita Marinho, André F. T. Martins
Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected.
Ranked #10 on Entity Linking on AIDA-CoNLL
no code implementations • WS 2019 • Zita Marinho, Afonso Mendes, Mir, Sebasti{\~a}o a, David Nogueira
In the medical domain and other scientific areas, it is often important to recognize different levels of hierarchy in mentions, such as those related to specific symptoms or diseases associated with different anatomical regions.
no code implementations • 3 Apr 2019 • Sebastião Miranda, David Nogueira, Afonso Mendes, Andreas Vlachos, Andrew Secker, Rebecca Garrett, Jeff Mitchel, Zita Marinho
Fact checking is an essential task in journalism; its importance has been highlighted due to recently increased concerns and efforts in combating misinformation.
1 code implementation • NAACL 2019 • Afonso Mendes, Shashi Narayan, Sebastião Miranda, Zita Marinho, André F. T. Martins, Shay B. Cohen
We present a new neural model for text summarization that first extracts sentences from a document and then compresses them.
2 code implementations • ICML 2018 • Ahmed Hefny, Zita Marinho, Wen Sun, Siddhartha Srinivasa, Geoffrey Gordon
Predictive state policy networks consist of a recursive filter, which keeps track of a belief about the state of the environment, and a reactive policy that directly maps beliefs to actions, to maximize the cumulative reward.