Search Results for author: Marcos Treviso

Found 15 papers, 10 with code

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

1 code implementation · 21 Sep 2023 · Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

Our team participated in all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2).

Tasks: Sentence, Task 2

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

1 code implementation · 26 May 2023 · Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins

Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models.

Tasks: counterfactual, Data Augmentation, +2 more

The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics

1 code implementation · 19 May 2023 · Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins

Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU.

Tasks: Decision Making, Machine Translation, +2 more
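
As a hedged illustration of the correlation claim above: segment-level metric-human agreement is commonly measured with Pearson's r. The sketch below uses made-up scores, not data from the paper.

```python
# Minimal sketch (not the paper's code): Pearson correlation between a
# metric's segment-level scores and human judgments. All numbers are
# placeholders.
from scipy.stats import pearsonr

human_scores  = [0.9, 0.4, 0.7, 0.2, 0.8]    # hypothetical human judgments
metric_scores = [0.85, 0.5, 0.65, 0.3, 0.9]  # hypothetical metric outputs

r, p = pearsonr(human_scores, metric_scores)
print(f"Pearson r = {r:.3f} (p = {p:.3f})")
```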

Learning to Scaffold: Optimizing Model Explanations for Teaching

1 code implementation · 22 Apr 2022 · Patrick Fernandes, Marcos Treviso, Danish Pruthi, André F. T. Martins, Graham Neubig

In this work, leveraging meta-learning techniques, we extend this idea to improve the quality of the explanations themselves, specifically by optimizing explanations such that student models more effectively learn to simulate the original model.

Tasks: Meta-Learning
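
A heavily simplified sketch of the student-simulates-teacher idea in the snippet above; the `teacher`, `student`, and `explainer` callables are hypothetical stand-ins, and this is not the paper's meta-learning procedure.

```python
# Heavily simplified sketch of "student simulates teacher" (NOT the paper's
# meta-learning procedure). `teacher`, `student`, and `explainer` are
# hypothetical callables; the explainer scores input features in [0, 1].
import torch
import torch.nn as nn

def simulability(teacher, student, explainer, inputs, epochs=5):
    """Train the student to reproduce teacher predictions from
    explanation-masked inputs; return student-teacher agreement."""
    with torch.no_grad():
        targets = teacher(inputs).argmax(dim=-1)   # teacher's hard labels
        masks = explainer(inputs)                  # feature saliency in [0, 1]
    optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        optimizer.zero_grad()
        loss = loss_fn(student(inputs * masks), targets)
        loss.backward()
        optimizer.step()
    with torch.no_grad():
        preds = student(inputs * masks).argmax(dim=-1)
    return (preds == targets).float().mean().item()
```

Per the snippet, the paper then goes a step further: the explanations themselves are optimized (via meta-learning) so that students simulate the original model more effectively.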

Predicting Attention Sparsity in Transformers

no code implementations · spnlp (ACL) 2022 · Marcos Treviso, António Góis, Patrick Fernandes, Erick Fonseca, André F. T. Martins

Transformers' quadratic complexity with respect to the input sequence length has motivated a body of work on efficient sparse approximations to softmax.

Tasks: Language Modelling, Machine Translation, +3 more
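
The quadratic cost the snippet refers to is visible in a plain softmax-attention implementation: the score matrix has one entry per query-key pair. A minimal NumPy sketch, with illustrative shapes (not the paper's sparse method):

```python
# Minimal NumPy sketch of dense softmax attention; the (n, n) score matrix
# is what makes cost quadratic in sequence length. Shapes are illustrative.
import numpy as np

def full_attention(Q, K, V):
    scores = Q @ K.T / np.sqrt(Q.shape[-1])            # (n, n) pairwise scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # row-wise softmax
    return weights @ V

n, d = 1024, 64
Q, K, V = (np.random.randn(n, d) for _ in range(3))
out = full_attention(Q, K, V)  # the score matrix alone holds n*n ~ 1M floats
```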

Sparse Continuous Distributions and Fenchel-Young Losses

1 code implementation · 4 Aug 2021 · André F. T. Martins, Marcos Treviso, António Farinhas, Pedro M. Q. Aguiar, Mário A. T. Figueiredo, Mathieu Blondel, Vlad Niculae

In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $\alpha$-entmax, and fusedmax) has led to distributions with varying support.

Tasks: Audio Classification, Question Answering, +1 more
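
Sparsemax, one of the finite-domain alternatives named in the snippet, can be computed in closed form; the NumPy sketch below is an illustration, not the authors' released code.

```python
# Minimal NumPy sketch of sparsemax (Martins & Astudillo, 2016), one of the
# sparse softmax alternatives named above; alpha-entmax generalizes it.
import numpy as np

def sparsemax(z):
    """Project logits z onto the probability simplex; exact zeros can appear."""
    z_sorted = np.sort(z)[::-1]              # logits in descending order
    cumsum = np.cumsum(z_sorted)
    k = np.arange(1, len(z) + 1)
    support = 1 + k * z_sorted > cumsum      # which coordinates stay positive
    k_z = k[support][-1]                     # size of the support
    tau = (cumsum[support][-1] - 1) / k_z    # threshold
    return np.maximum(z - tau, 0.0)

print(sparsemax(np.array([2.0, 1.0, -1.0])))  # [1. 0. 0.]: a sparse output
```

Unlike softmax, the output can contain exact zeros, which is what gives these distributions their varying support.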

Sparse and Continuous Attention Mechanisms

2 code implementations · NeurIPS 2020 · André F. T. Martins, António Farinhas, Marcos Treviso, Vlad Niculae, Pedro M. Q. Aguiar, Mário A. T. Figueiredo

Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation).

Tasks: Machine Translation, Question Answering, +4 more
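
As a minimal illustration of the parenthetical above: softmax maps unconstrained scores to a categorical distribution (the values below are arbitrary).

```python
# Minimal sketch: the softmax transformation turns real-valued scores into
# a categorical distribution, a member of the exponential family.
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())      # shift by the max for numerical stability
    return e / e.sum()

probs = softmax(np.array([2.0, 1.0, -1.0]))
print(probs, probs.sum())        # nonnegative entries that sum to 1
```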

Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks

3 code implementations · WS 2017 · Nathan Hartmann, Erick Fonseca, Christopher Shulby, Marcos Treviso, Jessica Rodrigues, Sandra Aluisio

Word embeddings have been found to provide meaningful representations for words in an efficient way; therefore, they have become common in Natural Language Processing systems.

Tasks: POS, POS Tagging, +4 more
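
A hedged sketch of the word-analogy evaluation mentioned in the title, using gensim's KeyedVectors; the embedding file path and the Portuguese analogy below are placeholders, not artifacts from the paper.

```python
# Sketch of a word-analogy query over pretrained embeddings using gensim.
# "embeddings_pt.txt" is a hypothetical word2vec-format file of Portuguese
# vectors; the analogy is illustrative.
from gensim.models import KeyedVectors

wv = KeyedVectors.load_word2vec_format("embeddings_pt.txt")

# Analogy: rei (king) - homem (man) + mulher (woman) ~ rainha (queen)
print(wv.most_similar(positive=["rei", "mulher"], negative=["homem"], topn=1))
```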
