Search Results for author: Yvette Graham

Found 52 papers, 8 papers with code

Findings of the 2021 Conference on Machine Translation (WMT21)

no code implementations • WMT (EMNLP) 2021 • Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondřej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-Jussa, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri

This paper presents the results of the news translation task, the multilingual low-resource translation for Indo-European languages, the triangular translation task, and the automatic post-editing task organised as part of the Conference on Machine Translation (WMT) 2021. In the news task, participants were asked to build machine translation systems for any of 10 language pairs, to be evaluated on test sets consisting mainly of news stories.

Machine Translation Translation

The Third Multilingual Surface Realisation Shared Task (SR’20): Overview and Evaluation Results

1 code implementation • MSR (COLING) 2020 • Simon Mille, Anya Belz, Bernd Bohnet, Thiago Castro Ferreira, Yvette Graham, Leo Wanner

As in SR’18 and SR’19, the shared task comprised two tracks: (1) a Shallow Track where the inputs were full UD structures with word order information removed and tokens lemmatised; and (2) a Deep Track where additionally, functional words and morphological information were removed.

Statistical Power and Translationese in Machine Translation Evaluation

no code implementations • EMNLP 2020 • Yvette Graham, Barry Haddow, Philipp Koehn

In addition, we provide a re-evaluation of a past machine translation evaluation claiming human-parity of MT.

Machine Translation Translation

ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models

no code implementations • 24 Mar 2024 • Zequan Liu, Jiawen Lyn, Wei Zhu, Xing Tian, Yvette Graham

Parameter-efficient fine-tuning (PEFT) is widely studied for its effectiveness and efficiency in the era of large language models.

Findings of the First Workshop on Simulating Conversational Intelligence in Chat

no code implementations • 9 Feb 2024 • Yvette Graham, Mohammed Rameez Qureshi, Haider Khalid, Gerasimos Lampouras, Ignacio Iacobacci, Qun Liu

The aim of this workshop is to bring together experts working on open-domain dialogue research.

Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs

no code implementations • 6 Nov 2023 • Longyue Wang, Zhaopeng Tu, Yan Gu, Siyou Liu, Dian Yu, Qingsong Ma, Chenyang Lyu, Liting Zhou, Chao-Hong Liu, Yufeng Ma, WeiYu Chen, Yvette Graham, Bonnie Webber, Philipp Koehn, Andy Way, Yulin Yuan, Shuming Shi

To foster progress in this domain, we hold a new shared task at WMT 2023, the first edition of the Discourse-Level Literary Translation.

Machine Translation Translation

Do Stochastic Parrots have Feelings Too? Improving Neural Detection of Synthetic Text via Emotion Recognition

1 code implementation • 24 Oct 2023 • Alan Cowap, Yvette Graham, Jennifer Foster

Recent developments in generative AI have shone a spotlight on high-performance synthetic text generation technologies.

Emotion Recognition Text Generation

An overview on the evaluated video retrieval tasks at TRECVID 2022

no code implementations • 22 Jun 2023 • George Awad, Keith Curtis, Asad Butt, Jonathan Fiscus, Afzal Godil, Yooyoung Lee, Andrew Delgado, Eliot Godard, Lukas Diduch, Jeffrey Liu, Yvette Graham, Georges Quenot

The TREC Video Retrieval Evaluation (TRECVID) is a TREC-style video analysis and retrieval evaluation with the goal of promoting progress in research and development of content-based exploitation and retrieval of information from digital video via open, tasks-based evaluation supported by metrology.

Ad-hoc video search Retrieval +2

Is a Video worth $n\times n$ Images? A Highly Efficient Approach to Transformer-based Video Question Answering

no code implementations • 16 May 2023 • Chenyang Lyu, Tianbo Ji, Yvette Graham, Jennifer Foster

We show that by integrating our approach into VideoQA systems we can achieve comparable, even superior, performance with a significant speed up for training and inference.

Question Answering Video Question Answering

Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering

no code implementations • 14 May 2023 • Chenyang Lyu, Tianbo Ji, Yvette Graham, Jennifer Foster

Specifically, we explicitly use the Semantic Role Labeling (SRL) structure of the question in the dynamic reasoning process where we decide to move to the next frame based on which part of the SRL structure (agent, verb, patient, etc.)

Question Answering Semantic Role Labeling +1

Exploiting Rich Textual User-Product Context for Improving Sentiment Analysis

no code implementations • 17 Dec 2022 • Chenyang Lyu, Linyi Yang, Yue Zhang, Yvette Graham, Jennifer Foster

User and product information associated with a review is useful for sentiment polarity prediction.

Sentiment Analysis

QAScore -- An Unsupervised Unreferenced Metric for the Question Generation Evaluation

no code implementations • 9 Oct 2022 • Tianbo Ji, Chenyang Lyu, Gareth Jones, Liting Zhou, Yvette Graham

Question Generation (QG) aims to automate the task of composing questions for a passage with a set of chosen answers found within the passage.

Language Modelling Question Generation +1

Extending the Scope of Out-of-Domain: Examining QA models in multiple subdomains

1 code implementation • insights (ACL) 2022 • Chenyang Lyu, Jennifer Foster, Yvette Graham

Past works that investigate out-of-domain performance of QA systems have mainly focused on general domains (e.g., news domain, Wikipedia domain), underestimating the importance of subdomains defined by the internal characteristics of QA datasets.

Achieving Reliable Human Assessment of Open-Domain Dialogue Systems

1 code implementation • ACL 2022 • Tianbo Ji, Yvette Graham, Gareth J. F. Jones, Chenyang Lyu, Qun Liu

Answering the distress call of competitions that have emphasized the urgent need for better evaluation techniques in dialogue, we present the successful development of human evaluation that is highly reliable while still remaining feasible and low cost.

Dialogue Evaluation

BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

1 code implementation • LREC 2022 • Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

The model is based on BERT, which is a language model that has been shown to work well in multiple NLP tasks.

Language Modelling Video Captioning

Improving Unsupervised Question Answering via Summarization-Informed Question Generation

no code implementations • EMNLP 2021 • Chenyang Lyu, Lifeng Shang, Yvette Graham, Jennifer Foster, Xin Jiang, Qun Liu

Template-based QG uses linguistically-informed heuristics to transform declarative sentences into interrogatives, whereas supervised QG uses existing Question Answering (QA) datasets to train a system to generate a question given a passage and an answer.

Dependency Parsing named-entity-recognition +8

TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains

no code implementations • 27 Apr 2021 • George Awad, Asad A. Butt, Keith Curtis, Jonathan Fiscus, Afzal Godil, Yooyoung Lee, Andrew Delgado, Jesse Zhang, Eliot Godard, Baptiste Chocot, Lukas Diduch, Jeffrey Liu, Alan F. Smeaton, Yvette Graham, Gareth J. F. Jones, Wessel Kraaij, Georges Quenot

In total, 29 teams from various research organizations worldwide completed one or more of the following six tasks: 1.

Ad-hoc video search Instance Search +3

Findings of the 2020 Conference on Machine Translation (WMT20)

no code implementations • EMNLP 2020 • Loïc Barrault, Magdalena Biesialska, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubešić, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri

In the news task, participants were asked to build machine translation systems for any of 11 language pairs, to be evaluated on test sets consisting mainly of news stories.

Machine Translation Translation

Improving Document-Level Sentiment Analysis with User and Product Context

1 code implementation • COLING 2020 • Chenyang Lyu, Jennifer Foster, Yvette Graham

We achieve this by explicitly storing representations of reviews written by the same user and about the same product and force the model to memorize all reviews for one particular user and product.

Sentiment Analysis

Assessing Human-Parity in Machine Translation on the Segment Level

no code implementations • Findings of the Association for Computational Linguistics 2020 • Yvette Graham, Christian Federmann, Maria Eskevich, Barry Haddow

Recent machine translation shared tasks have shown top-performing systems to tie or in some cases even outperform human translation.

Machine Translation Translation

TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval

no code implementations • 21 Sep 2020 • George Awad, Asad A. Butt, Keith Curtis, Yooyoung Lee, Jonathan Fiscus, Afzal Godil, Andrew Delgado, Jesse Zhang, Eliot Godard, Lukas Diduch, Alan F. Smeaton, Yvette Graham, Wessel Kraaij, Georges Quenot

The TREC Video Retrieval Evaluation (TRECVID) 2019 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in research and development of content-based exploitation and retrieval of information from digital video via open, metrics-based evaluation.

Action Detection Activity Detection +5

The Second Multilingual Surface Realisation Shared Task (SR'19): Overview and Evaluation Results

no code implementations • WS 2019 • Simon Mille, Anja Belz, Bernd Bohnet, Yvette Graham, Leo Wanner

We report results from the SR'19 Shared Task, the second edition of a multilingual surface realisation task organised as part of the EMNLP'19 Workshop on Multilingual Surface Realisation.

Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges

no code implementations • WS 2019 • Qingsong Ma, Johnny Wei, Ondřej Bojar, Yvette Graham

This paper presents the results of the WMT19 Metrics Shared Task.

Findings of the 2019 Conference on Machine Translation (WMT19)

no code implementations • WS 2019 • Loïc Barrault, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri

This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2019.

Machine Translation Translation

Translationese in Machine Translation Evaluation

no code implementations • 24 Jun 2019 • Yvette Graham, Barry Haddow, Philipp Koehn

Finally, we provide a comprehensive check-list for future machine translation evaluation.

Machine Translation Translation

Findings of the 2018 Conference on Machine Translation (WMT18)

no code implementations • WS 2018 • Ondřej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Philipp Koehn, Christof Monz

This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2018.

Automatic Post-Editing Multimodal Machine Translation +1

Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance

no code implementations • WS 2018 • Qingsong Ma, Ondřej Bojar, Yvette Graham

We asked participants of this task to score the outputs of the MT systems involved in the WMT18 News Translation Task with automatic metrics.

Machine Translation Sentence +1

Proceedings of the Third Conference on Machine Translation: Shared Task Papers

no code implementations • EMNLP 2018 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor

Machine Translation Translation

The First Multilingual Surface Realisation Shared Task (SR'18): Overview and Evaluation Results

no code implementations • WS 2018 • Simon Mille, Anja Belz, Bernd Bohnet, Yvette Graham, Emily Pitler, Leo Wanner

We report results from the SR'18 Shared Task, a new multilingual surface realisation task organised as part of the ACL'18 Workshop on Multilingual Surface Realisation.

The First Multilingual Surface Realisation Shared Task (SR’18): Overview and Evaluation Results

no code implementations • WS 2018 • Simon Mille, Anja Belz, Bernd Bohnet, Yvette Graham, Emily Pitler, Leo Wanner

Question Answering Text Generation

Translating Pro-Drop Languages with Reconstruction Models

1 code implementation • 10 Jan 2018 • Long-Yue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu

Next, the annotated source sentence is reconstructed from hidden representations in the NMT model.

Machine Translation NMT +2

Evaluation of Automatic Video Captioning Using Direct Assessment

no code implementations • 29 Oct 2017 • Yvette Graham, George Awad, Alan Smeaton

We present Direct Assessment, a method for manually assessing the quality of automatically-generated captions for video.

Caption Generation Machine Translation +2

Blend: a Novel Combined MT Metric Based on Direct Assessment --- CASICT-DCU submission to WMT17 Metrics Task

no code implementations • WS 2017 • Qingsong Ma, Yvette Graham, Shugen Wang, Qun Liu

Machine Translation

Results of the WMT17 Metrics Shared Task

no code implementations • WS 2017 • Ondřej Bojar, Yvette Graham, Amir Kamran

Machine Translation

Findings of the 2017 Conference on Machine Translation (WMT17)

no code implementations • WS 2017 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shu-Jian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia, Marco Turchi

Automatic Post-Editing Multimodal Machine Translation +1

Further Investigation into Reference Bias in Monolingual Evaluation of Machine Translation

1 code implementation • EMNLP 2017 • Qingsong Ma, Yvette Graham, Timothy Baldwin, Qun Liu

Monolingual evaluation of Machine Translation (MT) aims to simplify human assessment by requiring assessors to compare the meaning of the MT output with a reference translation, opening up the task to a much larger pool of genuinely qualified evaluators.

Machine Translation Translation

Improving Evaluation of Document-level Machine Translation Quality Estimation

no code implementations • EACL 2017 • Yvette Graham, Qingsong Ma, Timothy Baldwin, Qun Liu, Carla Parra, Carolina Scarton

Meaningful conclusions about the relative performance of NLP systems are only possible if the gold standard employed in a given evaluation is both valid and reliable.

Document Level Machine Translation Machine Translation +2

Is all that Glitters in Machine Translation Quality Estimation really Gold?

no code implementations • COLING 2016 • Yvette Graham, Timothy Baldwin, Meghan Dowling, Maria Eskevich, Teresa Lynn, Lamia Tounsi

Human-targeted metrics provide a compromise between human evaluation of machine translation, where high inter-annotator agreement is difficult to achieve, and fully automatic metrics, such as BLEU or TER, that lack the validity of human assessment.

Machine Translation Translation

Findings of the 2016 Conference on Machine Translation

no code implementations • WS 2016 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri

Automatic Post-Editing Multimodal Machine Translation +1

Results of the WMT16 Metrics Shared Task

no code implementations • WS 2016 • Ondřej Bojar, Yvette Graham, Amir Kamran, Miloš Stanojević

Machine Translation

Achieving Accurate Conclusions in Evaluation of Automatic Machine Translation Metrics

no code implementations • NAACL 2016 • Yvette Graham, Qun Liu

Machine Translation Translation

Re-evaluating Automatic Summarization with BLEU and 192 Shades of ROUGE

no code implementations • EMNLP 2015 • Yvette Graham

Machine Translation

Improving Evaluation of Machine Translation Quality Estimation

no code implementations • IJCNLP 2015 • Yvette Graham

Machine Translation Translation

Accurate Evaluation of Segment-level Machine Translation Metrics

no code implementations • HLT 2015 • Timothy Baldwin, Yvette Graham, Nitika Mathur

Machine Translation Translation

Testing for Significance of Increased Correlation with Human Judgment

no code implementations • EMNLP 2014 • Yvette Graham, Timothy Baldwin

Machine Translation

Randomized Significance Tests in Machine Translation

no code implementations • WS 2014 • Yvette Graham, Nitika Mathur, Timothy Baldwin

Machine Translation Translation

Is Machine Translation Getting Better over Time?

no code implementations • EACL 2014 • Yvette Graham, Timothy Baldwin, Alistair Moffat, Justin Zobel

Machine Translation Translation

Crowd-Sourcing of Human Judgments of Machine Translation Fluency

no code implementations • ALTA 2013 • Yvette Graham, Timothy Baldwin, Alistair Moffat, Justin Zobel

Machine Translation Translation

A Dependency-Constrained Hierarchical Model with Moses

no code implementations • WS 2013 • Yvette Graham

Machine Translation

Continuous Measurement Scales in Human Evaluation of Machine Translation

no code implementations • WS 2013 • Yvette Graham, Timothy Baldwin, Alistair Moffat, Justin Zobel

Machine Translation Text Generation +1

Umelb: Cross-lingual Textual Entailment with Word Alignment and String Similarity Features

no code implementations • SEMEVAL 2013 • Yvette Graham, Bahar Salehi, Timothy Baldwin

Natural Language Inference Word Alignment

Measurement of Progress in Machine Translation

no code implementations • ALTA 2012 • Yvette Graham, Timothy Baldwin, Aaron Harwood, Alistair Moffat, Justin Zobel

Machine Translation Translation
