Search Results for author: Vassilis Papavassiliou

Found 12 papers, 0 papers with code

SciPar: A Collection of Parallel Corpora from Scientific Abstracts

no code implementations LREC 2022 Dimitrios Roussis, Vassilis Papavassiliou, Prokopis Prokopidis, Stelios Piperidis, Vassilis Katsouros

This paper presents SciPar, a new collection of parallel corpora created from openly available metadata of bachelor theses, master theses and doctoral dissertations hosted in institutional repositories, digital libraries of universities and national archives.

Machine Translation Sentence +1

Constructing Parallel Corpora from COVID-19 News using MediSys Metadata

no code implementations LREC 2022 Dimitrios Roussis, Vassilis Papavassiliou, Sokratis Sofianopoulos, Prokopis Prokopidis, Stelios Piperidis

This paper presents a collection of parallel corpora generated by exploiting the COVID-19 related dataset of metadata created with the Europe Media Monitor (EMM) / Medical Information System (MediSys) processing chain of news articles.

Machine Translation Translation

Findings of the Covid-19 MLIA Machine Translation Task

no code implementations14 Nov 2022 Francisco Casacuberta, Alexandru Ceausu, Khalid Choukri, Miltos Deligiannis, Miguel Domingo, Mercedes García-Martínez, Manuel Herranz, Guillaume Jacquet, Vassilis Papavassiliou, Stelios Piperidis, Prokopis Prokopidis, Dimitris Roussis, Marwa Hadj Salah

This work presents the results of the machine translation (MT) task from the Covid-19 MLIA @ Eval initiative, a community effort to improve the generation of MT systems focused on the current Covid-19 crisis.

Machine Translation Transfer Learning +1

The ILSP/ARC submission to the WMT 2018 Parallel Corpus Filtering Shared Task

no code implementations WS 2018 Vassilis Papavassiliou, Sokratis Sofianopoulos, Prokopis Prokopidis, Stelios Piperidis

We also discuss alternative methods for ranking the sentence pairs of the most appropriate clusters with the aim of generating the two datasets (of 10 and 100 million words as required in the task) that were evaluated.

Clustering Language Modelling +3

Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories

no code implementations LREC 2016 Prokopis Prokopidis, Vassilis Papavassiliou, Stelios Piperidis

We present a new collection of multilingual corpora automatically created from the content available in the Global Voices websites, where volunteers have been posting and translating citizen media stories since 2004.

Cannot find the paper you are looking for? You can Submit a new open access paper.