1 code implementation • 4 Apr 2024 • Botond Barta, Dorina Lakatos, Attila Nagy, Milán Konor Nyist, Judit Ács
To address this gap our paper introduces HunSum-2 an open-source Hungarian corpus suitable for training abstractive and extractive summarization models.
1 code implementation • 4 Nov 2023 • Attila Nagy, Dorina Lakatos, Botond Barta, Judit Ács
Data augmentation methods for neural machine translation are particularly useful when limited amount of training data is available, which is often the case when dealing with low-resource languages.
2 code implementations • 13 Jul 2023 • Attila Nagy, Dorina Petra Lakatos, Botond Barta, Patrick Nanys, Judit Ács
We present a generic framework for data augmentation via dependency subtree swapping that is applicable to machine translation.
1 code implementation • 1 Feb 2023 • Botond Barta, Dorina Lakatos, Attila Nagy, Milán Konor Nyist, Judit Ács
We introduce HunSum-1: a dataset for Hungarian abstractive summarization, consisting of 1. 14M news articles.
2 code implementations • 18 Jan 2022 • Attila Nagy, Patrick Nanys, Balázs Frey Konrád, Bence Bial, Judit Ács
We train Transformer-based neural machine translation models for Hungarian-English and English-Hungarian using the Hunglish2 corpus.
no code implementations • 7 Nov 2021 • Attila Nagy
I start my thesis with a detailed literature review on neural networks, sequential modeling, neural machine translation, dependency parsing and data augmentation.
1 code implementation • 13 Oct 2021 • Attila Nagy, Ábel Boros
In our research, we propose to modify this to a more modern and complex algorithm, PPO, which has demonstrated to be faster and more stable in other environments.
1 code implementation • 18 Jan 2021 • Attila Nagy, Bence Bial, Judit Ács
We present an approach for automatic punctuation restoration with BERT models for English and Hungarian.