Search Results for author: Alena Fenogenova

Found 16 papers, 9 papers with code

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

no code implementations22 Aug 2024 Artem Snegirev, Maria Tikhonova, Anna Maksimova, Alena Fenogenova, Alexander Abramov

Embedding models play a crucial role in Natural Language Processing (NLP) by creating text embeddings used in various tasks such as information retrieval and assessing semantic text similarity.

Information Retrieval Retrieval +4

MERA: A Comprehensive LLM Evaluation in Russian

no code implementations9 Jan 2024 Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova, Albina Akhmetgareeva, Anton Emelyanov, Denis Shevelev, Pavel Lebedev, Leonid Sinev, Ulyana Isaeva, Katerina Kolomeytseva, Daniil Moskovskiy, Elizaveta Goncharova, Nikita Savushkin, Polina Mikhailova, Denis Dimitrov, Alexander Panchenko, Sergei Markov

To address these issues, we introduce an open Multimodal Evaluation of Russian-language Architectures (MERA), a new instruction benchmark for evaluating foundation models oriented towards the Russian language.

A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages

2 code implementations18 Aug 2023 Nikita Martynov, Mark Baushenko, Anastasia Kozlova, Katerina Kolomeytseva, Aleksandr Abramov, Alena Fenogenova

Our research mainly focuses on exploring natural spelling errors and mistypings in texts and studying the ways those errors can be emulated in correct sentences to effectively enrich generative models' pre-train procedure.

Spelling Correction

mGPT: Few-Shot Learners Go Multilingual

1 code implementation15 Apr 2022 Oleh Shliazhko, Alena Fenogenova, Maria Tikhonova, Vladislav Mikhailov, Anastasia Kozlova, Tatiana Shavrina

Recent studies report that autoregressive language models can successfully solve many NLP tasks via zero- and few-shot learning paradigms, which opens up new possibilities for using the pre-trained language models.

Cross-Lingual Natural Language Inference Cross-Lingual Paraphrase Identification +5

Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models

no code implementations15 Feb 2022 Alena Fenogenova, Maria Tikhonova, Vladislav Mikhailov, Tatiana Shavrina, Anton Emelyanov, Denis Shevelev, Alexandr Kukushkin, Valentin Malykh, Ekaterina Artemova

In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks.

Common Sense Reasoning Reading Comprehension

Russian Paraphrasers: Paraphrase with Transformers

2 code implementations BSNLP 2021 Alena Fenogenova

This paper studies the generation methods for paraphrasing in the Russian language.

Read and Reason with MuSeRC and RuCoS: Datasets for Machine Reading Comprehension for Russian

no code implementations COLING 2020 Alena Fenogenova, Vladislav Mikhailov, Denis Shevelev

The paper introduces two Russian machine reading comprehension (MRC) datasets, called MuSeRC and RuCoS, which require reasoning over multiple sentences and commonsense knowledge to infer the answer.

Machine Reading Comprehension

DaNetQA: a yes/no Question Answering Dataset for the Russian Language

no code implementations6 Oct 2020 Taisia Glushkova, Alexey Machnev, Alena Fenogenova, Tatiana Shavrina, Ekaterina Artemova, Dmitry I. Ignatov

The task is to take both the question and a paragraph as input and come up with a yes/no answer, i. e. to produce a binary output.

Question Answering Sentence +2

Humans Keep It One Hundred: an Overview of AI Journey

1 code implementation LREC 2020 Tatiana Shavrina, Anton Emelyanov, Alena Fenogenova, Vadim Fomin, Vladislav Mikhailov, Andrey Evlampiev, Valentin Malykh, Vladimir Larin, Alex Natekin, Aleks Vatulin, R, Peter Romov, Daniil Anastasiev, Nikolai Zinov, Andrey Chertok

Artificial General Intelligence (AGI) is showing growing performance in numerous applications - beating human performance in Chess and Go, using knowledge bases and text sources to answer questions (SQuAD) and even pass human examination (Aristo project).

Text Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.