Search Results for author: Yuri Kuratov

Found 11 papers, 9 papers with code

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

2 code implementations16 Feb 2024 Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

This paper addresses the challenge of processing long documents using generative transformer models.

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

1 code implementation2 Nov 2023 Alla Chepurova, Aydar Bulatov, Yuri Kuratov, Mikhail Burtsev

In this study, we propose to include node neighborhoods as additional information to improve KGC methods based on language models.

Imputation World Knowledge

Scaling Transformer to 1M tokens and beyond with RMT

3 code implementations19 Apr 2023 Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev

A major limitation for the broader scope of problems solvable by transformers is the quadratic scaling of computational complexity with input size.

Language Modelling Natural Language Understanding +1

Recurrent Memory Transformer

3 code implementations14 Jul 2022 Aydar Bulatov, Yuri Kuratov, Mikhail S. Burtsev

We implement a memory mechanism with no changes to Transformer model by adding special memory tokens to the input or output sequence.

Language Modelling

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

1 code implementation4 May 2022 Alina Kolesnikova, Yuri Kuratov, Vasily Konovalov, Mikhail Burtsev

We propose two simple yet effective alignment techniques to make knowledge distillation to the students with reduced vocabulary.

Knowledge Distillation

Memory Transformer

1 code implementation20 Jun 2020 Mikhail S. Burtsev, Yuri Kuratov, Anton Peganov, Grigory V. Sapunov

Adding trainable memory to selectively store local as well as global representations of a sequence is a promising direction to improve the Transformer model.

Language Modelling Machine Translation +4

Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

no code implementations5 Feb 2020 Pavel Gulyaev, Eugenia Elistratova, Vasily Konovalov, Yuri Kuratov, Leonid Pugachev, Mikhail Burtsev

The organizers introduced the Schema-Guided Dialogue (SGD) dataset with multi-domain conversations and released a zero-shot dialogue state tracking model.

Dialogue State Tracking Question Answering +1

Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language

2 code implementations17 May 2019 Yuri Kuratov, Mikhail Arkhipov

This work shows that transfer learning from a multilingual model to monolingual model results in significant growth of performance on such tasks as reading comprehension, paraphrase detection, and sentiment analysis.

 Ranked #1 on Question Answering on SQuAD1.1 (Hardware Burden metric)

Natural Language Inference Paraphrase Identification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.