Search Results for author: Yuri Kuratov

Found 8 papers, 6 papers with code

Recurrent Memory Transformer

1 code implementation · 14 Jul 2022 · Aydar Bulatov, Yuri Kuratov, Mikhail S. Burtsev

We implement a memory mechanism with no changes to the Transformer model by adding special memory tokens to the input or output sequence.

Language Modelling

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

1 code implementation · 4 May 2022 · Alina Kolesnikova, Yuri Kuratov, Vasily Konovalov, Mikhail Burtsev

We propose two simple yet effective alignment techniques that enable knowledge distillation to students with a reduced vocabulary.

Knowledge Distillation

Memory Transformer

1 code implementation · 20 Jun 2020 · Mikhail S. Burtsev, Yuri Kuratov, Anton Peganov, Grigory V. Sapunov

Adding trainable memory to selectively store local as well as global representations of a sequence is a promising direction to improve the Transformer model.

Language Modelling, Machine Translation, +4 more

Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

no code implementations · 5 Feb 2020 · Pavel Gulyaev, Eugenia Elistratova, Vasily Konovalov, Yuri Kuratov, Leonid Pugachev, Mikhail Burtsev

The organizers of the DSTC8 Schema-Guided State Tracking track introduced the Schema-Guided Dialogue (SGD) dataset with multi-domain conversations and released a zero-shot dialogue state tracking model.

Dialogue State Tracking, Question Answering, +1 more

Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language

1 code implementation · 17 May 2019 · Yuri Kuratov, Mikhail Arkhipov

This work shows that transfer learning from a multilingual model to a monolingual model yields a significant performance gain on tasks such as reading comprehension, paraphrase detection, and sentiment analysis.

Ranked #1 on Question Answering on SQuAD1.1 (Hardware Burden metric)

Natural Language Inference, Paraphrase Identification, +4 more
