Search Results for author: Mikhail Burtsev

Found 24 papers, 15 papers with code

Discourse-Driven Integrated Dialogue Development Environment for Open-Domain Dialogue Systems

no code implementations • CODI 2021 • Denis Kuznetsov, Dmitry Evseev, Lidia Ostyakova, Oleg Serikov, Daniel Kornev, Mikhail Burtsev

Development environments for spoken dialogue systems are popular today because they enable rapid creation of the dialogue systems in times when usage of the voice AI Assistants is constantly growing.

Spoken Dialogue Systems

Paper
Add Code

Uncertainty Estimation of Transformer Predictions for Misclassification Detection

1 code implementation • ACL 2022 • Artem Vazhentsev, Gleb Kuzmin, Artem Shelmanov, Akim Tsvigun, Evgenii Tsymbalov, Kirill Fedyanin, Maxim Panov, Alexander Panchenko, Gleb Gusev, Mikhail Burtsev, Manvel Avetisian, Leonid Zhukov

Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification detection, adversarial attack detection, out-of-distribution detection, etc.

Active Learning Adversarial Attack Detection +7

Paper
Code

Attention Understands Semantic Relations

no code implementations • LREC 2022 • Anastasia Chizhikova, Sanzhar Murzakhmetov, Oleg Serikov, Tatiana Shavrina, Mikhail Burtsev

Today, natural language processing heavily relies on pre-trained large language models.

Paper
Add Code

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

2 code implementations • 16 Feb 2024 • Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

This paper addresses the challenge of processing long documents using generative transformer models.

741

Paper
Code

Uncertainty Guided Global Memory Improves Multi-Hop Question Answering

1 code implementation • 29 Nov 2023 • Alsu Sagirova, Mikhail Burtsev

Conversely, the second group relies on the attention mechanism of the long input encoding model to facilitate multi-hop reasoning.

Multi-hop Question Answering Question Answering

Paper
Code

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

1 code implementation • 2 Nov 2023 • Alla Chepurova, Aydar Bulatov, Yuri Kuratov, Mikhail Burtsev

In this study, we propose to include node neighborhoods as additional information to improve KGC methods based on language models.

Imputation World Knowledge

Paper
Code

Monolingual and Cross-Lingual Knowledge Transfer for Topic Classification

no code implementations • 13 Jun 2023 • Dmitry Karpov, Mikhail Burtsev

This article investigates the knowledge transfer from the RuQTopics dataset.

Classification Topic Classification +1

Paper
Add Code

Active Learning for Abstractive Text Summarization

1 code implementation • 9 Jan 2023 • Akim Tsvigun, Ivan Lysenko, Danila Sedashov, Ivan Lazichny, Eldar Damirov, Vladimir Karlov, Artemy Belousov, Leonid Sanochkin, Maxim Panov, Alexander Panchenko, Mikhail Burtsev, Artem Shelmanov

Active Learning (AL) is a technique developed to reduce the amount of annotation required to achieve a certain level of machine learning model performance.

Abstractive Text Summarization Active Learning +3

Paper
Code

Collecting Interactive Multi-modal Datasets for Grounded Language Understanding

2 code implementations • 12 Nov 2022 • Shrestha Mohanty, Negar Arabzadeh, Milagro Teruel, Yuxuan Sun, Artem Zholus, Alexey Skrynnik, Mikhail Burtsev, Kavya Srinet, Aleksandr Panov, Arthur Szlam, Marc-Alexandre Côté, Julia Kiseleva

Human intelligence can remarkably adapt quickly to new tasks and environments.

Task 2

Paper
Code

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

1 code implementation • 1 Nov 2022 • Alexey Skrynnik, Zoya Volovikova, Marc-Alexandre Côté, Anton Voronov, Artem Zholus, Negar Arabzadeh, Shrestha Mohanty, Milagro Teruel, Ahmed Awadallah, Aleksandr Panov, Mikhail Burtsev, Julia Kiseleva

The adoption of pre-trained language models to generate action plans for embodied agents is a promising research strategy.

Language Modelling reinforcement-learning +1

Paper
Code

Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes

1 code implementation • 27 Jul 2022 • Artyom Sorokin, Nazar Buzun, Leonid Pugachev, Mikhail Burtsev

This requires to store prohibitively large intermediate data if a sequence consists of thousands or even millions elements, and as a result, makes learning of very long-term dependencies infeasible.

Paper
Code

IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022

1 code implementation • 27 May 2022 • Julia Kiseleva, Alexey Skrynnik, Artem Zholus, Shrestha Mohanty, Negar Arabzadeh, Marc-Alexandre Côté, Mohammad Aliannejadi, Milagro Teruel, Ziming Li, Mikhail Burtsev, Maartje ter Hoeve, Zoya Volovikova, Aleksandr Panov, Yuxuan Sun, Kavya Srinet, Arthur Szlam, Ahmed Awadallah

Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions.

Natural Language Understanding Reinforcement Learning (RL)

Paper
Code

Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021

no code implementations • 5 May 2022 • Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Marc-Alexandre Côté, Katja Hofmann, Ahmed Awadallah, Linar Abdrazakov, Igor Churin, Putra Manggala, Kata Naszadi, Michiel van der Meer, Taewoon Kim

The primary goal of the competition is to approach the problem of how to build interactive agents that learn to solve a task while provided with grounded natural language instructions in a collaborative environment.

Paper
Add Code

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

1 code implementation • 4 May 2022 • Alina Kolesnikova, Yuri Kuratov, Vasily Konovalov, Mikhail Burtsev

We propose two simple yet effective alignment techniques to make knowledge distillation to the students with reduced vocabulary.

Knowledge Distillation

Paper
Code

NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment

no code implementations • 13 Oct 2021 • Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Katja Hofmann, Michel Galley, Ahmed Awadallah

Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions.

Natural Language Understanding Reinforcement Learning (RL)

Paper
Add Code

Building and Evaluating Open-Domain Dialogue Corpora with Clarifying Questions

1 code implementation • EMNLP 2021 • Mohammad Aliannejadi, Julia Kiseleva, Aleksandr Chuklin, Jeffrey Dalton, Mikhail Burtsev

Enabling open-domain dialogue systems to ask clarifying questions when appropriate is an important direction for improving the quality of the system response.

126

Paper
Code

Multi-Stream Transformers

1 code implementation • 21 Jul 2021 • Mikhail Burtsev, Anna Rumshisky

Transformer-based encoder-decoder models produce a fused token-wise representation after every encoder layer.

Paper
Code

Short Text Clustering with Transformers

no code implementations • 31 Jan 2021 • Leonid Pugachev, Mikhail Burtsev

Recent techniques for the task of short text clustering often rely on word embeddings as a transfer learning component.

Ranked #2 on Short Text Clustering on Stackoverflow

Clustering Sentence +3

Paper
Add Code

Memory Representation in Transformer

no code implementations • 1 Jan 2021 • Mikhail Burtsev, Yurii Kuratov, Anton Peganov, Grigory V. Sapunov

Adding trainable memory to selectively store local as well as global representations of a sequence is a promising direction to improve the Transformer model.

Language Modelling Machine Translation +1

Paper
Add Code

ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ)

3 code implementations • 23 Sep 2020 • Mohammad Aliannejadi, Julia Kiseleva, Aleksandr Chuklin, Jeff Dalton, Mikhail Burtsev

The main aim of the conversational systems is to return an appropriate answer in response to the user requests.

126

Paper
Code

Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

no code implementations • 5 Feb 2020 • Pavel Gulyaev, Eugenia Elistratova, Vasily Konovalov, Yuri Kuratov, Leonid Pugachev, Mikhail Burtsev

The organizers introduced the Schema-Guided Dialogue (SGD) dataset with multi-domain conversations and released a zero-shot dialogue state tracking model.

Dialogue State Tracking Question Answering +1

Paper
Add Code

Loss Landscape Sightseeing with Multi-Point Optimization

1 code implementation • 9 Oct 2019 • Ivan Skorokhodov, Mikhail Burtsev

We present multi-point optimization: an optimization technique that allows to train several models simultaneously without the need to keep the parameters of each one individually.

Paper
Code

The Second Conversational Intelligence Challenge (ConvAI2)

2 code implementations • 31 Jan 2019 • Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W. black, Alexander Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev, Jason Weston

We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots.

Paper
Code

DeepPavlov: Open-Source Library for Dialogue Systems

no code implementations • ACL 2018 • Mikhail Burtsev, Alex Seliverstov, er, Rafael Airapetyan, Mikhail Arkhipov, Dilyara Baymurzina, Nickolay Bushkov, Olga Gureenkova, Taras Khakhulin, Yuri Kuratov, Denis Kuznetsov, Alexey Litinsky, Varvara Logacheva, Alexey Lymar, Valentin Malykh, Maxim Petrov, Vadim Polulyakh, Leonid Pugachev, Alexey Sorokin, Maria Vikhreva, Marat Zaynutdinov

It supports modular as well as end-to-end approaches to implementation of conversational agents.

General Classification intent-classification +5

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.