Search Results for author: Sergey Kolesnikov

Found 26 papers, 17 papers with code

In-Context Reinforcement Learning for Variable Action Spaces

1 code implementation • 20 Dec 2023 • Viacheslav Sinii, Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Sergey Kolesnikov

Recently, it has been shown that transformers pre-trained on diverse datasets with multi-episode contexts can generalize to new reinforcement learning tasks in-context.

Tasks: Multi-Armed Bandits, Reinforcement Learning

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

1 code implementation • 19 Dec 2023 • Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Artem Agarkov, Viacheslav Sinii, Sergey Kolesnikov

Inspired by the diversity and depth of XLand and the simplicity and minimalism of MiniGrid, we present XLand-MiniGrid, a suite of tools and grid-world environments for meta-reinforcement learning research.

Tasks: Meta-Learning, Meta Reinforcement Learning (+1 more)

Emergence of In-Context Reinforcement Learning from Noise Distillation

1 code implementation • 19 Dec 2023 • Ilya Zisman, Vladislav Kurenkov, Alexander Nikulin, Viacheslav Sinii, Sergey Kolesnikov

Recently, extensive studies in Reinforcement Learning have been carried out on the ability of transformers to adapt in-context to various environments and tasks.

Tasks: Reinforcement Learning

Unveiling Empirical Pathologies of Laplace Approximation for Uncertainty Estimation

no code implementations • 16 Dec 2023 • Maksim Zhdanov, Stanislav Dereka, Sergey Kolesnikov

In this paper, we critically evaluate Bayesian methods for uncertainty estimation in deep learning, focusing on the widely applied Laplace approximation and its variants.

Tasks: Out-of-Distribution (OOD) Detection

Wild-Tab: A Benchmark For Out-Of-Distribution Generalization In Tabular Regression

no code implementations • 4 Dec 2023 • Sergey Kolesnikov

Out-of-Distribution (OOD) generalization, a cornerstone of building robust machine learning models that can handle data diverging from the training set's distribution, remains an ongoing challenge in deep learning.

Tasks: Out-of-Distribution Generalization, Regression (+1 more)

RecBaselines2023: a new dataset for choosing baselines for recommender models

no code implementations • 25 Jun 2023 • Veronika Ivanova, Oleg Lashinin, Marina Ananyeva, Sergey Kolesnikov

To solve this problem, we have collected and published a dataset containing information about the recommender models used in 903 papers, both as baselines and as proposed approaches.

Tasks: Collaborative Filtering, Descriptive

Katakomba: Tools and Benchmarks for Data-Driven NetHack

1 code implementation • NeurIPS 2023 • Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov

NetHack is known as the frontier of reinforcement learning research where learning-based methods still need to catch up to rule-based solutions.

Tasks: D4RL, NetHack (+2 more)

Revisiting the Minimalist Approach to Offline Reinforcement Learning

1 code implementation • NeurIPS 2023 • Denis Tarasov, Vladislav Kurenkov, Alexander Nikulin, Sergey Kolesnikov

Recent years have witnessed significant advancements in offline reinforcement learning (RL), resulting in the development of numerous algorithms with varying degrees of complexity.

Tasks: D4RL, Offline RL (+2 more)

Anti-Exploration by Random Network Distillation

3 code implementations • 31 Jan 2023 • Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov

Despite the success of Random Network Distillation (RND) in various domains, it was shown as not discriminative enough to be used as an uncertainty estimator for penalizing out-of-distribution actions in offline reinforcement learning.

Tasks: D4RL
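
For context, the RND recipe this abstract refers to can be sketched in toy form: a predictor is trained to imitate a fixed random target function on in-distribution states, and the leftover prediction error serves as a novelty score that penalizes out-of-distribution actions. The tanh target, the linear predictor, and all hyperparameters below are illustrative stand-ins, not the paper's actual networks.

```python
import math

# Stand-in for a fixed, randomly initialized target network.
def target(s):
    return math.tanh(3.0 * s)

# Linear predictor trained (full-batch gradient descent) to imitate the
# target on in-distribution states only.
class Predictor:
    def __init__(self):
        self.w, self.b = 0.0, 0.0

    def __call__(self, s):
        return self.w * s + self.b

    def fit(self, states, lr=0.1, epochs=500):
        n = len(states)
        for _ in range(epochs):
            gw = gb = 0.0
            for s in states:
                err = self(s) - target(s)
                gw += 2.0 * err * s / n
                gb += 2.0 * err / n
            self.w -= lr * gw
            self.b -= lr * gb

def bonus(pred, s):
    """Anti-exploration penalty: squared prediction error of the predictor."""
    return (pred(s) - target(s)) ** 2

# In-distribution states lie in [-1, 1]; the predictor fits them well, so the
# bonus stays small there and grows for far out-of-distribution states.
train_states = [i / 10.0 for i in range(-10, 11)]
pred = Predictor()
pred.fit(train_states)
in_dist = sum(bonus(pred, s) for s in train_states) / len(train_states)
ood = bonus(pred, 5.0)
```

In an offline RL setting this error would be subtracted from the critic's value estimate for proposed actions, discouraging the policy from leaving the dataset's support.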

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

2 code implementations • 20 Nov 2022 • Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Dmitry Akimov, Sergey Kolesnikov

Training large neural networks is known to be time-consuming, with the learning duration taking days or even weeks.

Tasks: Offline RL

Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows

2 code implementations • 20 Nov 2022 • Dmitriy Akimov, Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov

This Normalizing Flows action encoder is pre-trained in a supervised manner on the offline dataset, and then an additional policy model - controller in the latent space - is trained via reinforcement learning.

Tasks: Offline RL, Reinforcement Learning (+1 more)
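
The controller-in-latent-space idea described above can be sketched minimally: a pre-trained action decoder maps unconstrained latent codes to dataset-like actions, and the RL policy only ever emits latent codes, so every executed action stays inside the decoder's range. The tanh decoder, the action range [-0.5, 0.5], and the toy linear controller below are hypothetical stand-ins for the paper's Normalizing Flows encoder and learned policy.

```python
import math

# Stand-in for a pre-trained Normalizing Flows action decoder: maps an
# unconstrained latent code to an action inside the range the offline
# dataset covers (here [-0.5, 0.5], an illustrative choice).
def decoder(z):
    return 0.5 * math.tanh(z)

# Latent-space controller (the part trained with RL): it outputs latent
# codes, never raw actions.
def controller(state):
    return 2.0 * state  # toy linear policy in latent space

def act(state):
    # The executed action is always decoder(policy(state)), so it cannot
    # leave the decoder's (dataset-shaped) action range.
    return decoder(controller(state))

actions = [act(s) for s in (-3.0, -0.1, 0.0, 0.1, 3.0)]
```

Even for extreme states, the composed action remains bounded, which is the conservatism the latent-space construction buys.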

CORL: Research-oriented Deep Offline Reinforcement Learning Library

3 code implementations • NeurIPS 2023 • Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov

CORL is an open-source library that provides thoroughly benchmarked single-file implementations of both deep offline and offline-to-online reinforcement learning algorithms.

Tasks: Benchmarking, D4RL (+1 more)

EXACT: How to Train Your Accuracy

1 code implementation • 19 May 2022 • Ivan Karpukhin, Stanislav Dereka, Sergey Kolesnikov

Classification tasks are usually evaluated in terms of accuracy.

Ranked #2 on Image Classification on SVHN (Percentage correct metric)

Tasks: General Classification, Image Classification

CVTT: Cross-Validation Through Time

no code implementations • 11 May 2022 • Mikhail Andronov, Sergey Kolesnikov

The evaluation of recommender systems from a practical perspective is a topic of ongoing discourse within the research community.

Tasks: Recommendation Systems

Probabilistic Embeddings Revisited

1 code implementation • 14 Feb 2022 • Ivan Karpukhin, Stanislav Dereka, Sergey Kolesnikov

We thus provide a new confidence evaluation benchmark and establish a baseline for future confidence prediction research.

Tasks: Face Verification, Image Retrieval (+2 more)

Next Period Recommendation Reality Check

no code implementations • 11 Oct 2021 • Sergey Kolesnikov, Oleg Lashinin, Michail Pechatov, Alexander Kosov

In this article, we aim to fill the gap in evaluating RecSys methods on the NPR task using publicly available datasets: (1) we introduce TTRS, a large-scale financial transactions dataset suitable for RecSys methods evaluation, and (2) we benchmark popular RecSys approaches on several datasets for the NPR task.

Tasks: Recommendation Systems

Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters

no code implementations • 8 Oct 2021 • Vladislav Kurenkov, Sergey Kolesnikov

In this work, we argue for the importance of an online evaluation budget for a reliable comparison of deep offline RL algorithms.

Tasks: Decision Making, Energy Management (+4 more)
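
One way to make the "online evaluation budget" idea concrete: given a fixed number of online episodes, split them across candidate policies and deploy whichever looks best empirically. This is a generic illustration of budget-constrained policy selection, not the paper's exact protocol; the policy returns, noise level, and budget below are made up.

```python
import random

def select_policy(num_candidates, budget, rollout, seed=0):
    """Spend a fixed online budget uniformly across candidate policies and
    return the index of the one with the best empirical mean return.
    `rollout(i)` runs one online episode for candidate i."""
    random.seed(seed)
    per_policy = budget // num_candidates
    means = []
    for i in range(num_candidates):
        returns = [rollout(i) for _ in range(per_policy)]
        means.append(sum(returns) / len(returns))
    return max(range(num_candidates), key=means.__getitem__)

# Toy setup: three policies with true mean returns 1.0, 3.0, 2.0 plus noise.
true_means = [1.0, 3.0, 2.0]
def rollout(i):
    return true_means[i] + random.gauss(0.0, 0.5)

best = select_policy(num_candidates=3, budget=60, rollout=rollout)
```

The point of budgeted comparison is that with a small budget (few episodes per candidate) the noisy empirical means may pick the wrong policy, so algorithms should be compared at the budget a practitioner can actually afford.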

LRWR: Large-Scale Benchmark for Lip Reading in Russian language

no code implementations • 14 Sep 2021 • Evgeniy Egorov, Vasily Kostyumov, Mikhail Konyk, Sergey Kolesnikov

Lipreading, also known as visual speech recognition, aims to identify the speech content from videos by analyzing the visual deformations of lips and nearby areas.

Tasks: Lipreading, Lip Reading (+2 more)

Sample Efficient Ensemble Learning with Catalyst.RL

2 code implementations • 29 Mar 2020 • Sergey Kolesnikov, Valentin Khrulkov

We present Catalyst.RL, an open-source PyTorch framework for reproducible and sample-efficient reinforcement learning (RL) research.

Tasks: Ensemble Learning, Reinforcement Learning (+1 more)

Catalyst.RL: A Distributed Framework for Reproducible RL Research

1 code implementation • 28 Feb 2019 • Sergey Kolesnikov, Oleksii Hrinchuk

Despite recent progress in the field of deep reinforcement learning (RL), and arguably because of it, a large body of work remains to be done in reproducing and carefully comparing different RL algorithms.

Tasks: Continuous Control

Run, skeleton, run: skeletal model in a physics-based simulation

1 code implementation • 18 Nov 2017 • Mikhail Pavlov, Sergey Kolesnikov, Sergey M. Plis

In this paper, we present our approach to the physics-based reinforcement learning challenge "Learning to Run", whose objective is to train a physiologically based human model to navigate a complex obstacle course as quickly as possible.

Tasks: Navigate, Policy Gradient Methods (+1 more)
