Search Results for author: Borislav Mavrin

Found 5 papers, 2 papers with code

Self-Supervised Contrastive BERT Fine-tuning for Fusion-based Reviewed-Item Retrieval

2 code implementations • 1 Aug 2023 • Mohammad Mahdi Abdollah Pour, Parsa Farinneya, Armin Toroghi, Anton Korikov, Ali Pesaranghader, Touqir Sajed, Manasa Bharadwaj, Borislav Mavrin, Scott Sanner

Experimental results show that Late Fusion contrastive learning for Neural RIR outperforms all other contrastive IR configurations, Neural IR, and sparse retrieval baselines, thus demonstrating the power of exploiting the two-level structure in Neural RIR approaches as well as the importance of preserving the nuance of individual review content via Late Fusion methods.

Contrastive Learning Information Retrieval +2

Paper
Code

Efficient decorrelation of features using Gramian in Reinforcement Learning

no code implementations • 19 Nov 2019 • Borislav Mavrin, Daniel Graves, Alan Chan

Learning good representations is a long standing problem in reinforcement learning (RL).

Atari Games reinforcement-learning +1

Paper
Add Code

Distributional Reinforcement Learning for Efficient Exploration

no code implementations • 13 May 2019 • Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yao-Liang Yu

In distributional reinforcement learning (RL), the estimated distribution of value function models both the parametric and intrinsic uncertainties.

Atari Games Distributional Reinforcement Learning +3

Paper
Add Code

Deep Reinforcement Learning with Decorrelation

no code implementations • 18 Mar 2019 • Borislav Mavrin, Hengshuai Yao, Linglong Kong

Further experiments on the losing games show that our decorelation algorithms can win over DQN and QR-DQN with a fined tuned regularization factor.

Atari Games reinforcement-learning +2

Paper
Add Code

QUOTA: The Quantile Option Architecture for Reinforcement Learning

3 code implementations • 5 Nov 2018 • Shangtong Zhang, Borislav Mavrin, Linglong Kong, Bo Liu, Hengshuai Yao

In this paper, we propose the Quantile Option Architecture (QUOTA) for exploration based on recent advances in distributional reinforcement learning (RL).

Decision Making Distributional Reinforcement Learning +2

3,095

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.