Search Results for author: Elena Smirnova

Found 8 papers, 2 papers with code

On the Convergence of Smooth Regularized Approximate Value Iteration Schemes

no code implementations NeurIPS 2020 Elena Smirnova, Elvis Dohmatob

Entropy regularization, smoothing of Q-values and neural network function approximator are key components of the state-of-the-art reinforcement learning (RL) algorithms, such as Soft Actor-Critic~\cite{haarnoja2018soft}.

reinforcement-learning Reinforcement Learning (RL)

On the Convergence of Approximate and Regularized Policy Iteration Schemes

no code implementations20 Sep 2019 Elena Smirnova, Elvis Dohmatob

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks.

Q-Learning Reinforcement Learning (RL)

Distributionally Robust Counterfactual Risk Minimization

no code implementations14 Jun 2019 Louis Faury, Ugo Tanielian, Flavian vasile, Elena Smirnova, Elvis Dohmatob

This manuscript introduces the idea of using Distributionally Robust Optimization (DRO) for the Counterfactual Risk Minimization (CRM) problem.

counterfactual Decision Making

Distributionally Robust Reinforcement Learning

no code implementations23 Feb 2019 Elena Smirnova, Elvis Dohmatob, Jérémie Mary

Our formulation results in a efficient algorithm that accounts for a simple re-weighting of policy actions in the standard policy iteration scheme.

Continuous Control Q-Learning +2

Action-conditional Sequence Modeling for Recommendation

no code implementations7 Sep 2018 Elena Smirnova

In this paper, we consider interactions triggered by the recommendations of deployed recommender system in addition to browsing behavior.

Collaborative Filtering Recommendation Systems

Recurrent Neural Networks for Long and Short-Term Sequential Recommendation

no code implementations23 Jul 2018 Kiewan Villatel, Elena Smirnova, Jérémie Mary, Philippe Preux

Recommender systems objectives can be broadly characterized as modeling user preferences over short-or long-term time horizon.

Dimensionality Reduction Sequential Recommendation +1

Contextual Sequence Modeling for Recommendation with Recurrent Neural Networks

1 code implementation23 Jun 2017 Elena Smirnova, Flavian vasile

Recommendations can greatly benefit from good representations of the user state at recommendation time.

Session-Based Recommendations

Meta-Prod2Vec - Product Embeddings Using Side-Information for Recommendation

2 code implementations25 Jul 2016 Flavian Vasile, Elena Smirnova, Alexis Conneau

We propose Meta-Prod2vec, a novel method to compute item similarities for recommendation that leverages existing item metadata.

Cannot find the paper you are looking for? You can Submit a new open access paper.