Search Results for author: Tristan Cazenave

The architecture of the neural networks used in Deep Reinforcement Learning programs such as Alpha Zero or Polygames has been shown to have a great impact on the performances of the resulting playing engines.

Game of Go Reinforcement Learning (RL)

Minimax Strikes Back

no code implementations • 19 Dec 2020 • Quentin Cohen-Solal, Tristan Cazenave

Deep Reinforcement Learning (DRL) reaches a superhuman level of play in many complete information games.

reinforcement-learning Reinforcement Learning (RL)

Stabilized Nested Rollout Policy Adaptation

no code implementations • 10 Jan 2021 • Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.

Optimizing $\alpha\mu$

no code implementations • 29 Jan 2021 • Tristan Cazenave, Swann Legras, Véronique Ventos

$\alpha\mu$ is a search algorithm that repairs two defects of Perfect Information Monte Carlo search: strategy fusion and non-locality.

Improving Model and Search for Computer Go

no code implementations • 6 Feb 2021 • Tristan Cazenave

The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results.

reinforcement-learning Reinforcement Learning (RL)

Batch Monte Carlo Tree Search

no code implementations • 9 Apr 2021 • Tristan Cazenave

The transposition table contains the results of the inferences while the search tree contains the statistics of Monte Carlo Tree Search.

Game of Go

Time-based Dynamic Controllability of Disjunctive Temporal Networks with Uncertainty: A Tree Search Approach with Graph Neural Network Guidance

no code implementations • 2 Aug 2021 • Kevin Osanlou, Jeremy Frank, J. Benton, Andrei Bursuc, Christophe Guettier, Eric Jacopin, Tristan Cazenave

Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications.

Scheduling

Optimal Solving of Constrained Path-Planning Problems with Graph Convolutional Networks and Optimized Tree Search

no code implementations • 2 Aug 2021 • Kevin Osanlou, Andrei Bursuc, Christophe Guettier, Tristan Cazenave, Eric Jacopin

More specifically, a graph neural network is used to assist the branch and bound algorithm in handling constraints associated with a desired solution path.

Constrained Shortest Path Search with Graph Convolutional Neural Networks

no code implementations • 2 Aug 2021 • Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

In this paper, we focus on shortest path search with mandatory nodes on a given connected graph.

Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

no code implementations • 2 Aug 2021 • Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

The uncertain criterion represents the feasibility of driving through the path without requiring human intervention.

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

no code implementations • 12 Nov 2021 • Julien Sentuc, Tristan Cazenave, Jean-Yves Lucas

In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem.

Solving Disjunctive Temporal Networks with Uncertainty under Restricted Time-Based Controllability using Tree Search and Graph Neural Networks

no code implementations • 28 Mar 2022 • Kevin Osanlou, Jeremy Frank, Andrei Bursuc, Tristan Cazenave, Eric Jacopin, Christophe Guettier, J. Benton

Moreover, we leverage a graph neural network as a heuristic for tree search guidance.

Scheduling

Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

no code implementations • 4 Jul 2022 • Milo Roucairol, Tristan Cazenave

We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Monte Carlo Search (NMCS) and Nested Rollout Policy Adaptation (NRPA), can be used to build graphs and find counter-examples to spectral graph theory conjectures in minutes.

Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature

no code implementations • 26 Jul 2022 • Kevin Osanlou, Christophe Guettier, Tristan Cazenave, Eric Jacopin

We describe briefly the concept of reinforcement learning algorithms and some approaches designed to date.

Autonomous Vehicles reinforcement-learning

Nested Search versus Limited Discrepancy Search

no code implementations • 1 Oct 2022 • Tristan Cazenave

Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions.

Solving the HP model with Nested Monte Carlo Search

no code implementations • 23 Jan 2023 • Milo Roucairol, Tristan Cazenave

The algorithm presented in this paper does not beat state of the art algorithms, see PERM (Hsu and Grassberger 2011), REMC (Thachuk, Shmygelska, and Hoos 2007) or WLRE (Wüst and Landau 2012) for better results.

Protein Folding

Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

no code implementations • 8 Feb 2023 • Quentin Cohen-Solal, Tristan Cazenave

In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games.

Vocal Bursts Valence Prediction

Towards Tackling MaxSAT by Combining Nested Monte Carlo with Local Search

no code implementations • 26 Feb 2023 • Hui Wang, Abdallah Saffidine, Tristan Cazenave

First, a nesting of the tree search inspired by the Nested Monte Carlo Search algorithm is effective on most instance types in the benchmark.

A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services

1 code implementation • 12 Apr 2023 • Walid Bendada, Guillaume Salha-Galvan, Thomas Bouabça, Tristan Cazenave

Music streaming services often aim to recommend songs for users to extend the playlists they have created on these services.

Representation Learning

On the Consistency of Average Embeddings for Item Recommendation

1 code implementation • 24 Aug 2023 • Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Thomas Bouabça, Tristan Cazenave

A prevalent practice in recommender systems consists in averaging item embeddings to represent users or higher-level concepts in the same embedding space.

Recommendation Systems

The Mathematical Game

no code implementations • 22 Sep 2023 • Marc Pierre, Quentin Cohen-Solal, Tristan Cazenave

Monte Carlo Tree Search can be used for automated theorem proving.

Automated Theorem Proving

Vision Transformers for Computer Go

no code implementations • 22 Sep 2023 • Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine

Motivated by the success of transformers in various fields, such as language understanding and image analysis, this investigation explores their application in the context of the game of Go.

Game of Go

Generalized Nested Rollout Policy Adaptation with Limited Repetitions

no code implementations • 18 Jan 2024 • Tristan Cazenave

Generalized Nested Rollout Policy Adaptation (GNRPA) is a Monte Carlo search algorithm for optimizing a sequence of choices.

Traveling Salesman Problem

Learning a Prior for Monte Carlo Search by Replaying Solutions to Combinatorial Problems

no code implementations • 19 Jan 2024 • Tristan Cazenave

Monte Carlo Search gives excellent results in multiple difficult combinatorial problems.

Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms

no code implementations • 14 Apr 2024 • Tristan Cazenave

We automatically design the PUCT and the SHUSS root exploration terms.

Dialogue avec Molière (Dialogue with Molière)

no code implementations • JEP/TALN/RECITAL 2022 • Guillaume Grosjean, Anna Pappa, Baptiste Roziere, Tristan Cazenave

On the occasion of the four-hundredth anniversary of the birth of Molière (1622-1673), we present a conversational agent that speaks like a character from Molière's plays.

Chatbot
