Search Results for author: Tristan Cazenave

Found 32 papers, 2 papers with code

The αμ Search Algorithm for the Game of Bridge

no code implementations18 Nov 2019 Tristan Cazenave, Véronique Ventos

{\alpha}{\mu} is an anytime heuristic search algorithm for incomplete information games that assumes perfect information for the opponents.

Monte Carlo Game Solver

no code implementations15 Jan 2020 Tristan Cazenave

We present a general algorithm to order moves so as to speedup exact game solvers.

Generalized Nested Rollout Policy Adaptation

no code implementations22 Mar 2020 Tristan Cazenave

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.

Traveling Salesman Problem

Monte Carlo Inverse Folding

no code implementations20 May 2020 Tristan Cazenave, Thomas Fournier

The RNA Inverse Folding problem comes from computational biology.

Mobile Networks for Computer Go

no code implementations23 Aug 2020 Tristan Cazenave

The architecture of the neural networks used in Deep Reinforcement Learning programs such as Alpha Zero or Polygames has been shown to have a great impact on the performances of the resulting playing engines.

Game of Go Reinforcement Learning (RL)

Minimax Strikes Back

no code implementations19 Dec 2020 Quentin Cohen-Solal, Tristan Cazenave

Deep Reinforcement Learning (DRL) reaches a superhuman level of play in many complete information games.

reinforcement-learning Reinforcement Learning (RL)

Stabilized Nested Rollout Policy Adaptation

no code implementations10 Jan 2021 Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.

Optimizing $αμ$

no code implementations29 Jan 2021 Tristan Cazenave, Swann Legras, Véronique Ventos

$\alpha\mu$ is a search algorithm which repairs two defaults of Perfect Information Monte Carlo search: strategy fusion and non locality.

Improving Model and Search for Computer Go

no code implementations6 Feb 2021 Tristan Cazenave

The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results.

reinforcement-learning Reinforcement Learning (RL)

Batch Monte Carlo Tree Search

no code implementations9 Apr 2021 Tristan Cazenave

The transposition table contains the results of the inferences while the search tree contains the statistics of Monte Carlo Tree Search.

Game of Go

Optimal Solving of Constrained Path-Planning Problems with Graph Convolutional Networks and Optimized Tree Search

no code implementations2 Aug 2021 Kevin Osanlou, Andrei Bursuc, Christophe Guettier, Tristan Cazenave, Eric Jacopin

More specifically, a graph neural network is used to assist the branch and bound algorithm in handling constraints associated with a desired solution path.

Constrained Shortest Path Search with Graph Convolutional Neural Networks

no code implementations2 Aug 2021 Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

In this paper, we focus on shortest path search with mandatory nodes on a given connected graph.

Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

no code implementations2 Aug 2021 Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

The uncertain criterion represents the feasibility of driving through the path without requiring human intervention.

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

no code implementations12 Nov 2021 Julien Sentuc, Tristan Cazenave, Jean-Yves Lucas

In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem.

Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

no code implementations4 Jul 2022 Milo Roucairol, Tristan Cazenave

We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Monte Carlo Search (NMCS) and Nested Rollout Policy Adaptation (NRPA), can be used to build graphs and find counter-examples to spectral graph theory conjectures in minutes.

Nested Search versus Limited Discrepancy Search

no code implementations1 Oct 2022 Tristan Cazenave

Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions.

Solving the HP model with Nested Monte Carlo Search

no code implementations23 Jan 2023 Milo Roucairol, Tristan Cazenave

The algorithm presented in this paper does not beat state of the art algorithms, see PERM (Hsu and Grassberger 2011), REMC (Thachuk, Shmygelska, and Hoos 2007) or WLRE (W\"ust and Landau 2012) for better results.

Protein Folding

Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

no code implementations8 Feb 2023 Quentin Cohen-Solal, Tristan Cazenave

In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games.

Vocal Bursts Valence Prediction

Towards Tackling MaxSAT by Combining Nested Monte Carlo with Local Search

no code implementations26 Feb 2023 Hui Wang, Abdallah Saffidine, Tristan Cazenave

First, a nesting of the tree search inspired by the Nested Monte Carlo Search algorithm is effective on most instance types in the benchmark.

A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services

1 code implementation12 Apr 2023 Walid Bendada, Guillaume Salha-Galvan, Thomas Bouabça, Tristan Cazenave

Music streaming services often aim to recommend songs for users to extend the playlists they have created on these services.

Representation Learning

On the Consistency of Average Embeddings for Item Recommendation

1 code implementation24 Aug 2023 Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Thomas Bouabça, Tristan Cazenave

A prevalent practice in recommender systems consists in averaging item embeddings to represent users or higher-level concepts in the same embedding space.

Recommendation Systems

The Mathematical Game

no code implementations22 Sep 2023 Marc Pierre, Quentin Cohen-Solal, Tristan Cazenave

Monte Carlo Tree Search can be used for automated theorem proving.

Automated Theorem Proving

Vision Transformers for Computer Go

no code implementations22 Sep 2023 Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine

Motivated by the success of transformers in various fields, such as language understanding and image analysis, this investigation explores their application in the context of the game of Go.

Game of Go

Generalized Nested Rollout Policy Adaptation with Limited Repetitions

no code implementations18 Jan 2024 Tristan Cazenave

Generalized Nested Rollout Policy Adaptation (GNRPA) is a Monte Carlo search algorithm for optimizing a sequence of choices.

Traveling Salesman Problem

Learning a Prior for Monte Carlo Search by Replaying Solutions to Combinatorial Problems

no code implementations19 Jan 2024 Tristan Cazenave

Monte Carlo Search gives excellent results in multiple difficult combinatorial problems.

Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms

no code implementations14 Apr 2024 Tristan Cazenave

We automatically design the PUCT and the SHUSS root exploration terms.

Dialogue avec Molière (Dialogue with Molière )

no code implementations JEP/TALN/RECITAL 2022 Guillaume Grosjean, Anna Pappa, Baptiste Roziere, Tristan Cazenave

A l’occasion du quatre-centième anniversaire de la naissance de Molière (1622-1673), nous présentons un agent conversationnel qui parle comme un personnage du théâtre de Molière.

Chatbot

Cannot find the paper you are looking for? You can Submit a new open access paper.