no code implementations • 8 Jul 2016 • Tristan Cazenave, Jialin Liu, Fabien Teytaud, Olivier Teytaud
Many artificial intelligences (AIs) are randomized.
no code implementations • 18 Nov 2019 • Tristan Cazenave, Véronique Ventos
{\alpha}{\mu} is an anytime heuristic search algorithm for incomplete information games that assumes perfect information for the opponents.
no code implementations • 15 Jan 2020 • Tristan Cazenave
We present a general algorithm to order moves so as to speedup exact game solvers.
no code implementations • 27 Jan 2020 • Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-Jin Lin, Xavier Martinet, Vegard Mella, Jeremy Rapin, Baptiste Roziere, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko
Since DeepMind's AlphaZero, Zero learning quickly became the state-of-the-art method for many board games.
no code implementations • 22 Mar 2020 • Tristan Cazenave
Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.
no code implementations • 20 May 2020 • Tristan Cazenave, Thomas Fournier
The RNA Inverse Folding problem comes from computational biology.
no code implementations • 23 Aug 2020 • Tristan Cazenave
The architecture of the neural networks used in Deep Reinforcement Learning programs such as Alpha Zero or Polygames has been shown to have a great impact on the performances of the resulting playing engines.
no code implementations • 19 Dec 2020 • Quentin Cohen-Solal, Tristan Cazenave
Deep Reinforcement Learning (DRL) reaches a superhuman level of play in many complete information games.
no code implementations • 10 Jan 2021 • Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont
Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games.
no code implementations • 29 Jan 2021 • Tristan Cazenave, Swann Legras, Véronique Ventos
$\alpha\mu$ is a search algorithm which repairs two defaults of Perfect Information Monte Carlo search: strategy fusion and non locality.
no code implementations • 6 Feb 2021 • Tristan Cazenave
The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results.
no code implementations • 9 Apr 2021 • Tristan Cazenave
The transposition table contains the results of the inferences while the search tree contains the statistics of Monte Carlo Tree Search.
no code implementations • 2 Aug 2021 • Kevin Osanlou, Jeremy Frank, J. Benton, Andrei Bursuc, Christophe Guettier, Eric Jacopin, Tristan Cazenave
Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications.
no code implementations • 2 Aug 2021 • Kevin Osanlou, Andrei Bursuc, Christophe Guettier, Tristan Cazenave, Eric Jacopin
More specifically, a graph neural network is used to assist the branch and bound algorithm in handling constraints associated with a desired solution path.
no code implementations • 2 Aug 2021 • Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin
In this paper, we focus on shortest path search with mandatory nodes on a given connected graph.
no code implementations • 2 Aug 2021 • Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin
The uncertain criterion represents the feasibility of driving through the path without requiring human intervention.
no code implementations • 12 Nov 2021 • Julien Sentuc, Tristan Cazenave, Jean-Yves Lucas
In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem.
no code implementations • 28 Mar 2022 • Kevin Osanlou, Jeremy Frank, Andrei Bursuc, Tristan Cazenave, Eric Jacopin, Christophe Guettier, J. Benton
Moreover, we leverage a graph neural network as a heuristic for tree search guidance.
no code implementations • 4 Jul 2022 • Milo Roucairol, Tristan Cazenave
We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Monte Carlo Search (NMCS) and Nested Rollout Policy Adaptation (NRPA), can be used to build graphs and find counter-examples to spectral graph theory conjectures in minutes.
no code implementations • 26 Jul 2022 • Kevin Osanlou, Christophe Guettier, Tristan Cazenave, Eric Jacopin
We describe briefly the concept of reinforcement learning algorithms and some approaches designed to date.
no code implementations • 1 Oct 2022 • Tristan Cazenave
Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions.
no code implementations • 23 Jan 2023 • Milo Roucairol, Tristan Cazenave
The algorithm presented in this paper does not beat state of the art algorithms, see PERM (Hsu and Grassberger 2011), REMC (Thachuk, Shmygelska, and Hoos 2007) or WLRE (W\"ust and Landau 2012) for better results.
no code implementations • 8 Feb 2023 • Quentin Cohen-Solal, Tristan Cazenave
In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games.
no code implementations • 26 Feb 2023 • Hui Wang, Abdallah Saffidine, Tristan Cazenave
First, a nesting of the tree search inspired by the Nested Monte Carlo Search algorithm is effective on most instance types in the benchmark.
1 code implementation • 12 Apr 2023 • Walid Bendada, Guillaume Salha-Galvan, Thomas Bouabça, Tristan Cazenave
Music streaming services often aim to recommend songs for users to extend the playlists they have created on these services.
1 code implementation • 24 Aug 2023 • Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Thomas Bouabça, Tristan Cazenave
A prevalent practice in recommender systems consists in averaging item embeddings to represent users or higher-level concepts in the same embedding space.
no code implementations • 22 Sep 2023 • Marc Pierre, Quentin Cohen-Solal, Tristan Cazenave
Monte Carlo Tree Search can be used for automated theorem proving.
no code implementations • 22 Sep 2023 • Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine
Motivated by the success of transformers in various fields, such as language understanding and image analysis, this investigation explores their application in the context of the game of Go.
no code implementations • 18 Jan 2024 • Tristan Cazenave
Generalized Nested Rollout Policy Adaptation (GNRPA) is a Monte Carlo search algorithm for optimizing a sequence of choices.
no code implementations • 19 Jan 2024 • Tristan Cazenave
Monte Carlo Search gives excellent results in multiple difficult combinatorial problems.
no code implementations • 14 Apr 2024 • Tristan Cazenave
We automatically design the PUCT and the SHUSS root exploration terms.
no code implementations • JEP/TALN/RECITAL 2022 • Guillaume Grosjean, Anna Pappa, Baptiste Roziere, Tristan Cazenave
A l’occasion du quatre-centième anniversaire de la naissance de Molière (1622-1673), nous présentons un agent conversationnel qui parle comme un personnage du théâtre de Molière.