no code implementations • 6 Apr 2024 • Nicolas Yax, Pierre-Yves Oudeyer, Stefano Palminteri
This paper introduces PhyloLM, a method applying phylogenetic algorithms to Large Language Models to explore their finetuning relationships, and predict their performance characteristics.
1 code implementation • 16 Feb 2024 • Johann Lussange, Stefano Vrizzi, Stefano Palminteri, Boris Gutkin
Building on a previous foundation work (Lussange et al. 2020), this study introduces a multi-agent reinforcement learning (MARL) model simulating crypto markets, which is calibrated to the Binance's daily closing prices of $153$ cryptocurrencies that were continuously traded between 2018 and 2022.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 25 Jan 2024 • William M. Hayes, Nicolas Yax, Stefano Palminteri
Studies of reinforcement learning in humans and animals have demonstrated a preference for options that yielded relatively better outcomes in the past, even when those options are associated with lower absolute reward.
no code implementations • 21 Sep 2023 • Nicolas Yax, Hernan Anlló, Stefano Palminteri
In the present study, we investigate and compare reasoning in large language models (LLM) and humans using a selection of cognitive psychology tools traditionally dedicated to the study of (bounded) rationality.