Search Results for author: Stefano Palminteri

Found 4 papers, 1 papers with code

Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks

no code implementations • 6 Apr 2024 • Nicolas Yax, Pierre-Yves Oudeyer, Stefano Palminteri

This paper introduces PhyloLM, a method applying phylogenetic algorithms to Large Language Models to explore their finetuning relationships, and predict their performance characteristics.

Paper
Add Code

Modelling crypto markets by multi-agent reinforcement learning

1 code implementation • 16 Feb 2024 • Johann Lussange, Stefano Vrizzi, Stefano Palminteri, Boris Gutkin

Building on a previous foundation work (Lussange et al. 2020), this study introduces a multi-agent reinforcement learning (MARL) model simulating crypto markets, which is calibrated to the Binance's daily closing prices of $153$ cryptocurrencies that were continuously traded between 2018 and 2022.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Relative Value Biases in Large Language Models

no code implementations • 25 Jan 2024 • William M. Hayes, Nicolas Yax, Stefano Palminteri

Studies of reinforcement learning in humans and animals have demonstrated a preference for options that yielded relatively better outcomes in the past, even when those options are associated with lower absolute reward.

Paper
Add Code

Studying and improving reasoning in humans and machines

no code implementations • 21 Sep 2023 • Nicolas Yax, Hernan Anlló, Stefano Palminteri

In the present study, we investigate and compare reasoning in large language models (LLM) and humans using a selection of cognitive psychology tools traditionally dedicated to the study of (bounded) rationality.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.