no code implementations • 5 Nov 2024 • Daniel de Vassimon Manela, Linying Yang, Robin J. Evans
By basing simulations on real data, our method yields more realistic evaluations, a quality often missing from current work that relies on simplified datasets.
no code implementations • 10 Jul 2024 • Linying Yang, Vik Shirvaikar, Oscar Clivio, Fabian Falck
We hope this work will pave the way towards a general framework for the assessment of causal understanding in LLMs and the design of novel benchmarks.