1 code implementation • 25 Sep 2024 • Ivi Chatzi, Nina Corvelo Benz, Eleni Straitouri, Stratis Tsirtsis, Manuel Gomez-Rodriguez
Our model allows any large language model to perform counterfactual token generation at almost no cost in comparison with vanilla token generation, it is embarrassingly simple to implement, and it does not require any fine-tuning nor prompt engineering.
1 code implementation • 10 Jun 2024 • Eleni Straitouri, Suhas Thejaswi, Manuel Gomez Rodriguez
In this paper, our goal is to control how frequently a decision support system based on prediction sets may cause harm, by design.
1 code implementation • 27 May 2024 • Giovanni De Toni, Nastaran Okati, Suhas Thejaswi, Eleni Straitouri, Manuel Gomez-Rodriguez
Then, we show that the problem of finding the optimal prediction sets under which the human experts achieve the highest average accuracy is NP-hard.
1 code implementation • 27 Feb 2024 • Ivi Chatzi, Eleni Straitouri, Suhas Thejaswi, Manuel Gomez Rodriguez
Using pairwise comparisons made by humans in the LMSYS Chatbot Arena platform and pairwise comparisons made by three strong large language models, we empirically demonstrate the effectivity of our framework and show that the rank-sets constructed using only pairwise comparisons by the strong large language models are often inconsistent with (the distribution of) human pairwise preferences.
1 code implementation • 6 Jun 2023 • Eleni Straitouri, Manuel Gomez Rodriguez
In this context, it has been recently argued that an alternative type of decision support systems may circumvent this challenge.
1 code implementation • 28 Jan 2022 • Eleni Straitouri, Lequn Wang, Nastaran Okati, Manuel Gomez Rodriguez
In this work, we develop an automated decision support system that, by design, does not require experts to understand when to trust the system to improve performance.
no code implementations • 23 Sep 2021 • Eleni Straitouri, Adish Singla, Vahid Balazadeh Meresht, Manuel Gomez-Rodriguez
Methods to learn under algorithmic triage have predominantly focused on supervised learning settings where each decision, or prediction, is independent of each other.