no code implementations • 21 Jun 2024 • Bram M. Renting, Thomas M. Moerland, Holger H. Hoos, Catholijn M. Jonker
We developed an end-to-end reinforcement learning method for diverse negotiation problems by representing observations and actions as a graph and applying graph neural networks in the policy.
no code implementations • 11 Mar 2024 • Michiel van der Meer, Enrico Liscio, Catholijn M. Jonker, Aske Plaat, Piek Vossen, Pradeep K. Murukannaiah
We find that, on the one hand, HyEnA achieves higher coverage and precision than a state-of-the-art automated method when compared to a common set of diverse opinions, justifying the need for human insight.
no code implementations • 26 Feb 2024 • Enrico Liscio, Luciano C. Siebert, Catholijn M. Jonker, Pradeep K. Murukannaiah
We focus on situations where a conflict is detected between participants' choices and motivations, and propose methods for estimating value preferences while addressing detected inconsistencies by interacting with the participants.
no code implementations • 2 Feb 2024 • Michiel van der Meer, Piek Vossen, Catholijn M. Jonker, Pradeep K. Murukannaiah
Presenting high-level arguments is a crucial task for fostering participation in online societal discussions.
no code implementations • 8 Nov 2023 • Siddharth Mehrotra, Chadha Degachi, Oleksandra Vereschak, Catholijn M. Jonker, Myrthe L. Tielman
Appropriate Trust in Artificial Intelligence (AI) systems has rapidly become an important area of focus for both researchers and practitioners.
1 code implementation • 24 Oct 2023 • Michiel van der Meer, Piek Vossen, Catholijn M. Jonker, Pradeep K. Murukannaiah
We investigate a hypothesis that differences in personal values are indicative of disagreement in online discussions.
no code implementations • 12 Jul 2023 • Catholijn M. Jonker, Luciano Cavalcante Siebert, Pradeep K. Murukannaiah
With the growing capabilities and pervasiveness of AI systems, societies must collectively choose between reduced human autonomy, endangered democracies and limited human rights, and AI that is aligned to human and social values, nurturing collaboration, resilience, knowledge and ethical behaviour.
no code implementations • 16 Jan 2023 • Pei-Yu Chen, Myrthe L. Tielman, Dirk K. J. Heylen, Catholijn M. Jonker, M. Birna van Riemsdijk
Most of the current approach to AI alignment is to learn what humans value from their behavioural data.
no code implementations • 5 Oct 2022 • Ruben S. Verhagen, Siddharth Mehrotra, Mark A. Neerincx, Catholijn M. Jonker, Myrthe L. Tielman
We identify four components to which explanation designers can pay special attention: perception, semantics, intent, and user & context.
no code implementations • 13 May 2022 • Pradeep K. Murukannaiah, Catholijn M. Jonker
Existing protocols for multilateral negotiation require a full consensus among the negotiating parties.
1 code implementation • 24 Jan 2022 • Maria Tsfasman, Avinash Saravanan, Dekel Viner, Daan Goslinga, Sarah de Wolf, Chirag Raman, Catholijn M. Jonker, Catharine Oertel
43 English-speaking participants took part in the study for whom we analysed the degree of acoustic-prosodic entrainment to the human or robot face, respectively.
no code implementations • 25 Nov 2021 • Luciano Cavalcante Siebert, Maria Luce Lupetti, Evgeni Aizenberg, Niek Beckers, Arkady Zgonnikov, Herman Veluwenkamp, David Abbink, Elisa Giaccardi, Geert-Jan Houben, Catholijn M. Jonker, Jeroen van den Hoven, Deborah Forster, Reginald L. Lagendijk
The concept of meaningful human control has been proposed to address responsibility gaps and mitigate them by establishing conditions that enable a proper attribution of responsibility for humans; however, clear requirements for researchers, designers, and engineers are yet inexistent, making the development of AI-based systems that remain under meaningful human control challenging.
no code implementations • 19 Oct 2021 • Ilir Kola, Pradeep K. Murukannaiah, Catholijn M. Jonker, M. Birna van Riemsdijk
Since many daily activities are social in nature, support agents should understand a user's social situation to offer comprehensive support.
no code implementations • 15 Oct 2021 • Ilir Kola, Catholijn M. Jonker, M. Birna van Riemsdijk
First, from a technical perspective, we show that psychological characteristics of situations can be used as input to predict the priority of social situations, and that psychological characteristics of situations can be predicted from the features of a social situation.
no code implementations • 29 Sep 2021 • Catholijn M. Jonker, Jan Treur
It is shown how for different aggregation levels and other elements within an organisation structure, sets of dynamic properties can be specified.
no code implementations • 27 Jul 2021 • Ruth Shortall, Anatol Itten, Michiel van der Meer, Pradeep K. Murukannaiah, Catholijn M. Jonker
Designers of online deliberative platforms aim to counter the degrading quality of online debates.
no code implementations • 3 Jul 2021 • Ming Li, Pradeep K. Murukannaiah, Catholijn M. Jonker
Our approach includes a data generation method for an agent to generate domain-independent sequences by negotiating with a variety of opponents across domains, a feature engineering method for representing negotiation data as time series with time-step features and overall features, and a hybrid (recurrent neural network-based) deep learning method for recognizing an opponent's strategy from the time series of bids.
1 code implementation • 10 Jun 2021 • Youri Coppens, Denis Steckelmacher, Catholijn M. Jonker, Ann Nowé
Then, to ensure that the rules explain a valid, non-degenerate policy, we introduce a refinement algorithm that fine-tunes the rules to obtain good performance when executed in the environment.
no code implementations • 19 May 2021 • Siddharth Mehrotra, Catholijn M. Jonker, Myrthe L. Tielman
To achieve this, it is first important to understand which factors influence trust in AI.
no code implementations • 22 Dec 2020 • Rijk Mercuur, Virginia Dignum, Catholijn M. Jonker
This paper provides the domain-independent Social Practice Agent (SoPrA) framework that satisfies requirements from the literature to simulate our routines.
no code implementations • 30 Jun 2020 • Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
Two key approaches to this problem are reinforcement learning (RL) and planning.
no code implementations • 26 Jun 2020 • Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide.
1 code implementation • 19 May 2020 • Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
Monte Carlo Tree Search (MCTS) efficiently balances exploration and exploitation in tree search based on count-derived uncertainty.
1 code implementation • 15 May 2020 • Thomas M. Moerland, Anna Deichler, Simone Baldi, Joost Broekens, Catholijn M. Jonker
Planning and reinforcement learning are two key approaches to sequential decision making.
no code implementations • 31 Mar 2020 • Bram M. Renting, Holger H. Hoos, Catholijn M. Jonker
By empowering automated negotiating agents using automated algorithm configuration, we obtain a flexible negotiation agent that can be configured automatically for a rich space of opponents and negotiation scenarios.
1 code implementation • 11 Jun 2018 • Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker
This paper studies the potential of the return distribution for exploration in deterministic reinforcement learning (RL) environments.
2 code implementations • 24 May 2018 • Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go.
2 code implementations • 23 May 2018 • Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker
Asymmetric termination of search trees introduces a type of uncertainty for which the standard upper confidence bound (UCB) formula does not account.
1 code implementation • 21 Feb 2018 • Luisa M. Zintgraf, Diederik M. Roijers, Sjoerd Linders, Catholijn M. Jonker, Ann Nowé
We build on previous work on Gaussian processes and pairwise comparisons for preference modelling, extend it to the multi-objective decision support scenario, and propose new ordered preference elicitation strategies based on ranking and clustering.
no code implementations • 29 Nov 2017 • Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker
This paper studies directed exploration for reinforcement learning agents by tracking uncertainty about the value of each available action.
no code implementations • 15 May 2017 • Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker
This article provides the first survey of computational models of emotion in reinforcement learning (RL) agents.
1 code implementation • 1 May 2017 • Thomas M. Moerland, Joost Broekens, Catholijn M. Jonker
In this paper we study how to learn stochastic, multimodal transition dynamics in reinforcement learning (RL) tasks.
Model-based Reinforcement Learning reinforcement-learning +2
no code implementations • 3 Jul 2016 • Victor Sanchez-Anguix, Reyhan Aydogan, Tim Baarslag, Catholijn M. Jonker
Traditionally, researchers in decision making have focused on attempting to reach Pareto Optimality using horizontal approaches, where optimality is calculated taking into account every participant at the same time.