no code implementations • 9 Jul 2024 • Guillermo Infante, Anders Jonsson, Vicenç Gómez
We introduce a novel approach to hierarchical reinforcement learning for Linearly-solvable Markov Decision Processes (LMDPs) in the infinite-horizon average-reward setting.
Hierarchical Reinforcement Learning reinforcement-learning +1
no code implementations • 22 Mar 2024 • Guillermo Infante, David Kuric, Anders Jonsson, Vicenç Gómez, Herke van Hoof
Conventional reinforcement learning (RL) methods can successfully solve a wide range of sequential decision problems.
1 code implementation • 10 Dec 2023 • Nurudin Alvarez-Gonzalez, Andreas Kaltenbrunner, Vicenç Gómez
We present a novel edge-level ego-network encoding for learning on graphs that can boost Message Passing Graph Neural Networks (MP-GNNs) by providing additional node and edge features or extending message-passing formats.
1 code implementation • 27 Nov 2022 • Nurudin Alvarez-Gonzalez, Andreas Kaltenbrunner, Vicenç Gómez
Identifying similar network structures is key to capture graph isomorphisms and learn representations that exploit structural information encoded in graph data.
2 code implementations • Findings (EMNLP) 2021 • Nurudin Alvarez-Gonzalez, Andreas Kaltenbrunner, Vicenç Gómez
Identifying emotions from text is crucial for a variety of real world tasks.
1 code implementation • 29 Jun 2021 • Guillermo Infante, Anders Jonsson, Vicenç Gómez
In this work we present a novel approach to hierarchical reinforcement learning for linearly-solvable Markov decision processes.
Hierarchical Reinforcement Learning reinforcement-learning +1
1 code implementation • 15 Jan 2021 • Miquel Junyent, Vicenç Gómez, Anders Jonsson
In classical planning, we show how IW(1) at two levels of abstraction can solve problems of width 2.
no code implementations • 26 Sep 2020 • Nurudin Alvarez-Gonzalez, Andreas Kaltenbrunner, Vicenç Gómez
Learning embeddings from large-scale networks is an open challenge.
no code implementations • 13 May 2020 • Dominik Thalmeier, Hilbert J. Kappen, Simone Totaro, Vicenç Gómez
We identify PICE as the infinite smoothing limit of such technique and show that the sample efficiency problems that PICE suffers disappear for finite levels of smoothing.
1 code implementation • ICLR 2020 • Joan Serrà, David Álvarez, Vicenç Gómez, Olga Slizovskaia, José F. Núñez, Jordi Luque
Likelihood-based generative models are a promising resource to detect out-of-distribution (OOD) inputs which could compromise the robustness or reliability of a machine learning system.
Ranked #10 on Anomaly Detection on Unlabeled CIFAR-10 vs CIFAR-100
no code implementations • 13 May 2019 • Behzad Tabibian, Vicenç Gómez, Abir De, Bernhard Schölkopf, Manuel Gomez Rodriguez
Can we design ranking models that understand the consequences of their proposed rankings and, more importantly, are able to avoid the undesirable ones?
1 code implementation • 12 Apr 2019 • Miquel Junyent, Anders Jonsson, Vicenç Gómez
Surprisingly, we observe that the representation learned by the neural network can be used as a feature space for the width-based planner without degrading its performance, thus removing the requirement of pre-defined features for the planner.
no code implementations • 7 Feb 2019 • Fabrizio Germano, Vicenç Gómez, Gaël Le Mens
Ranking algorithms play a crucial role in online platforms ranging from search engines to recommender systems.
2 code implementations • 15 Jan 2019 • Nikolaos Lykousas, Costantinos Patsakis, Andreas Kaltenbrunner, Vicenç Gómez
We present the Vent dataset, the largest annotated dataset of text, emotions, and social connections to date.
Social and Information Networks Human-Computer Interaction
1 code implementation • 27 Jun 2018 • Nikolaos Lykousas, Vicenç Gómez, Constantinos Patsakis
Social Live Stream Services (SLSS) exploit a new level of social interaction.
Social and Information Networks
no code implementations • 15 Jun 2018 • Miquel Junyent, Anders Jonsson, Vicenç Gómez
The planning step hinges on the Iterated-Width (IW) planner, a state of the art planner that makes explicit use of the state representation to perform structured exploration.
no code implementations • 22 May 2017 • Gergely Neu, Anders Jonsson, Vicenç Gómez
We propose a general framework for entropy-regularized average-reward reinforcement learning in Markov decision processes (MDPs).
no code implementations • 21 Feb 2017 • Gergely Neu, Vicenç Gómez
We study the problem of online learning in a class of Markov decision processes known as linearly solvable MDPs.
no code implementations • 10 Mar 2016 • Anders Jonsson, Vicenç Gómez
We present a hierarchical reinforcement learning framework that formulates each task in the hierarchy as a special type of Markov decision process for which the Bellman equation is linear and has analytical solution.
Hierarchical Reinforcement Learning reinforcement-learning +2