Search Results for author: Bo Wahlberg

Found 20 papers, 0 papers with code

Fast and Consistent Learning of Hidden Markov Models by Incorporating Non-Consecutive Correlations

no code implementations ICML 2020 Robert Mattila, Cristian Rojas, Eric Moulines, Vikram Krishnamurthy, Bo Wahlberg

Can the parameters of a hidden Markov model (HMM) be estimated from a single sweep through the observations -- and additionally, without being trapped at a local optimum in the likelihood surface?

Time Series Time Series Analysis

A Biologically-Inspired Computational Model of Time Perception

no code implementations7 Nov 2023 Inês Lourenço, Robert Mattila, Rodrigo Ventura, Bo Wahlberg

We conclude that the agent is able to perceive time similarly to animals when it comes to their intrinsic mechanisms of interpreting time and performing time-aware actions.

Decision Making

Optimal Transport for Correctional Learning

no code implementations4 Apr 2023 Rebecka Winqvist, Inês Lourenco, Francesco Quinzan, Cristian R. Rojas, Bo Wahlberg

In this framework, an expert agent, referred to as the teacher, modifies the data used by a learning agent, known as the student, to improve its estimation process.

Prediction-Based Leader-Follower Rendezvous Model Predictive Control with Robustness to Communication Losses

no code implementations3 Apr 2023 Dženan Lapandić, Christos K. Verginis, Dimos V. Dimarogonas, Bo Wahlberg

In this paper we propose a novel distributed model predictive control (DMPC) based algorithm with a trajectory predictor for a scenario of landing of unmanned aerial vehicles (UAVs) on a moving unmanned surface vehicle (USV).

Model Predictive Control

Robust Trajectory Tracking for Underactuated Quadrotors with Prescribed Performance

no code implementations13 Jun 2022 Dženan Lapandić, Christos K. Verginis, Dimos V. Dimarogonas, Bo Wahlberg

We propose a control protocol based on the prescribed performance control (PPC) methodology for a quadrotor unmanned aerial vehicle (UAV).

A Teacher-Student Markov Decision Process-based Framework for Online Correctional Learning

no code implementations15 Nov 2021 Inês Lourenço, Rebecka Winqvist, Cristian R. Rojas, Bo Wahlberg

A classical learning setting typically concerns an agent/student who collects data, or observations, from a system in order to estimate a certain property of interest.

Aperiodic Communication for MPC in Autonomous Cooperative Landing

no code implementations15 Feb 2021 Dženan Lapandić, Linnea Persson, Dimos V. Dimarogonas, Bo Wahlberg

The main contribution is a rendezvous algorithm with an online update rule of the rendezvous location.

Model Predictive Control

Learning Models of Model Predictive Controllers using Gradient Data

no code implementations3 Feb 2021 Rebecka Winqvist, Arun Venkitaraman, Bo Wahlberg

As a proof of concept, we apply this approach to explicit MPC (eMPC), for which the feedback law is a piece-wise affine function of the state, but the number of pieces grows rapidly with the state dimension.

Experimental Design

Cooperative System Identification via Correctional Learning

no code implementations9 Dec 2020 Inês Lourenço, Robert Mattila, Cristian R. Rojas, Bo Wahlberg

We consider a cooperative system identification scenario in which an expert agent (teacher) knows a correct, or at least a good, model of the system and aims to assist a learner-agent (student), but cannot directly transfer its knowledge to the student.

Learning the Step-size Policy for the Limited-Memory Broyden-Fletcher-Goldfarb-Shanno Algorithm

no code implementations3 Oct 2020 Lucas N. Egidio, Anders Hansson, Bo Wahlberg

The step-length policy is learned from data of similar optimization problems, avoids additional evaluations of the objective function, and guarantees that the output step remains inside a pre-defined interval.

Stochastic Optimization

Task-similarity Aware Meta-learning through Nonparametric Kernel Regression

no code implementations12 Jun 2020 Arun Venkitaraman, Anders Hansson, Bo Wahlberg

Our hypothesis is that the use of tasksimilarity helps meta-learning when the available tasks are limited and may contain outlier/ dissimilar tasks.

Meta-Learning regression

On Training and Evaluation of Neural Network Approaches for Model Predictive Control

no code implementations8 May 2020 Rebecka Winqvist, Arun Venkitaraman, Bo Wahlberg

The contribution of this paper is a framework for training and evaluation of Model Predictive Control (MPC) implemented using constrained neural networks.

Model Predictive Control

Teaching robots to perceive time -- A reinforcement learning approach (Extended version)

no code implementations20 Dec 2019 Inês Lourenço, Bo Wahlberg, Rodrigo Ventura

In this paper, we study how to replicate neural mechanisms involved in time perception, allowing robots to take a step towards temporal cognition.

Gaussian Processes reinforcement-learning +1

Learning sparse linear dynamic networks in a hyper-parameter free setting

no code implementations26 Nov 2019 Arun Venkitaraman, Håkan Hjalmarsson, Bo Wahlberg

We address the issue of estimating the topology and dynamics of sparse linear dynamic networks in a hyperparameter-free setting.

Recursive Prediction of Graph Signals with Incoming Nodes

no code implementations26 Nov 2019 Arun Venkitaraman, Saikat Chatterjee, Bo Wahlberg

Kernel and linear regression have been recently explored in the prediction of graph signals as the output, given arbitrary input signals that are agnostic to the graph.

regression

Inverse Filtering for Hidden Markov Models

no code implementations NeurIPS 2017 Robert Mattila, Cristian Rojas, Vikram Krishnamurthy, Bo Wahlberg

This paper considers a number of related inverse filtering problems for hidden Markov models (HMMs).

How to monitor and mitigate stair-casing in l1 trend filtering

no code implementations1 Dec 2014 Cristian R. Rojas, Bo Wahlberg

It is known that TV denoising suffers from the so-called stair-case effect, which leads to detecting false change points.

Denoising Time Series +1

Approximate Regularization Path for Nuclear Norm Based H2 Model Reduction

no code implementations22 Jul 2014 Niclas Blomberg, Cristian R. Rojas, Bo Wahlberg

This paper concerns model reduction of dynamical systems using the nuclear norm of the Hankel matrix to make a trade-off between model fit and model complexity.

On change point detection using the fused lasso method

no code implementations21 Jan 2014 Cristian R. Rojas, Bo Wahlberg

In this paper we analyze the asymptotic properties of l1 penalized maximum likelihood estimation of signals with piece-wise constant mean values and/or variances.

Change Point Detection Denoising +2

Cannot find the paper you are looking for? You can Submit a new open access paper.