Search Results for author: Ole J. Mengshoel

Found 10 papers, 1 papers with code

Influence Diagram Bandits

no code implementations ICML 2020 Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel

We experiment with three structured bandit problems: cascading bandits, online learning to rank in the position-based model, and rank-1 bandits.

Learning-To-Rank Position

Customizing Graph Neural Networks using Path Reweighting

2 code implementations21 Jun 2021 Jianpeng Chen, Yujing Wang, Ming Zeng, Zongyi Xiang, Bitan Hou, Yunhai Tong, Ole J. Mengshoel, Yazhou Ren

Specifically, the proposed CustomGNN can automatically learn the high-level semantics for specific downstream tasks to highlight semantically relevant paths as well to filter out task-irrelevant noises in a graph.

Data Augmentation Graph Attention +1

Adaptive Stress Testing: Finding Likely Failure Events with Reinforcement Learning

no code implementations6 Nov 2018 Ritchie Lee, Ole J. Mengshoel, Anshu Saksena, Ryan Gardner, Daniel Genin, Joshua Silbermann, Michael Owen, Mykel J. Kochenderfer

Finding the most likely path to a set of failure states is important to the analysis of safety-critical systems that operate over a sequence of time steps, such as aircraft collision avoidance systems and autonomous cars.

Autonomous Driving Collision Avoidance +2

Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models

no code implementations22 Nov 2017 Bing Liu, Tong Yu, Ian Lane, Ole J. Mengshoel

Moreover, we report encouraging response selection performance of the proposed neural bandit model using the Recall@k metric for a small set of online training samples.

Multi-Armed Bandits Response Generation +2

Interpretable Categorization of Heterogeneous Time Series Data

no code implementations30 Aug 2017 Ritchie Lee, Mykel J. Kochenderfer, Ole J. Mengshoel, Joshua Silbermann

In particular, when a grammar based on temporal logic is used, we show that GBDTs can be used for the interpretable classi cation of high-dimensional and heterogeneous time series data.

Clustering Collision Avoidance +2

Cannot find the paper you are looking for? You can Submit a new open access paper.