Search Results for author: Zerong Xi

Found 3 papers, 1 paper with code

Leveraging the Variance of Return Sequences for Exploration Policy

no code implementations · 17 Nov 2020 · Zerong Xi, Gita Sukthankar

We demonstrate that the variance of the return sequence for a specific state-action pair is an important information source that can be leveraged to guide exploration in reinforcement learning.

Atari Games, reinforcement-learning, +1 more

metricDTW: local distance metric learning in Dynamic Time Warping

no code implementations · 11 Jun 2016 · Jiaping Zhao, Zerong Xi, Laurent Itti

We propose to learn multiple local Mahalanobis distance metrics to perform k-nearest neighbor (kNN) classification of temporal sequences.

Dynamic Time Warping, General Classification, +4 more
