Search Results for author: Nan Ye

Found 17 papers, 11 papers with code

DESPOT: Online POMDP Planning with Regularization

1 code implementation NeurIPS 2013 Nan Ye, Adhiraj Somani, David Hsu, Wee Sun Lee

We show that the best policy obtained from a DESPOT is near-optimal, with a regret bound that depends on the representation size of the optimal policy.

Autonomous Driving

Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

1 code implementation ICLR 2020 Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

The particle filter maintains a belief using learned discriminative update, which is trained end-to-end for decision making.

Atari Games Decision Making +3

Nesterov Acceleration of Alternating Least Squares for Canonical Tensor Decomposition: Momentum Step Size Selection and Restart Mechanisms

1 code implementation13 Oct 2018 Drew Mitchell, Nan Ye, Hans De Sterck

While Nesterov acceleration turns gradient descent into an optimal first-order method for convex problems by adding a momentum term with a specific weight sequence, a direct application of this method and weight sequence to ALS results in erratic convergence behaviour.

Tensor Decomposition

Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and Algorithms

1 code implementation1 Dec 2020 Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

This improves the previous heuristic derivation of the MaxEnt IRL model (for stochastic MDPs), allows a unified view of MaxEnt IRL and Relative Entropy IRL, and leads to a model-free learning algorithm for the MaxEnt IRL model.

OpenAI Gym reinforcement-learning +1

Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

1 code implementation16 Oct 2022 Jonathan Wilton, Abigail M. Y. Koay, Ryan K. L. Ko, Miao Xu, Nan Ye

Key to our approach is a new interpretation of decision tree algorithms for positive and negative data as \emph{recursive greedy risk minimization algorithms}.

Feature Importance Weakly Supervised Classification

Adaptive Discretization using Voronoi Trees for Continuous-Action POMDPs

1 code implementation13 Sep 2022 Marcus Hoerger, Hanna Kurniawati, Dirk Kroese, Nan Ye

A Voronoi tree is a Binary Space Partitioning (BSP) that implicitly maintains the partition of a cell as the Voronoi diagram of two points sampled from the cell.

Adaptive Discretization using Voronoi Trees for Continuous POMDPs

1 code implementation21 Feb 2023 Marcus Hoerger, Hanna Kurniawati, Dirk Kroese, Nan Ye

ADVT uses the estimated diameters of the cells to form an upper-confidence bound on the action value function within the cell, guiding the Monte Carlo Tree Search expansion and further discretization of the action space.

Fast Controllable Diffusion Models for Undersampled MRI Reconstruction

1 code implementation20 Nov 2023 Wei Jiang, Zhuang Xiong, Feng Liu, Nan Ye, Hongfu Sun

Supervised deep learning methods have shown promise in undersampled Magnetic Resonance Imaging (MRI) reconstruction, but their requirement for paired data limits their generalizability to the diverse MRI acquisition parameters.

MRI Reconstruction

Robust Loss Functions for Training Decision Trees with Noisy Labels

1 code implementation20 Dec 2023 Jonathan Wilton, Nan Ye

We consider training decision trees using noisily labeled data, focusing on loss functions that can lead to robust learning algorithms.

Learning with noisy labels

Greedy Convex Ensemble

1 code implementation9 Oct 2019 Tan Nguyen, Nan Ye, Peter L. Bartlett

Theoretically, we first consider whether we can use linear, instead of convex, combinations, and obtain generalization results similar to existing ones for learning from a convex hull.

Conditional Random Fields with High-Order Features for Sequence Labeling

no code implementations NeurIPS 2009 Nan Ye, Wee S. Lee, Hai L. Chieu, Dan Wu

Dependencies among neighbouring labels in a sequence is an important source of information for sequence labeling problems.

Vocal Bursts Intensity Prediction

Tensor Belief Propagation

no code implementations ICML 2017 Andrew Wrigley, Wee Sun Lee, Nan Ye

We propose a new approximate inference algorithm for graphical models, tensor belief propagation, based on approximating the messages passed in the junction tree algorithm.

LiMIIRL: Lightweight Multiple-Intent Inverse Reinforcement Learning

no code implementations3 Jun 2021 Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

Multiple-Intent Inverse Reinforcement Learning (MI-IRL) seeks to find a reward function ensemble to rationalize demonstrations of different but unlabelled intents.

Clustering reinforcement-learning +1

A Boosting Algorithm for Positive-Unlabeled Learning

no code implementations19 May 2022 Yawen Zhao, Mingzhe Zhang, Chenhao Zhang, Weitong Chen, Nan Ye, Miao Xu

This is because AdaPU learns a weak classifier and its weight using a weighted positive-negative (PN) dataset with some negative data weights $-$ the dataset is derived from the original PU data, and the data weights are determined by the current weighted classifier combination, but some data weights are negative.

Action Detection Activity Detection +1

A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search Over Policy Trees

1 code implementation14 May 2023 Marcus Hoerger, Hanna Kurniawati, Dirk Kroese, Nan Ye

At each planning step, our method uses a novel lazy Cross-Entropy method to search the space of policy trees, which provide a simple policy representation.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.