no code implementations • NAACL (ACL) 2022 • Shiau Hong Lim, Laura Wynter
We present a system for document retrieval that combines direct classification with standard content-based retrieval approaches to significantly improve the relevance of the retrieved documents.
no code implementations • 27 Feb 2022 • Supriyo Ghosh, Laura Wynter, Shiau Hong Lim, Duc Thien Nguyen
We propose a framework, called neural-progressive hedging (NP), that leverages stochastic programming during the online phase of executing a reinforcement learning (RL) policy.
1 code implementation • 14 Oct 2021 • Fabian Lim, Laura Wynter, Shiau Hong Lim
Optimal transport is a framework for comparing measures whereby a cost is incurred for transporting one measure to another.
no code implementations • 18 Feb 2021 • Desmond Cai, Shiau Hong Lim, Laura Wynter
One of the main challenges in real-world reinforcement learning is to learn successfully from limited training samples.
no code implementations • 1 Jan 2021 • Shiau Hong Lim, Ilyas Malik
We address the problem of learning a risk-sensitive policy based on the CVaR risk measure using distributional reinforcement learning.
Distributional Reinforcement Learning reinforcement-learning +1
no code implementations • 14 Sep 2020 • Achintya Kundu, Pengqian Yu, Laura Wynter, Shiau Hong Lim
We present a class of methods for robust, personalized federated learning, called Fed+, that unifies many federated learning algorithms.
no code implementations • 1 Jun 2020 • Desmond Cai, Duc Thien Nguyen, Shiau Hong Lim, Laura Wynter
Crowdsourcing has emerged as an effective means for performing a number of machine learning tasks such as annotation and labelling of images and other data sets.
no code implementations • 3 Apr 2020 • Supriyo Ghosh, Sean Laguna, Shiau Hong Lim, Laura Wynter, Hasan Poonawala
Air traffic control is an example of a highly challenging operational problem that is readily amenable to human expertise augmentation via decision support technologies.
no code implementations • NeurIPS 2014 • Shiau Hong Lim, Yudong Chen, Huan Xu
Our theoretical results cover and subsume a wide range of existing graph clustering results including planted partition, weighted clustering and partially observed graphs.
no code implementations • NeurIPS 2013 • Shiau Hong Lim, Huan Xu, Shie Mannor
An important challenge in Markov decision processes is to ensure robustness with respect to unexpected or adversarial system behavior while taking advantage of well-behaving parts of the system.