Search Results for author: Ruitong Huang

Found 12 papers, 4 papers with code

Optimistic and Adaptive Lagrangian Hedging

no code implementations • 23 Jan 2021 • Ryan D'Orazio, Ruitong Huang

The generality of this framework includes problems that are not adversarial, for example offline optimization, or saddle point problems (i. e. min max optimization).

Paper
Add Code

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

1 code implementation • 15 Nov 2020 • Zihan Ding, Pablo Hernandez-Leal, Gavin Weiguang Ding, Changjian Li, Ruitong Huang

As a second contribution our study reveals limitations of explaining black-box policies via imitation learning with tree-based explainable models, due to its inherent instability.

Explainable Models Imitation Learning +3

Paper
Code

Maximum Entropy Monte-Carlo Planning

no code implementations • NeurIPS 2019 • Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller

We then extend this approach to general sequential decision making by developing a general MCTS algorithm, Maximum Entropy for Tree Search (MENTS).

Atari Games Decision Making

Paper
Add Code

On the Sensitivity of Adversarial Robustness to Input Data Distributions

no code implementations • ICLR 2019 • Gavin Weiguang Ding, Kry Yik Chau Lui, Xiaomeng Jin, Luyu Wang, Ruitong Huang

Even a semantics-preserving transformations on the input data distribution can cause a significantly different robustness for the adversarial trained model that is both trained and evaluated on the new distribution.

Adversarial Robustness

Paper
Add Code

MMA Training: Direct Input Space Margin Maximization through Adversarial Training

1 code implementation • ICLR 2020 • Gavin Weiguang Ding, Yash Sharma, Kry Yik Chau Lui, Ruitong Huang

We study adversarial robustness of neural networks from a margin maximization perspective, where margins are defined as the distances from inputs to a classifier's decision boundary.

Adversarial Defense Adversarial Robustness

Paper
Code

Few-Shot Self Reminder to Overcome Catastrophic Forgetting

no code implementations • 3 Dec 2018 • Junfeng Wen, Yanshuai Cao, Ruitong Huang

We demonstrate the superiority of our method to the previous ones in two different continual learning settings on popular benchmarks, as well as a new continual learning problem where tasks are designed to be more dissimilar.

Continual Learning

Paper
Add Code

Dimensionality Reduction has Quantifiable Imperfections: Two Geometric Bounds

no code implementations • NeurIPS 2018 • Kry Yik Chau Lui, Gavin Weiguang Ding, Ruitong Huang, Robert J. McCann

In this paper, we investigate Dimensionality reduction (DR) maps in an information retrieval setting from a quantitative topology point of view.

Dimensionality Reduction Information Retrieval +2

Paper
Add Code

Improving GAN Training via Binarized Representation Entropy (BRE) Regularization

1 code implementation • ICLR 2018 • Yanshuai Cao, Gavin Weiguang Ding, Kry Yik-Chau Lui, Ruitong Huang

We propose a novel regularizer to improve the training of Generative Adversarial Networks (GANs).

General Classification

Paper
Code

Structured Best Arm Identification with Fixed Confidence

no code implementations • 16 Jun 2017 • Ruitong Huang, Mohammad M. Ajallooeian, Csaba Szepesvári, Martin Müller

We study the problem of identifying the best action among a set of possible options when the value of each action is given by a mapping from a number of noisy micro-observables in the so-called fixed confidence setting.

Paper
Add Code

Following the Leader and Fast Rates in Linear Prediction: Curved Constraint Sets and Other Regularities

no code implementations • NeurIPS 2016 • Ruitong Huang, Tor Lattimore, András György, Csaba Szepesvári

The follow the leader (FTL) algorithm, perhaps the simplest of all online learning algorithms, is known to perform well when the loss functions it is used on are convex and positively curved.

Paper
Add Code

Revise Saturated Activation Functions

no code implementations • 18 Feb 2016 • Bing Xu, Ruitong Huang, Mu Li

In this paper, we revise two commonly used saturated functions, the logistic sigmoid and the hyperbolic tangent (tanh).

Paper
Add Code

Learning with a Strong Adversary

1 code implementation • 10 Nov 2015 • Ruitong Huang, Bing Xu, Dale Schuurmans, Csaba Szepesvari

The robustness of neural networks to intended perturbations has recently attracted significant attention.

General Classification

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.