Search Results for author: Chapman Siu

Found 8 papers, 3 papers with code

Online GentleAdaBoost -- Technical Report

no code implementations27 Aug 2023 Chapman Siu

We study the online variant of GentleAdaboost, where we combine a weak learner to a strong learner in an online fashion.

Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures

no code implementations19 Sep 2021 Chapman Siu, Jason Traish, Richard Yi Da Xu

We propose using regularization for Multi-Agent Reinforcement Learning rather than learning explicit cooperative structures called {\em Multi-Agent Regularized Q-learning} (MARQ).

Multi-agent Reinforcement Learning Q-Learning +2

Dual Behavior Regularized Reinforcement Learning

no code implementations19 Sep 2021 Chapman Siu, Jason Traish, Richard Yi Da Xu

We demonstrate the flexibility of this approach and how it can be adapted to online contexts where the environment is available to collect experiences and a variety of other contexts.

counterfactual reinforcement-learning +1

Residual Networks Behave Like Boosting Algorithms

no code implementations25 Sep 2019 Chapman Siu

We show that Residual Networks (ResNet) is equivalent to boosting feature representation, without any modification to the underlying ResNet training algorithm.

TreeGrad: Transferring Tree Ensembles to Neural Networks

1 code implementation25 Apr 2019 Chapman Siu

Gradient Boosting Decision Tree (GBDT) are popular machine learning algorithms with implementations such as LightGBM and in popular machine learning toolkits like Scikit-Learn.

BIG-bench Machine Learning Neural Architecture Search

Automatic Induction of Neural Network Decision Tree Algorithms

1 code implementation26 Nov 2018 Chapman Siu

This work presents an approach to automatically induction for non-greedy decision trees constructed from neural network architecture.

Diverse Online Feature Selection

1 code implementation12 Jun 2018 Chapman Siu, Richard Yi Da Xu

The framework aims to promote diversity based on the kernel produced on a feature level, through at most three stages: feature sampling, local criteria and global criteria for feature selection.

feature selection Point Processes

Cannot find the paper you are looking for? You can Submit a new open access paper.