Search Results for author: Site Bai

Found 5 papers, 1 papers with code

Federated Composite Saddle Point Optimization

no code implementations25 May 2023 Site Bai, Brian Bullins

Federated learning (FL) approaches for saddle point problems (SPP) have recently gained in popularity due to the critical role they play in machine learning (ML).

Federated Learning

Dual Convexified Convolutional Neural Networks

no code implementations27 May 2022 Site Bai, Chuyang Ke, Jean Honorio

To overcome this, we propose a highly novel weight recovery algorithm, which takes the dual solution and the kernel information as the input, and recovers the linear weight and the output of convolutional layer, instead of weight parameter.

Hindsight Trust Region Policy Optimization

1 code implementation29 Jul 2019 Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng

We propose \emph{Hindsight Trust Region Policy Optimization}(HTRPO), a new RL algorithm that extends the highly successful TRPO algorithm with \emph{hindsight} to tackle the challenge of sparse rewards.

Atari Games Policy Gradient Methods +1

ROI-based Robotic Grasp Detection for Object Overlapping Scenes

no code implementations30 Aug 2018 Hanbo Zhang, Xuguang Lan, Site Bai, Xinwen Zhou, Zhiqiang Tian, Nanning Zheng

Experimental results demonstrate that ROI-GD performs much better in object overlapping scenes and at the meantime, remains comparable with state-of-the-art grasp detection algorithms on Cornell Grasp Dataset and Jacquard Dataset.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.