Search Results for author: Hao Jin

Found 9 papers, 3 papers with code

Federated Control in Markov Decision Processes

no code implementations7 May 2024 Hao Jin, Yang Peng, Liangyu Zhang, Zhihua Zhang

In face of the difference among restricted regions, we firstly introduce concepts of leakage probabilities to understand how such heterogeneity affects the learning process, and then propose a novel communication protocol that we call Federated-Q protocol (FedQ), which periodically aggregates agents' knowledge of their restricted regions and accordingly modifies their learning problems for further training.

Federated Reinforcement Learning with Constraint Heterogeneity

no code implementations6 May 2024 Hao Jin, Liangyu Zhang, Zhihua Zhang

In our setting, we aim to solve a reinforcement learning problem with multiple constraints while $N$ training agents are located in $N$ different environments with limited access to the constraint signals and they are expected to collaboratively learn a policy satisfying all constraint signals.

Language Modelling Large Language Model +2

Federated Reinforcement Learning with Environment Heterogeneity

1 code implementation6 Apr 2022 Hao Jin, Yang Peng, Wenhao Yang, Shusen Wang, Zhihua Zhang

We study a Federated Reinforcement Learning (FedRL) problem in which $n$ agents collaboratively learn a single policy without sharing the trajectories they collected during agent-environment interaction.

reinforcement-learning Reinforcement Learning (RL)

Meta-Regularization: An Approach to Adaptive Choice of the Learning Rate in Gradient Descent

no code implementations12 Apr 2021 Guangzeng Xie, Hao Jin, Dachao Lin, Zhihua Zhang

We propose \textit{Meta-Regularization}, a novel approach for the adaptive choice of the learning rate in first-order gradient descent methods.

Positive Seebeck coefficient in highly doped La$_{2-x}$Sr$_x$CuO$_4$ ($x$=0.33); its origin and implication

no code implementations26 Jan 2021 Hao Jin, Alessandro Narduzzo, Minoru Nohara, Hidenori Takagi, Nigel Hussey, Kamran Behnia

We present a study of the thermoelectric (Seebeck and Nernst) response in heavily overdoped, non-superconducting La$_{1. 67}$Sr$_{0. 33}$CuO$_4$.

Superconductivity Materials Science Strongly Correlated Electrons

Natural Language Adversarial Defense through Synonym Encoding

1 code implementation15 Sep 2019 Xiaosen Wang, Hao Jin, Yichen Yang, Kun He

In the area of natural language processing, deep learning models are recently known to be vulnerable to various types of adversarial perturbations, but relatively few works are done on the defense side.

Adversarial Attack Adversarial Defense

Towards Better Generalization: BP-SVRG in Training Deep Neural Networks

no code implementations18 Aug 2019 Hao Jin, Dachao Lin, Zhihua Zhang

Stochastic variance-reduced gradient (SVRG) is a classical optimization method.

Hyper-Regularization: An Adaptive Choice for the Learning Rate in Gradient Descent

no code implementations ICLR 2019 Guangzeng Xie, Hao Jin, Dachao Lin, Zhihua Zhang

Specifically, we impose a regularization term on the learning rate via a generalized distance, and cast the joint updating process of the parameter and the learning rate into a maxmin problem.

Practical Verifiable In-network Filtering for DDoS defense

1 code implementation4 Jan 2019 Deli Gong, Muoi Tran, Shweta Shinde, Hao Jin, Vyas Sekar, Prateek Saxena, Min Suk Kang

In this paper, we show the technical feasibility of verifiable in-network filtering, called VIF, that offers filtering verifiability to DDoS victims and neighbor ASes.

Cryptography and Security

Cannot find the paper you are looking for? You can Submit a new open access paper.