Search Results for author: Yehong Zhang

Found 13 papers, 3 papers with code

COPR: Continual Human Preference Learning via Optimal Policy Regularization

no code implementations22 Feb 2024 Han Zhang, Lin Gui, Yu Lei, Yuanzhao Zhai, Yehong Zhang, Yulan He, Hui Wang, Yue Yu, Kam-Fai Wong, Bin Liang, Ruifeng Xu

Reinforcement Learning from Human Feedback (RLHF) is commonly utilized to improve the alignment of Large Language Models (LLMs) with human preferences.

Continual Learning

SecFormer: Towards Fast and Accurate Privacy-Preserving Inference for Large Language Models

1 code implementation1 Jan 2024 Jinglong Luo, Yehong Zhang, Zhuo Zhang, JiaQi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu

However, the application of SMPC in Privacy-Preserving Inference (PPI) for large language models, particularly those based on the Transformer architecture, often leads to considerable slowdowns or declines in performance.

Knowledge Distillation Privacy Preserving

EncryIP: A Practical Encryption-Based Framework for Model Intellectual Property Protection

no code implementations19 Dec 2023 Xin Mu, Yu Wang, Zhengan Huang, Junzuo Lai, Yehong Zhang, Hui Wang, Yue Yu

In the rapidly growing digital economy, protecting intellectual property (IP) associated with digital products has become increasingly important.

Model Provenance via Model DNA

no code implementations4 Aug 2023 Xin Mu, Yu Wang, Yehong Zhang, JiaQi Zhang, Hui Wang, Yang Xiang, Yue Yu

Understanding the life cycle of the machine learning (ML) model is an intriguing area of research (e. g., understanding where the model comes from, how it is trained, and how it is used).

Representation Learning

Dependency Structure Search Bayesian Optimization for Decision Making Models

no code implementations1 Aug 2023 Mohit Rajpal, Lac Gia Tran, Yehong Zhang, Bryan Kian Hsiang Low

Derivative-free approaches such as Bayesian Optimization mitigate the dependency on the quality of gradient feedback, but are known to scale poorly in the high-dimension setting of complex decision making models.

Bayesian Optimization Decision Making

Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing

no code implementations26 Jun 2023 Jinglong Luo, Yehong Zhang, JiaQi Zhang, Shuang Qin, Hui Wang, Yue Yu, Zenglin Xu

In contrast to existing studies that protect the data privacy of GPR via homomorphic encryption, differential privacy, or federated learning, our proposed method is more practical and can be used to preserve the data privacy of both the model inputs and outputs for various data-sharing scenarios (e. g., horizontally/vertically-partitioned data).

Federated Learning GPR +2

Balancing training time vs. performance with Bayesian Early Pruning

no code implementations1 Jan 2021 Mohit Rajpal, Yehong Zhang, Bryan Kian Hsiang Low

Pruning is an approach to alleviate overparameterization of deep neural networks (DNN) by zeroing out or pruning DNN elements with little to no efficacy at a given task.

Computational Efficiency

Scalable Variational Bayesian Kernel Selection for Sparse Gaussian Process Regression

no code implementations5 Dec 2019 Tong Teng, Jie Chen, Yehong Zhang, Kian Hsiang Low

To achieve this, we represent the probabilistic kernel as an additional variational variable in a variational inference (VI) framework for SGPR models where its posterior belief is learned together with that of the other variational variables (i. e., inducing variables and kernel hyperparameters).

regression Stochastic Optimization +1

Bayesian Optimization with Binary Auxiliary Information

no code implementations17 Jun 2019 Yehong Zhang, Zhongxiang Dai, Kian Hsiang Low

This paper presents novel mixed-type Bayesian optimization (BO) algorithms to accelerate the optimization of a target objective function by exploiting correlated auxiliary information of binary type that can be more cheaply obtained, such as in policy search for reinforcement learning and hyperparameter tuning of machine learning models with early stopping.

Bayesian Optimization Reinforcement Learning +1

Near-Optimal Active Learning of Multi-Output Gaussian Processes

1 code implementation21 Nov 2015 Yehong Zhang, Trong Nghia Hoang, Kian Hsiang Low, Mohan Kankanhalli

This paper addresses the problem of active learning of a multi-output Gaussian process (MOGP) model representing multiple types of coexisting correlated environmental phenomena.

Active Learning Gaussian Processes

Cannot find the paper you are looking for? You can Submit a new open access paper.