Search Results for author: Jian Tao

Found 4 papers, 3 papers with code

Exploration and Anti-Exploration with Distributional Random Network Distillation

1 code implementation • 18 Jan 2024 • Kai Yang, Jian Tao, Jiafei Lyu, Xiu Li

To address this issue, we introduce the Distributional RND (DRND), a derivative of the RND.

D4RL

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model

1 code implementation • 22 Nov 2023 • Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Jiaxin Chen, Qimai Li, Weihan Shen, Xiaolong Zhu, Xiu Li

The direct preference optimization (DPO) method, effective in fine-tuning large language models, eliminates the necessity for a reward model.

Denoising

Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data

1 code implementation • 9 Jul 2023 • Hieu Le, Hernan Santos, Jian Tao

Our model achieves a compression ratio of 140 on several benchmark data sets without compromising the reconstruction quality.
