Search Results for author: Jian Tao

Exploration and Anti-Exploration with Distributional Random Network Distillation

To address this issue, we introduce the Distributional RND (DRND), a derivative of the RND.

Paper
Code

The direct preference optimization (DPO) method, effective in fine-tuning large language models, eliminates the necessity for a reward model.

111

Paper
Code

Our model achieves a compression ratio of 140 on several benchmark data sets without compromising the reconstruction quality.

Paper
Code

Magnetic resonance imaging (MRI) is one of the noninvasive imaging modalities that can produce high-quality images.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.