Search Results for author: Huanchen Zhang

Found 6 papers, 4 papers with code

UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis

1 code implementation21 Jun 2024 Yulong Hui, Yao Lu, Huanchen Zhang

The use of Retrieval-Augmented Generation (RAG) has improved Large Language Models (LLMs) in collaborating with external data, yet significant challenges exist in real-world scenarios.

Question Answering RAG +1

ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation

1 code implementation20 Jun 2024 Zhiyu Mei, Wei Fu, Kaiwei Li, Guangju Wang, Huanchen Zhang, Yi Wu

Based on this formulation, ReaLHF employs a tailored search algorithm with a lightweight cost estimator to discover an efficient execution plan.

Language Modelling Large Language Model

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

2 code implementations29 Jun 2023 Zhiyu Mei, Wei Fu, Jiaxuan Gao, Guangju Wang, Huanchen Zhang, Yi Wu

Following this abstraction, we develop a scalable, efficient, and extensible distributed RL system called ReaLlyScalableRL, which allows efficient and massively parallelized training and easy development of customized algorithms.

reinforcement-learning Reinforcement Learning +1

LeCo: Lightweight Compression via Learning Serial Correlations

no code implementations27 Jun 2023 Yihao Liu, Xinyu Zeng, Huanchen Zhang

Lightweight data compression is a key technique that allows column stores to exhibit superior performance for analytical queries.

Data Compression

Proteus: A Self-Designing Range Filter

2 code implementations30 Jun 2022 Eric R. Knorr, Baptiste Lemaire, Andrew Lim, Siqiang Luo, Huanchen Zhang, Stratos Idreos, Michael Mitzenmacher

We introduce Proteus, a novel self-designing approximate range filter, which configures itself based on sampled data in order to optimize its false positive rate (FPR) for a given space requirement.

Cannot find the paper you are looking for? You can Submit a new open access paper.