Search Results for author: Wanying Wang

Found 2 papers, 0 papers with code

Revisiting Benchmark and Assessment: An Agent-based Exploratory Dynamic Evaluation Framework for LLMs

no code implementations • 15 Oct 2024 • Wanying Wang, Zeyu Ma, PengFei Liu, Mingang Chen

We propose an agent-based evaluation framework called TestAgent, which implements these concepts through retrieval augmented generation and reinforcement learning.
