Search Results for author: Xingyu Han

Found 1 papers, 1 papers with code

Benchmarking Data Science Agents

1 code implementation27 Feb 2024 Yuge Zhang, Qiyang Jiang, Xingyu Han, Nan Chen, Yuqing Yang, Kan Ren

In this paper, we introduce DSEval -- a novel evaluation paradigm, as well as a series of innovative benchmarks tailored for assessing the performance of these agents throughout the entire data science lifecycle.

Benchmarking Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.