Search Results for author: Xiyu Hu

Found 1 paper, 1 paper with code

ScreenAgent: A Vision Language Model-driven Computer Control Agent

1 code implementation • 9 Feb 2024 • Runliang Niu, Jindong Li, Shiqi Wang, Yali Fu, Xiyu Hu, Xueyuan Leng, He Kong, Yi Chang, Qi Wang

Additionally, we construct the ScreenAgent Dataset, which collects screenshots and action sequences when completing a variety of daily computer tasks.

Language Modelling
