Search Results for author: Dongyoon Hahm

Found 2 papers, 1 paper with code

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control

1 code implementation • 23 Oct 2024 • Juyong Lee, Dongyoon Hahm, June Suk Choi, W. Bradley Knox, Kimin Lee

In this work, we introduce MobileSafetyBench, a benchmark designed to evaluate the safety of device-control agents within a realistic mobile environment based on Android emulators.

Benchmarking Mobile Device Control Agents across Diverse Configurations

no code implementations • 25 Apr 2024 • Juyong Lee, Taywon Min, Minyong An, Dongyoon Hahm, Haeone Lee, Changyeon Kim, Kimin Lee

In this work, we introduce B-MoCA: a novel benchmark with interactive environments for evaluating and developing mobile device control agents.

Benchmarking, Imitation Learning
