1 code implementation • 10 Jan 2025 • Taywon Min, Haeone Lee, Yongchan Kwon, Kimin Lee
By quantifying the impact of human feedback on reward models, we believe that influence functions can enhance feedback interpretability and contribute to scalable oversight in RLHF, helping labelers provide more accurate and consistent feedback.
no code implementations • 25 Apr 2024 • Juyong Lee, Taywon Min, Minyong An, Dongyoon Hahm, Haeone Lee, Changyeon Kim, Kimin Lee
In this work, we introduce B-MoCA: a novel benchmark with interactive environments for evaluating and developing mobile device control agents.
1 code implementation • 24 Apr 2023 • Haeone Lee
In this paper, we propose ComGAN(ComparativeGAN) which allows the generator in GANs to refer to the semantics of comparative samples(e. g. real data) by comparison.