1 code implementation • 27 May 2023 • Deokjae Lee, JunYeong Lee, Jung-Woo Ha, Jin-Hwa Kim, Sang-Woo Lee, Hwaran Lee, Hyun Oh Song
To this end, we propose Bayesian red teaming (BRT), novel query-efficient black-box red teaming methods based on Bayesian optimization, which iteratively identify diverse positive test cases leading to model failures by utilizing the pre-defined user input pool and the past evaluations.
1 code implementation • 18 Oct 2022 • Seungyong Moon, JunYeong Lee, Hyun Oh Song
Our work focuses on training RL agents on multiple visually diverse environments to improve observational generalization performance.