no code implementations • 9 Apr 2024 • Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou, Yanjun Qi
Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts.
no code implementations • 7 Dec 2023 • Zifan Xu, Haozhu Wang, Dmitriy Bespalov, Peter Stone, Yanjun Qi
Simultaneously, RSD learns a reasoning policy to determine the required reasoning skill for a given question.