Search Results for author: Dmitriy Bespalov

Found 2 papers, 0 papers with code

Towards Building a Robust Toxicity Predictor

no code implementations • 9 Apr 2024 • Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou, Yanjun Qi

Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts.

Adversarial Attack

Latent Skill Discovery for Chain-of-Thought Reasoning

no code implementations • 7 Dec 2023 • Zifan Xu, Haozhu Wang, Dmitriy Bespalov, Peter Stone, Yanjun Qi

Simultaneously, RSD learns a reasoning policy to determine the required reasoning skill for a given question.

Math
