Search Results for author: Isabelle Ondracek

Found 2 papers, 2 papers with code

Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models

1 code implementation • 27 Mar 2024 • Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

Online user-generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment.

Domain Adaptation

Paper
Code

Moderating New Waves of Online Hate with Chain-of-Thought Reasoning in Large Language Models

1 code implementation • 22 Dec 2023 • Nishant Vishwamitra, Keyan Guo, Farhan Tajwar Romit, Isabelle Ondracek, Long Cheng, Ziming Zhao, Hongxin Hu

HATEGUARD further achieves prompt-based zero-shot detection by automatically generating and updating detection prompts with new derogatory terms and targets in new wave samples to effectively address new waves of online hate.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.