A Dataset to Identify Manipulated Social Media News in Bangla

We construct a publicly available Bangla dataset of 800 news-related social media items that are annotated as manipulated or not relative to 500 reference news articles. We present a semi-automatic (use both human and LLM) method for generating such a dataset, which allows scalable dataset collection using annotators efficiently for languages with few available NLP tools.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages