Search Results for author: Tarek Naous

Found 9 papers, 5 papers with code

Empathy-driven Arabic Conversational Chatbot

1 code implementation COLING (WANLP) 2020 Tarek Naous, Christian Hokayem, Hazem Hajj

However, the dataset is not large enough to develop very complex encoder-decoder models.

Chatbot

Stanceosaurus 2.0: Classifying Stance Towards Russian and Spanish Misinformation

no code implementations6 Feb 2024 Anton Lavrouk, Ian Ligon, Tarek Naous, Jonathan Zheng, Alan Ritter, Wei Xu

The Stanceosaurus corpus (Zheng et al., 2022) was designed to provide high-quality, annotated, 5-way stance data extracted from Twitter, suitable for analyzing cross-cultural and cross-lingual misinformation.

Misinformation Stance Classification +1

Reducing Privacy Risks in Online Self-Disclosures with Language Models

no code implementations16 Nov 2023 Yao Dou, Isadora Krsek, Tarek Naous, Anubha Kabra, Sauvik Das, Alan Ritter, Wei Xu

Motivated by the user feedback, we introduce the task of self-disclosure abstraction, which is paraphrasing disclosures into less specific terms while preserving their utility, e. g., "Im 16F" to "I'm a teenage girl".

Language Modelling

Revisiting non-English Text Simplification: A Unified Multilingual Benchmark

1 code implementation25 May 2023 Michael J. Ryan, Tarek Naous, Wei Xu

However, less work has been done on multilingual text simplification due to the lack of a diverse evaluation benchmark that covers complex-simple sentence pairs in many languages.

Sentence Text Simplification +1

ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment

1 code implementation23 May 2023 Tarek Naous, Michael J. Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu

We present a systematic study and comprehensive evaluation of large language models for automatic multilingual readability assessment.

Benchmarking Cross-Lingual Transfer +1

Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

no code implementations23 May 2023 Tarek Naous, Michael J. Ryan, Alan Ritter, Wei Xu

In this paper, we show that multilingual and Arabic monolingual LMs exhibit bias towards entities associated with Western culture.

named-entity-recognition Named Entity Recognition +4

Stanceosaurus: Classifying Stance Towards Multilingual Misinformation

no code implementations28 Oct 2022 Jonathan Zheng, Ashutosh Baheti, Tarek Naous, Wei Xu, Alan Ritter

We present Stanceosaurus, a new corpus of 28, 033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims.

Domain Adaptation Fact Checking +1

Clustering Plotted Data by Image Segmentation

1 code implementation CVPR 2022 Tarek Naous, Srinjay Sarkar, Abubakar Abid, James Zou

We describe the method and compare it to ten other clustering methods on synthetic data to illustrate its advantages and disadvantages.

Clustering Image Segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.