Search Results for author: Tharindu Kumarage

Found 11 papers, 7 papers with code

Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

no code implementations • 14 Nov 2023 • Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

The contemporary LLMs are prone to producing hallucinations, stemming mainly from the knowledge gaps within the models.

Knowledge Graphs

How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts

no code implementations • 8 Oct 2023 • Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu

The novel universal evasive prompt is achieved in two steps: First, we create an evasive soft prompt tailored to a specific PLM through prompt tuning; and then, we leverage the transferability of soft prompts to transfer the learned evasive soft prompt from one PLM to another.

ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

1 code implementation • 7 Sep 2023 • Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu

Given the potential malicious nature in which these LLMs can be used to generate disinformation at scale, it is important to build effective detectors for such AI-generated text.

Contrastive Learning • Text Detection • +1

Neural Authorship Attribution: Stylometric Analysis on Large Language Models

1 code implementation • 14 Aug 2023 • Tharindu Kumarage, Huan Liu

Large language models (LLMs) such as GPT-4, PaLM, and Llama have significantly propelled the generation of AI-crafted text.

Authorship Attribution • Language Modelling • +1

Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

1 code implementation • 3 Aug 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

By disentangling input into platform-dependent features (useful for predicting hate targets) and platform-independent features (used to predict the presence of hate), we learn invariant representations resistant to distribution shifts.

Disentanglement • Hate Speech Detection

PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework

1 code implementation • 15 Jun 2023 • Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Hate speech detection refers to the task of detecting hateful content that aims at denigrating an individual or a group based on their religion, gender, sexual orientation, or other characteristics.

Hate Speech Detection

Stylometric Detection of AI-Generated Text in Twitter Timelines

1 code implementation • 7 Mar 2023 • Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu

However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline.

Language Modelling • Misinformation • +1

Towards Detecting Harmful Agendas in News Articles

1 code implementation • 31 Jan 2023 • Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown

Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread.

Misinformation
