no code implementations • EMNLP 2020 • Adaku Uchendu, Thai Le, Kai Shu, Dongwon Lee
In recent years, the task of generating realistic short and long texts have made tremendous advancements.
no code implementations • 20 Nov 2023 • YiFan Li, Zhen Tan, Kai Shu, Zongsheng Cao, Yu Kong, Huan Liu
Graph Neural Networks (GNNs) have emerged as a powerful tool for representation learning on graphs, but they often suffer from overfitting and label noise issues, especially when the data is scarce or imbalanced.
no code implementations • 15 Nov 2023 • Yueqing Liang, Lu Cheng, Ali Payani, Kai Shu
This work investigates the potential of undermining both fairness and detection performance in abusive language detection.
1 code implementation • 15 Nov 2023 • Haoran Wang, Kai Shu
Our code and data are available at https://github. com/wang2226/Backdoor-Activation-Attack Warning: this paper contains content that can be offensive or upsetting.
no code implementations • 16 Oct 2023 • Qiong Nan, Qiang Sheng, Juan Cao, Yongchun Zhu, Danding Wang, Guang Yang, Jintao Li, Kai Shu
To break such a dilemma, a feasible but not well-studied solution is to leverage social contexts (e. g., comments) from historical news for training a detection model and apply it to newly emerging news without social contexts.
1 code implementation • 8 Oct 2023 • Haoran Wang, Kai Shu
While existing works on claim verification have shown promising results, a crucial piece of the puzzle that remains unsolved is to understand how to verify claims without relying on human-annotated data, which is expensive to create at a large scale.
no code implementations • 25 Sep 2023 • Canyu Chen, Kai Shu
Then, through extensive empirical investigation, we discover that LLM-generated misinformation can be harder to detect for humans and detectors compared to human-written misinformation with the same semantics, which suggests it can have more deceptive styles and potentially cause more harm.
1 code implementation • 15 Sep 2023 • Aman Rangapur, Haoran Wang, Kai Shu
Fact-checking in financial domain is under explored, and there is a shortage of quality dataset in this domain.
no code implementations • 6 Sep 2023 • Aman Rangapur, Haoran Wang, Kai Shu
In conclusion, this research paper sheds light on the pervasive issue of online financial misinformation and its wide-ranging consequences.
no code implementations • 27 Jun 2023 • Junwei Yin, Min Gao, Kai Shu, Zehua Zhao, Yinqiu Huang, Jia Wang
To this end, we propose an approach of Emulating the behaviors of readers (Ember) for fake news detection on social media, incorporating readers' reading and verificating process to model news from the component perspective thoroughly.
no code implementations • 14 Jun 2023 • Kai Shu, Yuchang Zhao, Le Wu, Aiping Liu, Ruobing Qian, Xun Chen
Data augmentation is an intuitive way to solve this problem.
no code implementations • 18 May 2023 • Xiongxiao Xu, Kaize Ding, Canyu Chen, Kai Shu
However, the work exploring limited labeled anomalies and a large amount of unlabeled nodes in graphs to detect anomalies is rather limited.
1 code implementation • 17 Apr 2023 • Baixiang Huang, Bryan Hooi, Kai Shu
To bridge this gap, we have constructed a real-world graph-based Traffic Accident Prediction (TAP) data repository, along with two representative tasks: accident occurrence prediction and accident severity prediction.
1 code implementation • 24 Dec 2022 • Ujun Jeong, Kaize Ding, Lu Cheng, Ruocheng Guo, Kai Shu, Huan Liu
Nowadays, fake news easily propagates through online social networks and becomes a grand threat to individuals and society.
Ranked #1 on
Graph Classification
on UPFD-POL
no code implementations • 10 Nov 2022 • Canyu Chen, Haoran Wang, Matthew Shapiro, Yunyu Xiao, Fei Wang, Kai Shu
Because of the uniqueness and importance of combating health misinformation in social media, we conduct this survey to further facilitate interdisciplinary research on this problem.
no code implementations • 18 Jul 2022 • Canyu Chen, Yueqing Liang, Xiongxiao Xu, Shangyu Xie, Ashish Kundu, Ali Payani, Yuan Hong, Kai Shu
Thus, it is essential to ensure fairness in machine learning models.
1 code implementation • 26 Jun 2022 • Yongchun Zhu, Qiang Sheng, Juan Cao, Qiong Nan, Kai Shu, Minghui Wu, Jindong Wang, Fuzhen Zhuang
In this paper, we propose a Memory-guided Multi-view Multi-domain Fake News Detection Framework (M$^3$FEND) to address these two challenges.
2 code implementations • 21 Jun 2022 • Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, Lichao Sun, Jundong Li, George H. Chen, Zhihao Jia, Philip S. Yu
To bridge this gap, we present--to the best of our knowledge--the first comprehensive benchmark for unsupervised outlier node detection on static attributed graphs called BOND, with the following highlights.
no code implementations • 8 Jun 2022 • Yueqing Liang, Canyu Chen, Tian Tian, Kai Shu
Though we lack the sensitive attribute for training a fair model in the target domain, there might exist a similar domain that has sensitive attributes.
1 code implementation • 18 May 2022 • Canyu Chen, Kai Shu
Extensive experiment results on few-shot text classification tasks demonstrate the superior performance of the proposed framework by effectively leveraging label semantics and data augmentation for natural language understanding.
1 code implementation • 6 May 2022 • Qiang Sheng, Juan Cao, H. Russell Bernard, Kai Shu, Jintao Li, Huan Liu
False news that spreads on social media has proliferated over the past years and has led to multi-aspect threats in the real world.
1 code implementation • 26 Apr 2022 • Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, George H. Chen, Zhihao Jia, Philip S. Yu
PyGOD is an open-source Python library for detecting outliers on graph data.
no code implementations • 16 Feb 2022 • Ahmadreza Mosallanezhad, Mansooreh Karami, Kai Shu, Michelle V. Mancenido, Huan Liu
With social media being a major force in information consumption, accelerated propagation of fake news has presented new challenges for platforms to distinguish between legitimate and fake news.
no code implementations • 9 Feb 2022 • Xinyi Zhou, Kai Shu, Vir V. Phoha, Huan Liu, Reza Zafarani
To distinguish between intentional versus unintentional spreading, we study the psychological explanations of unintentional spreading.
1 code implementation • EMNLP (CINLP) 2021 • Zhao Wang, Kai Shu, Aron Culotta
In this paper, we propose a simple and intuitive regularization approach to integrate causal knowledge during model training and build a robust and fair model by emphasizing causal features and de-emphasizing spurious features.
no code implementations • NAACL 2022 • Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah
In this paper, we propose such a benchmark, named WALNUT (semi-WeAkly supervised Learning for Natural language Understanding Testbed), to advocate and facilitate research on weak supervision for NLU.
no code implementations • 8 Jun 2021 • Enyan Dai, Kai Shu, Yiwei Sun, Suhang Wang
We propose a novel generative framework named as ADDES which can synthesize high-quality labeled data for target classification tasks by learning from data with inexact supervision and the relations between inexact supervision and target classes.
1 code implementation • 29 Apr 2021 • Tianxiang Zhao, Enyan Dai, Kai Shu, Suhang Wang
Though the sensitive attribute of each data sample is unknown, we observe that there are usually some non-sensitive features in the training data that are highly correlated with sensitive attributes, which can be used to alleviate the bias.
1 code implementation • 25 Apr 2021 • Yingtong Dou, Kai Shu, Congying Xia, Philip S. Yu, Lichao Sun
The majority of existing fake news detection algorithms focus on mining news content and/or the surrounding exogenous context for discovering deceptive signals; while the endogenous preference of a user when he/she decides to spread a piece of fake news or not is ignored.
Ranked #1 on
Graph Classification
on UPFD-GOS
1 code implementation • 8 Dec 2020 • Kai Shu, Yichuan Li, Kaize Ding, Huan Liu
The existing text generation methods either afford limited supplementary information or lose consistency between the input and output which makes the synthetic news less trustworthy.
2 code implementations • 8 Nov 2020 • Yichuan Li, Bohan Jiang, Kai Shu, Huan Liu
The COVID-19 epidemic is considered as the global health crisis of the whole society and the greatest challenge mankind faced since World War Two.
Social and Information Networks Computers and Society
no code implementations • 3 Nov 2020 • Hao Liao, Qixin Liu, Kai Shu, Xing Xie
Yet, the popularity of social media also provides opportunities to better detect fake news.
Fake News Detection
Representation Learning
Social and Information Networks
no code implementations • 30 Oct 2020 • Ahmadreza Mosallanezhad, Kai Shu, Huan Liu
In this paper, we consider realistic news as news that cannot be easily detected by a fake news classifier.
no code implementations • 18 Oct 2020 • Amrita Bhattacharjee, Kai Shu, Min Gao, Huan Liu
We then proceed to discuss the inherent challenges in disinformation research, and then elaborate on the computational and interdisciplinary approaches towards mitigation of disinformation, after a short overview of the various directions explored in detection efforts.
no code implementations • 14 Jul 2020 • Kai Shu, Amrita Bhattacharjee, Faisal Alatawi, Tahora Nazer, Kaize Ding, Mansooreh Karami, Huan Liu
The creation, dissemination, and consumption of disinformation and fabricated content on social media is a growing concern, especially with the ease of access to such sources, and the lack of awareness of the existence of such false information.
1 code implementation • 23 Jun 2020 • Kaize Ding, Jianling Wang, Jundong Li, Kai Shu, Chenghao Liu, Huan Liu
By constructing a pool of semi-supervised node classification tasks to mimic the real test environment, GPN is able to perform \textit{meta-learning} on an attributed network and derive a highly generalizable model for handling the target classification task.
no code implementations • 26 May 2020 • Kai Shu, Subhabrata Mukherjee, Guoqing Zheng, Ahmed Hassan Awadallah, Milad Shokouhi, Susan Dumais
In this paper, we propose to leverage user actions as a source of weak supervision, in addition to a limited set of annotated examples, to detect intents in emails.
no code implementations • 3 Apr 2020 • Kai Shu, Guoqing Zheng, Yichuan Li, Subhabrata Mukherjee, Ahmed Hassan Awadallah, Scott Ruston, Huan Liu
Social media has greatly enabled people to participate in online activities at an unprecedented rate.
1 code implementation • 2 Jan 2020 • Kai Shu, Suhang Wang, Dongwon Lee, Huan Liu
In recent years, disinformation including fake news, has became a global phenomenon due to its explosive growth, particularly on social media.
no code implementations • 28 Oct 2019 • Raha Moraffah, Kai Shu, Adrienne Raglin, Huan Liu
Recent research on deep domain adaptation proposed to mitigate this problem by forcing the deep model to learn more transferable feature representations across domains.
no code implementations • 24 Oct 2019 • Kai Shu, Ahmed Hassan Awadallah, Susan Dumais, Huan Liu
This is especially the case for many real-world tasks where large scale annotated examples are either too expensive to acquire or unavailable due to privacy or data access constraints.
no code implementations • 1 May 2019 • Yu-Feng Yu, Yuelong Zhu, Dingsheng Wan, Qun Zhao, Kai Shu, Huan Liu
Floods of research and practical applications employ social media data for a wide range of public applications, including environmental monitoring, water resource managing, disaster and emergency response. Hydroinformatics can benefit from the social media technologies with newly emerged data, techniques and analytical tools to handle large datasets, from which creative ideas and new values could be mined. This paper first proposes a 4W (What, Why, When, hoW) model and a methodological structure to better understand and represent the application of social media to hydroinformatics, then provides an overview of academic research of applying social media to hydroinformatics such as water environment, water resources, flood, drought and water Scarcity management.
no code implementations • 30 Apr 2019 • Kai Shu, Xinyi Zhou, Suhang Wang, Reza Zafarani, Huan Liu
In an attempt to understand connections between user profiles and fake news, first, we measure users' sharing behaviors on social media and group representative users who are more likely to share fake and real news; then, we perform a comparative analysis of explicit and implicit profile features between these user groups, which reveals their potential to help differentiate fake news from real news.
2 code implementations • 21 Mar 2019 • Kai Shu, Deepak Mahudeswaran, Suhang Wang, Huan Liu
In an attempt to understand the correlations between news propagation networks and fake news, first, we build a hierarchical propagation network from macro-level and micro-level of fake news and true news; second, we perform a comparative analysis of the propagation network features of linguistic, structural and temporal perspectives between fake and real news, which demonstrates the potential of utilizing these features to detect fake news; third, we show the effectiveness of these propagation network features for fake news detection.
Social and Information Networks
no code implementations • 6 Mar 2019 • Wen Zhang, Kai Shu, Huan Liu, Yalin Wang
In particular, we provide a principled approach to jointly capture local and global information in the user-user social graph and propose the framework {\m}, which jointly learning user representations for user identity linkage.
1 code implementation • 5 Mar 2019 • Xueyao Zhang, Juan Cao, Xirong Li, Qiang Sheng, Lei Zhong, Kai Shu
Emotion plays an important role in detecting fake news online.
7 code implementations • 5 Sep 2018 • Kai Shu, Deepak Mahudeswaran, Suhang Wang, Dongwon Lee, Huan Liu
However, fake news detection is a non-trivial task, which requires multi-source information such as news content, social context, and dynamic information.
Social and Information Networks
4 code implementations • 20 Dec 2017 • Kai Shu, Suhang Wang, Huan Liu
Recent Social and Psychology studies show potential importance to utilize social media data: 1) Confirmation bias effect reveals that consumers prefer to believe information that confirms their existing stances; 2) Echo chamber effect suggests that people tend to follow likeminded users and form segregated communities on social media.
Social and Information Networks
no code implementations • 14 Sep 2017 • Fred Morstatter, Kai Shu, Suhang Wang, Huan Liu
We apply our solution to sentiment analysis, a task that can benefit from the emoji calibration technique we use in this work.
6 code implementations • 7 Aug 2017 • Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, Huan Liu
First, fake news is intentionally written to mislead readers to believe false information, which makes it difficult and nontrivial to detect based on news content; therefore, we need to include auxiliary information, such as user social engagements on social media, to help make a determination.