no code implementations • LREC 2022 • Isil Yakut Kilic, SHimei Pan
In this research, we investigate diverse methods to incorporate Linguistic Inquiry and Word Count (LIWC), a widely-used psycholinguistic lexicon, in NN models to improve human trait and behavior analysis in low resource scenarios.
no code implementations • EMNLP 2021 • Arpita Roy, SHimei Pan
In recent years pre-trained language models (PLM) such as BERT have proven to be very effective in diverse NLP tasks such as Information Extraction, Sentiment Analysis and Question Answering.
no code implementations • 14 Apr 2025 • Alden Dima, James Foulds, SHimei Pan, Philip Feldman
We present an approach to monitor LLMs for changes by comparing the distributions of linguistic and psycholinguistic features of generated text.
no code implementations • 11 Mar 2025 • Siyuan Wang, James R. Foulds, Md Osman Gani, SHimei Pan
In this paper, we introduce CIBER (Claim Investigation Based on Evidence Retrieval), an extension of the Retrieval-Augmented Generation (RAG) framework designed to identify corroborating and refuting documents as evidence for scientific claim verification.
1 code implementation • 20 Jun 2024 • Tao Zhang, Ziqian Zeng, Yuxiang Xiao, Huiping Zhuang, Cen Chen, James Foulds, SHimei Pan
Furthermore, we categorized the gender biases in the "rejected" responses of GenderAlign into 4 principal categories.
no code implementations • 2 Mar 2024 • Philip Feldman, James R. Foulds, SHimei Pan
Large language models (LLMs) like ChatGPT demonstrate the remarkable progress of artificial intelligence.
no code implementations • 13 Nov 2023 • Fatema Hasan, Yulong Li, James Foulds, SHimei Pan, Bishwaranjan Bhattacharjee
This leads to an improved language model for analyzing spoken transcripts while avoiding an audio processing overhead at test time.
no code implementations • 9 Jun 2023 • Philip Feldman, James R. Foulds, SHimei Pan
Recent advances in large language models (LLMs), such as ChatGPT, have led to highly sophisticated conversation agents.
no code implementations • 11 Jan 2023 • Richard Brath, Daniel Keim, Johannes Knittel, SHimei Pan, Pia Sommerauer, Hendrik Strobelt
We motivate the use of visualization in relation to target users and common NLP pipelines.
no code implementations • 15 Sep 2022 • Rashidul Islam, SHimei Pan, James R. Foulds
It is now well understood that machine learning models, trained on data without due care, often exhibit unfair and discriminatory behavior against certain populations.
no code implementations • 6 May 2022 • George J. Cancro, SHimei Pan, James Foulds
In this paper we survey literature in the area of trust between a single human supervisor and a single agent subordinate to determine the nature and extent of this additional information and to characterize it into a taxonomy that can be leveraged by future researchers and intelligent agent practitioners.
no code implementations • 13 Jun 2021 • Clarice Wang, Kathryn Wang, Andrew Bian, Rashidul Islam, Kamrun Naher Keya, James Foulds, SHimei Pan
In other words, our results demonstrate we cannot fully address the gender bias issue in AI recommendations without addressing the gender bias in humans.
no code implementations • 17 May 2021 • Fatema Hasan, Kevin S. Xu, James R. Foulds, SHimei Pan
User-generated data on social media contain rich information about who we are, what we like and how we make decisions.
no code implementations • 20 Apr 2021 • Philip Feldman, Sim Tiwari, Charissa S. L. Cheah, James R. Foulds, SHimei Pan
This paper describes a method for using Transformer-based Language Models (TLMs) to understand public opinion from social media posts.
1 code implementation • 18 Apr 2021 • Ziqian Zeng, Rashidul Islam, Kamrun Naher Keya, James Foulds, Yangqiu Song, SHimei Pan
Recently, much attention has been paid to the societal impact of AI, especially concerns regarding its fairness.
no code implementations • 14 Oct 2020 • Kamrun Naher Keya, Rashidul Islam, SHimei Pan, Ian Stockwell, James R. Foulds
Healthcare programs such as Medicaid provide crucial services to vulnerable populations, but due to limited resources, many of the individuals who need these services the most languish on waiting lists.
no code implementations • 9 Oct 2020 • Guohou Shan, James Foulds, SHimei Pan
Text features that are correlated with class labels, but do not directly cause them, are sometimesuseful for prediction, but they may not be insightful.
no code implementations • 2 Sep 2020 • Rashidul Islam, Kamrun Naher Keya, Ziqian Zeng, SHimei Pan, James Foulds
A growing proportion of human interactions are digitized on social media platforms and subjected to algorithmic decision-making, and it has become increasingly important to ensure fair treatment from these algorithms.
no code implementations • IJCNLP 2019 • Arpita Roy, Youngja Park, Taesung Lee, SHimei Pan
We propose a novel supervised open information extraction (Open IE) framework that leverages an ensemble of unsupervised Open IE systems and a small amount of labeled data to improve system performance.
no code implementations • NAACL 2019 • Arpita Roy, Youngja Park, SHimei Pan
Text analytics is a useful tool for studying malware behavior and tracking emerging threats.
no code implementations • 18 Nov 2018 • James Foulds, Rashidul Islam, Kamrun Keya, SHimei Pan
Intersectionality is a framework that analyzes how interlocking systems of power and oppression affect individuals along overlapping dimensions including race, gender, sexual orientation, class, and disability.
2 code implementations • 22 Jul 2018 • James Foulds, Rashidul Islam, Kamrun Naher Keya, SHimei Pan
We propose definitions of fairness in machine learning and artificial intelligence systems that are informed by the framework of intersectionality, a critical lens arising from the Humanities literature which analyzes how interlocking systems of power and oppression affect individuals along overlapping dimensions including gender, race, sexual orientation, class, and disability.
no code implementations • SEMEVAL 2018 • Ankur Padia, Arpita Roy, Taneeya Satyapanich, Francis Ferraro, SHimei Pan, Youngja Park, Anupam Joshi, Tim Finin
We describe the systems developed by the UMBC team for 2018 SemEval Task 8, SecureNLP (Semantic Extraction from CybersecUrity REports using Natural Language Processing).
no code implementations • 11 Apr 2018 • Shimei Pan, Tao Ding
Given the complexity of human minds and their behavioral flexibility, it requires sophisticated data analysis to sift through a large amount of human behavioral evidence to model human minds and to predict human behavior.
no code implementations • 21 Sep 2017 • Arpita Roy, Youngja Park, SHimei Pan
In this pa-per, we describe a novel method to train domain-specificword embeddings from sparse texts.
no code implementations • EMNLP 2017 • Tao Ding, Warren K. Bickel, SHimei Pan
In this paper, we demonstrate how the state-of-the-art machine learning and text mining techniques can be used to build effective social media-based substance use detection systems.
no code implementations • 16 May 2017 • Tao Ding, Warren K. Bickel, SHimei Pan
In this paper, we demonstrate how the state-of-the-art machine learning and text mining techniques can be used to build effective social media-based substance use detection systems.
no code implementations • 22 Mar 2017 • Tao Ding, Warren K. Bickel, SHimei Pan
In economics and psychology, delay discounting is often used to characterize how individuals choose between a smaller immediate reward and a larger delayed reward.
no code implementations • EMNLP 2016 • Tao Ding, SHimei Pan
In this paper, we present a study on personalized emphasis framing which can be used to tailor the content of a message to enhance its appeal to different individuals.