no code implementations • COLING 2022 • Pranav Narayanan Venkit, Mukund Srinath, Shomir Wilson
Pretrained language models (PLMs) have been shown to exhibit sociodemographic biases, such as against gender and race, raising concerns of downstream biases in language technologies.
no code implementations • GWC 2016 • Shomir Wilson, Alan Black, Jon Oberlander
Writing intended to inform frequently contains references to document entities (DEs), a mixed class that includes orthographically structured items (e. g., illustrations, sections, lists) and discourse entities (arguments, suggestions, points).
no code implementations • LREC 2022 • Siddhant Arora, Henry Hosseini, Christine Utz, Vinayshekhar Bannihatti Kumar, Tristan Dhellemmes, Abhilasha Ravichander, Peter Story, Jasmine Mangat, Rex Chen, Martin Degeling, Thomas Norton, Thomas Hupperich, Shomir Wilson, Norman Sadeh
Over the past decade, researchers have started to explore the use of NLP to develop tools aimed at helping the public, vendors, and regulators analyze disclosures made in privacy policies.
1 code implementation • LREC 2022 • Nan Zhang, Shomir Wilson, Prasenjit Mitra
Therefore, we propose the first title-text dataset on web documents that incorporates a wide variety of domains to facilitate downstream training.
no code implementations • 11 Apr 2024 • Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson
We investigate how hallucination in large language models (LLM) is characterized in peer-reviewed literature using a critical examination of 103 publications across NLP research.
no code implementations • 16 Feb 2024 • Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson
Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them.
no code implementations • 18 Oct 2023 • Pranav Narayanan Venkit, Mukund Srinath, Sanjana Gautam, Saranya Venkatraman, Vipul Gupta, Rebecca J. Passonneau, Shomir Wilson
We conduct an inquiry into the sociotechnical aspects of sentiment analysis (SA) by critically examining 189 peer-reviewed papers on their applications, models, and datasets.
1 code implementation • 24 Aug 2023 • Vipul Gupta, Pranav Narayanan Venkit, Hugo Laurençon, Shomir Wilson, Rebecca J. Passonneau
We apply CALM to 20 large language models, and find that for 2 language model series, larger parameter models tend to be more biased than smaller ones.
no code implementations • 8 Aug 2023 • Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao `Kenneth' Huang, Shomir Wilson
We investigate the potential for nationality biases in natural language processing (NLP) models using human evaluation methods.
no code implementations • 18 Jul 2023 • Pranav Narayanan Venkit, Mukund Srinath, Shomir Wilson
We analyze sentiment analysis and toxicity detection models to detect the presence of explicit bias against people with disability (PWD).
no code implementations • 13 Jun 2023 • Vipul Gupta, Pranav Narayanan Venkit, Shomir Wilson, Rebecca J. Passonneau
This paper presents a comprehensive survey of work on sociodemographic bias in language models (LMs).
no code implementations • 5 Feb 2023 • Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao 'Kenneth' Huang, Shomir Wilson
Little attention is placed on analyzing nationality bias in language models, especially when nationality is highly used as a factor in increasing the performance of social NLP models.
no code implementations • 28 Jun 2022 • Sonu Gupta, Ellen Poplavska, Nora O'Toole, Siddhant Arora, Thomas Norton, Norman Sadeh, Shomir Wilson
To examine the status and evolution of this patchwork, we introduce the Government Privacy Instructions Corpus, or GPI Corpus, of 1, 043 privacy laws, regulations, and guidelines, covering 182 jurisdictions.
no code implementations • 2 Feb 2022 • Younes Karimi, Anna Squicciarini, Shomir Wilson
Doxing refers to the practice of disclosing sensitive personal information about a person without their consent.
no code implementations • 25 Nov 2021 • Pranav Narayanan Venkit, Shomir Wilson
The results show that all exhibit strong negative biases on sentences that mention disability.
no code implementations • ACL 2021 • Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson, Norman Sadeh
Privacy plays a crucial role in preserving democratic ideals and personal autonomy.
no code implementations • 14 Mar 2021 • Pranav Venkit, Zeba Karishma, Chi-Yang Hsu, Rahul Katiki, Kenneth Huang, Shomir Wilson, Patrick Dudas
We widely use emojis in social networking to heighten, mitigate or negate the sentiment of the text.
no code implementations • ACL 2021 • Mukund Srinath, Shomir Wilson, C. Lee Giles
Organisations disclose their privacy practices by posting privacy policies on their website.
1 code implementation • IJCNLP 2019 • Abhilasha Ravichander, Alan W. black, Shomir Wilson, Thomas Norton, Norman Sadeh
The PrivacyQA corpus offers a challenging corpus for question answering, with genuine real-world utility.
no code implementations • EMNLP 2018 • Abhijith Athreya Mysore Gopinath, Shomir Wilson, Norman Sadeh
To remedy this, we present a flexible system for automatically extracting the hierarchical section titles and prose organization of web documents irrespective of differences in HTML representation.
no code implementations • EMNLP 2017 • Kanthashree Mysore Sathyendra, Shomir Wilson, Florian Schaub, Sebastian Zimmeck, Norman Sadeh
Our techniques enable the creation of systems to help Internet users to learn about their choices, thereby effectuating notice and choice and improving Internet privacy.
no code implementations • ACL 2016 • Shomir Wilson, Florian Schaub, Aswarth Abhilash Dara, Frederick Liu, Sushain Cherivirala, Pedro Giovanni Leon, Mads Schaarup Andersen, Sebastian Zimmeck, Kanthashree Mysore Sathyendra, N. Cameron Russell, Thomas B. Norton, Eduard Hovy, Joel Reidenberg, Norman Sadeh