no code implementations • 2 Oct 2024 • Doohee You, Karim Lasri, Samuel Fraiberger
This study investigates efficient deduplication techniques for a large NLP dataset of economic research paper titles.
1 code implementation • 27 Apr 2024 • Manuel Tonneau, Diyi Liu, Samuel Fraiberger, Ralph Schroeder, Scott A. Hale, Paul Röttger
We find that HS datasets for these languages exhibit a strong geo-cultural bias, largely overrepresenting a handful of countries (e. g., US and UK for English) relative to their prominence in both the broader social media population and the general population speaking these languages.
1 code implementation • ACL 2022 • Manuel Tonneau, Dhaval Adjodah, João Palotti, Nir Grinberg, Samuel Fraiberger
Detecting disclosures of individuals' employment status on social media can provide valuable information to match job seekers with suitable vacancies, offer social protection, or measure labor market flows.
no code implementations • IJCNLP 2019 • Ananth Balashankar, Sun Chakraborty, an, Samuel Fraiberger, Lakshminarayanan Subramanian
We propose a new framework to uncover the relationship between news events and real world phenomena.