no code implementations • ACL 2020 • Emily M. Bender, Dirk Hovy, Alex Schofield, ra
To raise awareness among future NLP practitioners and prevent inertia in the field, we need to place ethics in the curriculum for all NLP students{---}not as an elective, but as a core part of their education.
no code implementations • EMNLP 2017 • Alex Schofield, ra, Laure Thompson, David Mimno
Duplicate documents are a pervasive problem in text datasets and can have a strong effect on unsupervised models.
no code implementations • EACL 2017 • Alex Schofield, ra, M{\aa}ns Magnusson, David Mimno
It is often assumed that topic models benefit from the use of a manually curated stopword list.
no code implementations • TACL 2016 • Alex Schofield, ra, David Mimno
Rule-based stemmers such as the Porter stemmer are frequently used to preprocess English corpora for topic modeling.