Search Results for author: Stephen Mayhew

Found 20 papers, 5 papers with code

Building Low-Resource NER Models Using Non-Speaker Annotations

no code implementations NAACL (DaSH) 2021 Tatiana Tsygankova, Francesca Marini, Stephen Mayhew, Dan Roth

In low-resource natural language processing (NLP), the key problems are a lack of target language training data, and a lack of native speakers to create it.

Low Resource Named Entity Recognition Named Entity Recognition +1

Building Low-Resource NER Models Using Non-Speaker Annotation

no code implementations17 Jun 2020 Tatiana Tsygankova, Francesca Marini, Stephen Mayhew, Dan Roth

In low-resource natural language processing (NLP), the key problems are a lack of target language training data, and a lack of native speakers to create it.

Low Resource Named Entity Recognition Named Entity Recognition +1

Cross-Lingual Ability of Multilingual BERT: An Empirical Study

no code implementations ICLR 2020 Karthikeyan K, Zihan Wang, Stephen Mayhew, Dan Roth

Recent work has exhibited the surprising cross-lingual abilities of multilingual BERT (M-BERT) -- surprising since it is trained without any cross-lingual objective and with no aligned data.

Named Entity Recognition Natural Language Inference

Robust Named Entity Recognition with Truecasing Pretraining

no code implementations15 Dec 2019 Stephen Mayhew, Nitish Gupta, Dan Roth

Although modern named entity recognition (NER) systems show impressive performance on standard datasets, they perform poorly when presented with noisy data.

Named Entity Recognition NER

Named Entity Recognition with Partially Annotated Training Data

no code implementations CONLL 2019 Stephen Mayhew, Snigdha Chaturvedi, Chen-Tse Tsai, Dan Roth

Supervised machine learning assumes the availability of fully-labeled data, but in many cases, such as low-resource languages, the only data available is partially annotated.

Named Entity Recognition NER

BSNLP2019 Shared Task Submission: Multisource Neural NER Transfer

no code implementations WS 2019 Tatiana Tsygankova, Stephen Mayhew, Dan Roth

This paper describes the Cognitive Computation (CogComp) Group{'}s submissions to the multilingual named entity recognition shared task at the Balto-Slavic Natural Language Processing (BSNLP) Workshop.

Multilingual Named Entity Recognition NER

Legal Linking: Citation Resolution and Suggestion in Constitutional Law

1 code implementation WS 2019 Robert Shaffer, Stephen Mayhew

This paper describes a dataset and baseline systems for linking paragraphs from court cases to clauses or amendments in the US Constitution.

ner and pos when nothing is capitalized

no code implementations IJCNLP 2019 Stephen Mayhew, Tatiana Tsygankova, Dan Roth

While prior work and first impressions might suggest training a caseless model, or using a truecaser at test time, we show that the most effective strategy is a concatenation of cased and lowercased training data, producing a single model with high performance on both cased and uncased text.

Machine Translation Named Entity Recognition +4

Simple Features for Strong Performance on Named Entity Recognition in Code-Switched Twitter Data

no code implementations WS 2018 Devanshu Jain, Maria Kustikova, Mayank Darbari, Rishabh Gupta, Stephen Mayhew

In this work, we address the problem of Named Entity Recognition (NER) in code-switched tweets as a part of the Workshop on Computational Approaches to Linguistic Code-switching (CALCS) at ACL{'}18.

Language Identification Named Entity Recognition +3

TALEN: Tool for Annotation of Low-resource ENtities

1 code implementation ACL 2018 Stephen Mayhew, Dan Roth

We present a new web-based interface, TALEN, designed for named entity annotation in low-resource settings where the annotators do not speak the language.

Named Entity Recognition

Cheap Translation for Cross-Lingual Named Entity Recognition

no code implementations EMNLP 2017 Stephen Mayhew, Chen-Tse Tsai, Dan Roth

Recent work in NLP has attempted to deal with low-resource languages but still assumed a resource level that is not present for most languages, e. g., the availability of Wikipedia in the target language.

Cross-Lingual NER Named Entity Recognition +2

Cross-lingual Dataless Classification for Languages with Small Wikipedia Presence

no code implementations13 Nov 2016 Yangqiu Song, Stephen Mayhew, Dan Roth

We use a word-level dictionary to convert documents in a SWL to a large-Wikipedia language (LWLs), and then perform CLDDC based on the LWL's Wikipedia.

Classification Document Classification +4

Transliteration in Any Language with Surrogate Languages

no code implementations14 Sep 2016 Stephen Mayhew, Christos Christodoulopoulos, Dan Roth

We introduce a method for transliteration generation that can produce transliterations in every language.

Transliteration

ILLINOISCLOUDNLP: Text Analytics Services in the Cloud

no code implementations LREC 2014 Hao Wu, Zhiye Fei, Aaron Dai, Mark Sammons, Dan Roth, Stephen Mayhew

Natural Language Processing (NLP) continues to grow in popularity in a range of research and commercial applications.

Knowledge Base Population

Cannot find the paper you are looking for? You can Submit a new open access paper.