Native Language Identification
5 papers with code • 1 benchmarks • 2 datasets
Native Language Identification (NLI) is the task of determining an author's native language (L1) based only on their writings in a second language (L2).
Latest papers with no code
Native Language Identification with Large Language Models
We present the first experiments on Native Language Identification (NLI) using LLMs such as GPT-4.
Turkish Native Language Identification
In this paper, we present the first application of Native Language Identification (NLI) for the Turkish language.
Scaling Native Language Identification with Transformer Adapters
Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language.
Unravelling Interlanguage Facts via Explainable Machine Learning
We focus on a different facet of the NLI task, i. e., that of analysing the internals of an NLI classifier trained by an \emph{explainable} machine learning algorithm, in order to obtain explanations of its classification decisions, with the ultimate goal of gaining insight into which linguistic phenomena ``give a speaker's native language away''.
A Deep Generative Approach to Native Language Identification
Native language identification (NLI) {--} identifying the native language (L1) of a person based on his/her writing in the second language (L2) {--} is useful for a variety of purposes, including marketing, security, and educational applications.
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
We address the task of automatically grading the language proficiency of spontaneous speech based on textual features from automatic speech recognition transcripts.
A Report on the 2020 VUA and TOEFL Metaphor Detection Shared Task
In this paper, we report on the shared task on metaphor identification on VU Amsterdam Metaphor Corpus and on a subset of the TOEFL Native Language Identification Corpus.
Regression or classification? Automated Essay Scoring for Norwegian
In this paper we present first results for the task of Automated Essay Scoring for Norwegian learner language.
Anglicized Words and Misspelled Cognates in Native Language Identification
In this paper, we present experiments that estimate the impact of specific lexical choices of people writing in a second language (L2).
Transductive Learning with String Kernels for Cross-Domain Text Classification
Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of such classifiers is usually lower in the cross-domain setting.