Native Language Identification

5 papers with code • 1 benchmark • 2 datasets

Native Language Identification (NLI) is the task of determining an author's native language (L1) based only on their writings in a second language (L2).

Latest papers with no code

Native Language Identification with Large Language Models

no code yet • 13 Dec 2023

We present the first experiments on Native Language Identification (NLI) using LLMs such as GPT-4.

Turkish Native Language Identification

no code yet • 27 Jul 2023

In this paper, we present the first application of Native Language Identification (NLI) for the Turkish language.

Scaling Native Language Identification with Transformer Adapters

no code yet • 18 Nov 2022

Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language.

Unravelling Interlanguage Facts via Explainable Machine Learning

no code yet • 2 Aug 2022

We focus on a different facet of the NLI task, i.e., that of analysing the internals of an NLI classifier trained by an explainable machine learning algorithm, in order to obtain explanations of its classification decisions, with the ultimate goal of gaining insight into which linguistic phenomena "give a speaker's native language away".

A Deep Generative Approach to Native Language Identification

no code yet • COLING 2020

Native language identification (NLI), identifying the native language (L1) of a person based on his/her writing in the second language (L2), is useful for a variety of purposes, including marketing, security, and educational applications.

Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions

no code yet • ACL 2020

We address the task of automatically grading the language proficiency of spontaneous speech based on textual features from automatic speech recognition transcripts.

A Report on the 2020 VUA and TOEFL Metaphor Detection Shared Task

no code yet • WS 2020

In this paper, we report on the shared task on metaphor identification on VU Amsterdam Metaphor Corpus and on a subset of the TOEFL Native Language Identification Corpus.

Regression or classification? Automated Essay Scoring for Norwegian

no code yet • WS 2019

In this paper we present first results for the task of Automated Essay Scoring for Norwegian learner language.

Anglicized Words and Misspelled Cognates in Native Language Identification

no code yet • WS 2019

In this paper, we present experiments that estimate the impact of specific lexical choices of people writing in a second language (L2).

Transductive Learning with String Kernels for Cross-Domain Text Classification

no code yet • 2 Nov 2018

Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of such classifiers is usually lower in the cross-domain setting.