Language Identification

26 papers with code · Natural Language Processing

Leaderboards

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Greatest papers with code

Language Identification Using Deep Convolutional Recurrent Neural Networks

16 Aug 2017 HPI-DeepLearning/crnn-lid

Language Identification (LID) systems are used to classify the spoken language from a given audio sample and are typically the first step for many spoken language processing tasks, such as Automatic Speech Recognition (ASR) systems.

LANGUAGE IDENTIFICATION · SPEECH RECOGNITION

Common Voice: A Massively-Multilingual Speech Corpus

13 Dec 2019 facebookresearch/covost

To our knowledge this is the largest audio corpus in the public domain for speech recognition, both in terms of number of hours and number of languages.

LANGUAGE IDENTIFICATION · SPEECH RECOGNITION · TRANSFER LEARNING

A Semisupervised Approach for Language Identification based on Ladder Networks

1 Apr 2016 udibr/LRE

In this study we address the problem of training a neural network for language identification using both labeled and unlabeled speech samples in the form of i-vectors.

DENOISING · LANGUAGE IDENTIFICATION

Automatic Dialect Detection in Arabic Broadcast Speech

23 Sep 2015 Qatar-Computing-Research-Institute/dialectID

We used these features in a binary classifier to discriminate between Modern Standard Arabic (MSA) and Dialectal Arabic, with an accuracy of 100%.

LANGUAGE IDENTIFICATION · SPEECH RECOGNITION

LanideNN: Multilingual Language Identification on Character Window

EACL 2017 tomkocmi/LanideNN

In language identification, a common first step in natural language processing, we want to automatically determine the language of some input text.

LANGUAGE IDENTIFICATION

What's in a Domain? Learning Domain-Robust Text Representations using Adversarial Training

NAACL 2018 lrank/Domain_Robust_Text_Representation

Most real-world language problems require learning from heterogeneous corpora, raising the problem of learning robust models which generalise well to both similar (in-domain) and dissimilar (out-of-domain) instances to those seen in training.

DOMAIN ADAPTATION · LANGUAGE IDENTIFICATION · SENTIMENT ANALYSIS

Hierarchical Character-Word Models for Language Identification

WS 2016 ajaech/twitter_langid

Social media messages' brevity and unconventional spelling pose a challenge to language identification.

LANGUAGE IDENTIFICATION

On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition

1 Nov 2018 zengzp0912/SEAME-dev-set

Code-switching (CS) refers to a linguistic phenomenon where a speaker uses different languages in an utterance or between alternating utterances.

DATA AUGMENTATION · LANGUAGE IDENTIFICATION · LANGUAGE MODELLING · SPEECH RECOGNITION