1 code implementation • 22 Apr 2018 • Tommi Jauhiainen, Marco Lui, Marcos Zampieri, Timothy Baldwin, Krister Lindén
Language identification (LI) is the problem of determining the natural language that a document or part thereof is written in.
no code implementations • TACL 2014 • Marco Lui, Jey Han Lau, Timothy Baldwin
Language identification is the task of automatically detecting the language(s) present in a document based on the content of the document.
no code implementations • IJCNLP 2011 • Marco Lui, Timothy Baldwin
We show that transductive (cross-domain) learning is an important consideration in building a general-purpose language identification system, and develop a feature selection method that generalizes across domains.