no code implementations • LREC 2020 • Alina Maria Ciobanu, Liviu P. Dinu, Laurentiu Zoicas
Producing related words is a key concern in historical linguistics.
no code implementations • CL 2019 • Alina Maria Ciobanu, Liviu P. Dinu
We apply our method to multiple data sets, showing that our approach improves on previous results, also having the advantage of requiring less input data, which is essential in historical linguistics, where resources are generally scarce.
no code implementations • WS 2019 • Ana Uban, Alina Maria Ciobanu, Liviu P. Dinu
Semantic divergence in related languages is a key concern of historical linguistics.
no code implementations • 14 Aug 2018 • Liviu P. Dinu, Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi
In this paper we present ensemble-based systems for dialect and language variety identification using the datasets made available by the organizers of the VarDial Evaluation Campaign 2018.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu
Language change across space and time is one of the main concerns in historical linguistics.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu
Proto-word reconstruction is central to the study of language evolution.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Shervin Malmasi, Liviu P. Dinu
In this paper we present the GDI_classification entry to the second German Dialect Identification (GDI) shared task organized within the scope of the VarDial Evaluation Campaign 2018.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Santanu Pal, Liviu P. Dinu
In this paper we present a system based on SVM ensembles trained on characters and words to discriminate between five similar languages of the Indo-Aryan family: Hindi, Braj Bhasha, Awadhi, Bhojpuri, and Magahi.
no code implementations • SEMEVAL 2018 • Bogdan Dumitru, Alina Maria Ciobanu, Liviu P. Dinu
Semantic difference detection attempts to capture whether a word is a discriminative attribute between two other words.
no code implementations • WS 2017 • Marcos Zampieri, Alina Maria Ciobanu, Liviu P. Dinu
This paper presents an ensemble system combining the output of multiple SVM classifiers to native language identification (NLI).
no code implementations • 3 Jul 2017 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu
This paper presents a computational approach to author profiling taking gender and language variety into account.
no code implementations • WS 2016 • Sergiu Nisioi, Alina Maria Ciobanu, Liviu P. Dinu
In this paper we describe the submission of the UniBuc-NLP team for the Discriminating between Similar Languages Shared Task, DSL 2016.
no code implementations • LREC 2016 • Alina Maria Ciobanu, Liviu P. Dinu
In this paper we conduct an initial study on the dialects of Romanian.
no code implementations • LREC 2014 • Liviu Dinu, Alina Maria Ciobanu
Identifying cognates is an interesting task with applications in numerous research areas, such as historical and comparative linguistics, language acquisition, cross-lingual information retrieval, readability and machine translation.
no code implementations • LREC 2014 • Liviu Dinu, Alina Maria Ciobanu
We propose a method for computing the similarity of natural languages and for clustering them based on their lexical similarity.
no code implementations • LREC 2014 • Liviu Dinu, Alina Maria Ciobanu, Ioana Chitoran, Vlad Niculae
We address the task of stress prediction as a sequence tagging problem.