no code implementations • LREC 2022 • Alexandra Ciobotaru, Mihai Vlad Constantinescu, Liviu P. Dinu, Stefan Dumitrescu
RED (Romanian Emotion Dataset) is a machine learning-based resource developed for the automatic detection of emotions in Romanian texts, containing single-label annotated tweets with one of the following emotions: joy, fear, sadness, anger and neutral.
no code implementations • RANLP 2021 • Alexandra Ciobotaru, Liviu P. Dinu
In this article we present some features of our novel dataset, and create a benchmark to achieve the first supervised machine learning model for automatic Emotion Detection in Romanian short texts.
no code implementations • LREC 2022 • Ștefan Cobeli, Ioan-Bogdan Iordache, Shweta Yadav, Cornelia Caragea, Liviu P. Dinu, Dragoș Iliescu
Later, we devised a multi-task knowledge distillation framework to simultaneously learn the target task of optimism detection with the help of the auxiliary task of sentiment analysis and hate speech detection.
no code implementations • RANLP 2021 • Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Ana Sabina Uban, Laurentiu Zoicas
In this paper we investigate the etymology of Romanian words.
no code implementations • ACL (LChange) 2021 • Ana Sabina Uban, Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Laurentiu Zoicas
To this end, we introduce a new curated dataset of cognates in all pairs of those languages.
no code implementations • LREC 2022 • Ioan-Bogdan Iordache, Ana Sabina Uban, Catalin Stoean, Liviu P. Dinu
It is encouraging that all models, be that they are applied to Romanian or English texts, indicate a correlation between the sentiment scores and the increase or decrease of the stock closing prices.
no code implementations • Findings (EMNLP) 2021 • Liviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban, Marcos Zampieri
In this paper we study pejorative language, an under-explored topic in computational linguistics.
no code implementations • Findings (EMNLP) 2021 • Alina Maria Cristea, Liviu P. Dinu, Simona Georgescu, Mihnea-Lucian Mihai, Ana Sabina Uban
In this paper, we address the problem of automatically discriminating between inherited and borrowed Latin words.
1 code implementation • 13 Jan 2023 • Ana-Maria Bucur, Adrian Cosma, Paolo Rosso, Liviu P. Dinu
In this work, we propose a flexible time-enriched multimodal transformer architecture for detecting depression from social media posts, using pretrained models for extracting image and text embeddings.
no code implementations • 2 Jul 2022 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu, Paolo Rosso
This work proposes a transformer architecture for user-level classification of gambling addiction and depression that is trainable end-to-end.
no code implementations • LREC 2022 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu
In this work, we explore the relationship between depression and manifestations of happiness in social media.
no code implementations • WNUT (ACL) 2021 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu
Our results show that while word-level, intrinsic, performance evaluation is behind other methods, our model improves performance on extrinsic, downstream tasks through normalization compared to models operating on raw, unprocessed, social media text.
no code implementations • RANLP 2021 • Ana-Maria Bucur, Ioana R. Podină, Liviu P. Dinu
In this work, we provide an extensive part-of-speech analysis of the discourse of social media users with depression.
no code implementations • 30 Jun 2021 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu
Early risk detection of mental illnesses has a massive positive impact upon the well-being of people.
no code implementations • Findings (ACL) 2021 • Ana-Maria Bucur, Marcos Zampieri, Liviu P. Dinu
In this paper, we analyze the interplay between the use of offensive language and mental health.
no code implementations • 2 Dec 2020 • Liviu P. Dinu, Ana-Sabina Uban
We also perform an analysis of the variation in topic between the two epochs, to compare with the variation at the style level.
no code implementations • 2 Dec 2020 • Ana-Sabina Uban, Alina-Maria Ciobanu, Liviu P. Dinu
In this paper we investigate semantic divergence across languages by measuring the semantic similarity of cognate sets in multiple languages.
no code implementations • 3 Nov 2020 • Ana-Maria Bucur, Liviu P. Dinu
Computational research on mental health disorders from written texts covers an interdisciplinary area between natural language processing and psychology.
no code implementations • LREC 2020 • Ana Sabina Uban, Liviu P. Dinu
Cognate words, defined as words in different languages which derive from a common etymon, can be useful for language learners, who can leverage the orthographical similarity of cognates to more easily understand a text in a foreign language.
no code implementations • LREC 2020 • Alina Maria Ciobanu, Liviu P. Dinu, Laurentiu Zoicas
Producing related words is a key concern in historical linguistics.
no code implementations • CL 2019 • Alina Maria Ciobanu, Liviu P. Dinu
We apply our method to multiple data sets, showing that our approach improves on previous results, also having the advantage of requiring less input data, which is essential in historical linguistics, where resources are generally scarce.
no code implementations • IJCNLP 2019 • Cornelia Caragea, Ana Uban, Liviu P. Dinu
We study this question on the ACL and EMNLP paper collections and present an analysis on how well deep learning techniques can infer the authors of a paper.
no code implementations • RANLP 2019 • Laura Franzoi, Andrea Sgarro, Anca Dinu, Liviu P. Dinu
In this paper, we present new methods for language classification which put to good use both syntax and fuzzy tools, and are capable of dealing with irrelevant linguistic features (i. e. features which should not contribute to the classification) and even inconsistent features (which do not make sense for specific languages).
no code implementations • RANLP 2019 • Daniela Onita, Liviu P. Dinu, Adriana Birlutiu
In this paper, we investigate an approach for mapping images to text for three types of sentiment categories: positive, neutral and negative.
no code implementations • WS 2019 • Ana Uban, Alina Maria Ciobanu, Liviu P. Dinu
Semantic divergence in related languages is a key concern of historical linguistics.
no code implementations • WS 2018 • Sergiu Nisioi, Anca Bucur, Liviu P. Dinu
In this paper, we provide a lexical comparative analysis of the vocabulary used by customers and agents in an Enterprise Resource Planning (ERP) environment and a potential solution to clean the data and extract relevant content for NLP.
no code implementations • EMNLP 2018 • Cornelia Caragea, Liviu P. Dinu, Bogdan Dumitru
Identifying optimistic and pessimistic viewpoints and users from Twitter is useful for providing better social support to those who need such support, and for minimizing the negative influence among users and maximizing the spread of positive attitudes and ideas.
no code implementations • 14 Aug 2018 • Liviu P. Dinu, Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi
In this paper we present ensemble-based systems for dialect and language variety identification using the datasets made available by the organizers of the VarDial Evaluation Campaign 2018.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu
Proto-word reconstruction is central to the study of language evolution.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu
Language change across space and time is one of the main concerns in historical linguistics.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Shervin Malmasi, Liviu P. Dinu
In this paper we present the GDI_classification entry to the second German Dialect Identification (GDI) shared task organized within the scope of the VarDial Evaluation Campaign 2018.
no code implementations • COLING 2018 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Santanu Pal, Liviu P. Dinu
In this paper we present a system based on SVM ensembles trained on characters and words to discriminate between five similar languages of the Indo-Aryan family: Hindi, Braj Bhasha, Awadhi, Bhojpuri, and Magahi.
no code implementations • SEMEVAL 2018 • Bogdan Dumitru, Alina Maria Ciobanu, Liviu P. Dinu
Semantic difference detection attempts to capture whether a word is a discriminative attribute between two other words.
no code implementations • 25 Oct 2017 • Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith
In this paper, we investigate the application of text classification methods to support law professionals.
no code implementations • RANLP 2017 • Anca Dinu, Liviu P. Dinu, Bogdan Dumitru
In this article we propose a stylistic analysis of Solomon Marcus{'} non-scientific published texts, gathered in six volumes, aiming to uncover some of his quantitative and qualitative fingerprints.
no code implementations • WS 2017 • Liviu P. Dinu, Ana Sabina Uban
We investigate in this paper the problem of classifying the stylome of characters in a literary work.
no code implementations • WS 2017 • Marcos Zampieri, Alina Maria Ciobanu, Liviu P. Dinu
This paper presents an ensemble system combining the output of multiple SVM classifiers to native language identification (NLI).
no code implementations • 3 Jul 2017 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu
This paper presents a computational approach to author profiling taking gender and language variety into account.
1 code implementation • ACL 2017 • Sergiu Nisioi, Sanja {\v{S}}tajner, Simone Paolo Ponzetto, Liviu P. Dinu
Unlike the previously proposed automated TS systems, our neural text simplification (NTS) systems are able to simultaneously perform lexical simplification and content reduction.
Ranked #14 on
Text Simplification
on TurkCorpus
no code implementations • WS 2016 • Sergiu Nisioi, Alina Maria Ciobanu, Liviu P. Dinu
In this paper we describe the submission of the UniBuc-NLP team for the Discriminating between Similar Languages Shared Task, DSL 2016.
no code implementations • LREC 2016 • Alina Maria Ciobanu, Liviu P. Dinu
In this paper we conduct an initial study on the dialects of Romanian.
no code implementations • LREC 2016 • Octavia-Maria {\c{S}}ulea, Sergiu Nisioi, Liviu P. Dinu
In this paper we investigate the usefulness of neural word embeddings in the process of translating Named Entities (NEs) from a resource-rich language to a language low on resources relevant to the task at hand, introducing a novel, yet simple way of obtaining bilingual word vectors.
Chinese Named Entity Recognition
named-entity-recognition
+4
no code implementations • LREC 2016 • Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu, Shuly Wintner
We describe a monolingual English corpus of original and (human) translated texts, with an accurate annotation of speaker properties, including the original language of the utterances and the speaker{'}s country of origin.
no code implementations • LREC 2012 • Liviu P. Dinu, Vlad Niculae, Octavia-Maria {\c{S}}ulea
A recent analysis of the Romanian gender system described in (Bateman and Polinsky, 2010), based on older observations, argues that there are two lexically unspecified noun classes in the singular and two different ones in the plural and that what is generally called neuter in Romanian shares the class in the singular with masculines, and the class in the plural with feminines based not only on agreement features but also on form.