Search Results for author: Liviu P. Dinu

Found 58 papers, 2 papers with code

Paper
Add Code

Pastiche Detection Based on Stopword Rankings. Exposing Impersonators of a Romanian Writer

no code implementations • WS 2012 • Liviu P. Dinu, Vlad Niculae, Maria-Octavia Sulea

Paper
Add Code

The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System

no code implementations • LREC 2012 • Liviu P. Dinu, Vlad Niculae, Octavia-Maria {\c{S}}ulea

A recent analysis of the Romanian gender system described in (Bateman and Polinsky, 2010), based on older observations, argues that there are two lexically unspecified noun classes in the singular and two different ones in the plural and that what is generally called neuter in Romanian shares the class in the singular with masculines, and the class in the plural with feminines based not only on agreement features but also on form.

General Classification

Paper
Add Code

Authorial Studies using Ranked Lexical Features

no code implementations • COLING 2012 • Liviu P. Dinu, Sergiu Nisioi

Paper
Add Code

Dealing with the Grey Sheep of the Romanian Gender System, the Neuter

no code implementations • COLING 2012 • Liviu P. Dinu, Vlad Niculae, Maria Sulea

Paper
Add Code

On the Romanian Rhyme Detection

no code implementations • COLING 2012 • Alina Ciobanu, Liviu P. Dinu

Paper
Add Code

Temporal Text Classification for Romanian Novels set in the Past

no code implementations • RANLP 2013 • Alina Maria Ciobanu, Liviu P. Dinu, Octavia-Maria {\c{S}}ulea, Anca Dinu, Vlad Niculae

General Classification text-classification +1

Paper
Add Code

A clustering approach for translationese identification

no code implementations • RANLP 2013 • Sergiu Nisioi, Liviu P. Dinu

Clustering

Paper
Add Code

Automatic Detection of Cognates Using Orthographic Alignment

no code implementations • ACL 2014 • Alina Maria Ciobanu, Liviu P. Dinu

Information Retrieval Language Acquisition +2

Paper
Add Code

An Etymological Approach to Cross-Language Orthographic Similarity. Application on Romanian

no code implementations • EMNLP 2014 • Alina Maria Ciobanu, Liviu P. Dinu

Language Acquisition Machine Translation

Paper
Add Code

AMBRA: A Ranking Approach to Temporal Text Classification

no code implementations • SEMEVAL 2015 • Marcos Zampieri, Alina Maria Ciobanu, Vlad Niculae, Liviu P. Dinu

General Classification Information Retrieval +2

Paper
Add Code

Automatic Discrimination between Cognates and Borrowings

no code implementations • IJCNLP 2015 • Alina Maria Ciobanu, Liviu P. Dinu

Machine Translation

Paper
Add Code

Cross-lingual Synonymy Overlap

no code implementations • RANLP 2015 • Anca Dinu, Liviu P. Dinu, Ana Sabina Uban

Information Retrieval Machine Translation +4

Paper
Add Code

Readability Assessment of Translated Texts

no code implementations • RANLP 2015 • Alina Maria Ciobanu, Liviu P. Dinu, Flaviu Pepelea

Machine Translation Speech Recognition +1

Paper
Add Code

A Computational Perspective on the Romanian Dialects

no code implementations • LREC 2016 • Alina Maria Ciobanu, Liviu P. Dinu

In this paper we conduct an initial study on the dialects of Romanian.

Dialect Identification General Classification

Paper
Add Code

A Corpus of Native, Non-native and Translated Texts

no code implementations • LREC 2016 • Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu, Shuly Wintner

We describe a monolingual English corpus of original and (human) translated texts, with an accurate annotation of speaker properties, including the original language of the utterances and the speaker{'}s country of origin.

Paper
Add Code

Using Word Embeddings to Translate Named Entities

no code implementations • LREC 2016 • Octavia-Maria {\c{S}}ulea, Sergiu Nisioi, Liviu P. Dinu

In this paper we investigate the usefulness of neural word embeddings in the process of translating Named Entities (NEs) from a resource-rich language to a language low on resources relevant to the task at hand, introducing a novel, yet simple way of obtaining bilingual word vectors.

Chinese Named Entity Recognition named-entity-recognition +5

Paper
Add Code

Vanilla Classifiers for Distinguishing between Similar Languages

no code implementations • WS 2016 • Sergiu Nisioi, Alina Maria Ciobanu, Liviu P. Dinu

In this paper we describe the submission of the UniBuc-NLP team for the Discriminating between Similar Languages Shared Task, DSL 2016.

Information Retrieval Language Identification +3

Paper
Add Code

Exploring Neural Text Simplification Models

1 code implementation • ACL 2017 • Sergiu Nisioi, Sanja {\v{S}}tajner, Simone Paolo Ponzetto, Liviu P. Dinu

Unlike the previously proposed automated TS systems, our neural text simplification (NTS) systems are able to simultaneously perform lexical simplification and content reduction.

Ranked #14 on Text Simplification on TurkCorpus

Lexical Simplification Machine Translation +2

Paper
Code

Including Dialects and Language Varieties in Author Profiling

no code implementations • 3 Jul 2017 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu

This paper presents a computational approach to author profiling taking gender and language variety into account.

Paper
Add Code

Native Language Identification on Text and Speech

no code implementations • WS 2017 • Marcos Zampieri, Alina Maria Ciobanu, Liviu P. Dinu

This paper presents an ensemble system combining the output of multiple SVM classifiers to native language identification (NLI).

Native Language Identification

Paper
Add Code

Finding a Character's Voice: Stylome Classification on Literary Characters

no code implementations • WS 2017 • Liviu P. Dinu, Ana Sabina Uban

We investigate in this paper the problem of classifying the stylome of characters in a literary work.

Authorship Attribution Classification +2

Paper
Add Code

On the stylistic evolution from communism to democracy: Solomon Marcus study case

no code implementations • RANLP 2017 • Anca Dinu, Liviu P. Dinu, Bogdan Dumitru

In this article we propose a stylistic analysis of Solomon Marcus{'} non-scientific published texts, gathered in six volumes, aiming to uncover some of his quantitative and qualitative fingerprints.

Paper
Add Code

Exploring the Use of Text Classification in the Legal Domain

no code implementations • 25 Oct 2017 • Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith

In this paper, we investigate the application of text classification methods to support law professionals.

General Classification text-classification +1

Paper
Add Code

ALB at SemEval-2018 Task 10: A System for Capturing Discriminative Attributes

no code implementations • SEMEVAL 2018 • Bogdan Dumitru, Alina Maria Ciobanu, Liviu P. Dinu

Semantic difference detection attempts to capture whether a word is a discriminative attribute between two other words.

Attribute Dependency Parsing +3

Paper
Add Code

Discriminating between Indo-Aryan Languages Using SVM Ensembles

no code implementations • COLING 2018 • Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Santanu Pal, Liviu P. Dinu

In this paper we present a system based on SVM ensembles trained on characters and words to discriminate between five similar languages of the Indo-Aryan family: Hindi, Braj Bhasha, Awadhi, Bhojpuri, and Magahi.

Language Identification

Paper
Add Code

German Dialect Identification Using Classifier Ensembles

no code implementations • COLING 2018 • Alina Maria Ciobanu, Shervin Malmasi, Liviu P. Dinu

In this paper we present the GDI_classification entry to the second German Dialect Identification (GDI) shared task organized within the scope of the VarDial Evaluation Campaign 2018.

Dialect Identification

Paper
Add Code

Ab Initio: Automatic Latin Proto-word Reconstruction

no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu

Proto-word reconstruction is central to the study of language evolution.

Paper
Add Code

Simulating Language Evolution: a Tool for Historical Linguistics

no code implementations • COLING 2018 • Alina Maria Ciobanu, Liviu P. Dinu

Language change across space and time is one of the main concerns in historical linguistics.

Machine Translation

Paper
Add Code

Classifier Ensembles for Dialect and Language Variety Identification

no code implementations • 14 Aug 2018 • Liviu P. Dinu, Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi

In this paper we present ensemble-based systems for dialect and language variety identification using the datasets made available by the organizers of the VarDial Evaluation Campaign 2018.

Dialect Identification

Paper
Add Code

Exploring Optimism and Pessimism in Twitter Using Deep Learning

no code implementations • EMNLP 2018 • Cornelia Caragea, Liviu P. Dinu, Bogdan Dumitru

Identifying optimistic and pessimistic viewpoints and users from Twitter is useful for providing better social support to those who need such support, and for minimizing the negative influence among users and maximizing the spread of positive attitudes and ideas.

Paper
Add Code

Content Extraction and Lexical Analysis from Customer-Agent Interactions

no code implementations • WS 2018 • Sergiu Nisioi, Anca Bucur, Liviu P. Dinu

In this paper, we provide a lexical comparative analysis of the vocabulary used by customers and agents in an Enterprise Resource Planning (ERP) environment and a potential solution to clean the data and extract relevant content for NLP.

ERP Lexical Analysis

Paper
Add Code

Studying Laws of Semantic Divergence across Languages using Cognate Sets

no code implementations • WS 2019 • Ana Uban, Alina Maria Ciobanu, Liviu P. Dinu

Semantic divergence in related languages is a key concern of historical linguistics.

Paper
Add Code

Linguistic classification: dealing jointly with irrelevance and inconsistency

no code implementations • RANLP 2019 • Laura Franzoi, Andrea Sgarro, Anca Dinu, Liviu P. Dinu

In this paper, we present new methods for language classification which put to good use both syntax and fuzzy tools, and are capable of dealing with irrelevant linguistic features (i. e. features which should not contribute to the classification) and even inconsistent features (which do not make sense for specific languages).

Classification General Classification

Paper
Add Code

From Image to Text in Sentiment Analysis via Regression and Deep Learning

no code implementations • RANLP 2019 • Daniela Onita, Liviu P. Dinu, Adriana Birlutiu

In this paper, we investigate an approach for mapping images to text for three types of sentiment categories: positive, neutral and negative.

regression Sentiment Analysis

Paper
Add Code

The Myth of Double-Blind Review Revisited: ACL vs. EMNLP

no code implementations • IJCNLP 2019 • Cornelia Caragea, Ana Uban, Liviu P. Dinu

We study this question on the ACL and EMNLP paper collections and present an analysis on how well deep learning techniques can infer the authors of a paper.

Paper
Add Code

Automatic Identification and Production of Related Words for Historical Linguistics

no code implementations • CL 2019 • Alina Maria Ciobanu, Liviu P. Dinu

We apply our method to multiple data sets, showing that our approach improves on previous results, also having the advantage of requiring less input data, which is essential in historical linguistics, where resources are generally scarce.

Paper
Add Code

Automatic Reconstruction of Missing Romanian Cognates and Unattested Latin Words

no code implementations • LREC 2020 • Alina Maria Ciobanu, Liviu P. Dinu, Laurentiu Zoicas

Producing related words is a key concern in historical linguistics.

Re-Ranking

Paper
Add Code

Automatically Building a Multilingual Lexicon of False Friends With No Supervision

no code implementations • LREC 2020 • Ana Sabina Uban, Liviu P. Dinu

Cognate words, defined as words in different languages which derive from a common etymon, can be useful for language learners, who can leverage the orthographical similarity of cognates to more easily understand a text in a foreign language.

Cross-Lingual Word Embeddings Language Acquisition +1

Paper
Add Code

Detecting Early Onset of Depression from Social Media Text using Learned Confidence Scores

no code implementations • 3 Nov 2020 • Ana-Maria Bucur, Liviu P. Dinu

Computational research on mental health disorders from written texts covers an interdisciplinary area between natural language processing and psychology.

Depression Detection

Paper
Add Code

Analyzing Stylistic Variation across Different Political Regimes

no code implementations • 2 Dec 2020 • Liviu P. Dinu, Ana-Sabina Uban

We also perform an analysis of the variation in topic between the two epochs, to compare with the variation at the style level.

Authorship Attribution Clustering

Paper
Add Code

A Computational Approach to Measuring the Semantic Divergence of Cognates

no code implementations • 2 Dec 2020 • Ana-Sabina Uban, Alina-Maria Ciobanu, Liviu P. Dinu

In this paper we investigate semantic divergence across languages by measuring the semantic similarity of cognate sets in multiple languages.

Cross-Lingual Word Embeddings Semantic Similarity +3

Paper
Add Code

An Exploratory Analysis of the Relation Between Offensive Language and Mental Health

no code implementations • Findings (ACL) 2021 • Ana-Maria Bucur, Marcos Zampieri, Liviu P. Dinu

In this paper, we analyze the interplay between the use of offensive language and mental health.

Depression Detection Language Identification +1

Paper
Add Code

Early Risk Detection of Pathological Gambling, Self-Harm and Depression Using BERT

no code implementations • 30 Jun 2021 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu

Early risk detection of mental illnesses has a massive positive impact upon the well-being of people.

Paper
Add Code

A Psychologically Informed Part-of-Speech Analysis of Depression in Social Media

no code implementations • RANLP 2021 • Ana-Maria Bucur, Ioana R. Podină, Liviu P. Dinu

In this work, we provide an extensive part-of-speech analysis of the discourse of social media users with depression.

Paper
Add Code

Sequence-to-Sequence Lexical Normalization with Multilingual Transformers

no code implementations • WNUT (ACL) 2021 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu

Our results show that while word-level, intrinsic, performance evaluation is behind other methods, our model improves performance on extrinsic, downstream tasks through normalization compared to models operating on raw, unprocessed, social media text.

Lexical Normalization Machine Translation +2

Paper
Add Code

Life is not Always Depressing: Exploring the Happy Moments of People Diagnosed with Depression

no code implementations • LREC 2022 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu

In this work, we explore the relationship between depression and manifestations of happiness in social media.

Paper
Add Code

An End-to-End Set Transformer for User-Level Classification of Depression and Gambling Disorder

no code implementations • 2 Jul 2022 • Ana-Maria Bucur, Adrian Cosma, Liviu P. Dinu, Paolo Rosso

This work proposes a transformer architecture for user-level classification of gambling addiction and depression that is trainable end-to-end.

Sentence

Paper
Add Code

It's Just a Matter of Time: Detecting Depression with Time-Enriched Multimodal Transformers

1 code implementation • 13 Jan 2023 • Ana-Maria Bucur, Adrian Cosma, Paolo Rosso, Liviu P. Dinu

In this work, we propose a flexible time-enriched multimodal transformer architecture for detecting depression from social media posts, using pretrained models for extracting image and text embeddings.

Depression Detection

Paper
Code

Tracking Semantic Change in Cognate Sets for English and Romance Languages

no code implementations • ACL (LChange) 2021 • Ana Sabina Uban, Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Laurentiu Zoicas

To this end, we introduce a new curated dataset of cognates in all pairs of those languages.

Word Embeddings

Paper
Add Code

Studying the Evolution of Scientific Topics and their Relationships

no code implementations • Findings (ACL) 2021 • Ana Sabina Uban, Cornelia Caragea, Liviu P. Dinu

Paper
Add Code

Investigating the Relationship Between Romanian Financial News and Closing Prices from the Bucharest Stock Exchange

no code implementations • LREC 2022 • Ioan-Bogdan Iordache, Ana Sabina Uban, Catalin Stoean, Liviu P. Dinu

It is encouraging that all models, be that they are applied to Romanian or English texts, indicate a correlation between the sentiment scores and the increase or decrease of the stock closing prices.

Translation

Paper
Add Code

RED v2: Enhancing RED Dataset for Multi-Label Emotion Detection

no code implementations • LREC 2022 • Alexandra Ciobotaru, Mihai Vlad Constantinescu, Liviu P. Dinu, Stefan Dumitrescu

RED (Romanian Emotion Dataset) is a machine learning-based resource developed for the automatic detection of emotions in Romanian texts, containing single-label annotated tweets with one of the following emotions: joy, fear, sadness, anger and neutral.

Multi-Label Classification regression

Paper
Add Code

Detecting Optimism in Tweets using Knowledge Distillation and Linguistic Analysis of Optimism

no code implementations • LREC 2022 • Ștefan Cobeli, Ioan-Bogdan Iordache, Shweta Yadav, Cornelia Caragea, Liviu P. Dinu, Dragoș Iliescu

Later, we devised a multi-task knowledge distillation framework to simultaneously learn the target task of optimism detection with the help of the auxiliary task of sentiment analysis and hate speech detection.

Hate Speech Detection Knowledge Distillation +1

Paper
Add Code

Towards an Etymological Map of Romanian

no code implementations • RANLP 2021 • Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Ana Sabina Uban, Laurentiu Zoicas

In this paper we investigate the etymology of Romanian words.

Paper
Add Code

A Computational Exploration of Pejorative Language in Social Media

no code implementations • Findings (EMNLP) 2021 • Liviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban, Marcos Zampieri

In this paper we study pejorative language, an under-explored topic in computational linguistics.

Word Sense Disambiguation

Paper
Add Code

Automatic Discrimination between Inherited and Borrowed Latin Words in Romance Languages

no code implementations • Findings (EMNLP) 2021 • Alina Maria Cristea, Liviu P. Dinu, Simona Georgescu, Mihnea-Lucian Mihai, Ana Sabina Uban

In this paper, we address the problem of automatically discriminating between inherited and borrowed Latin words.

Paper
Add Code

RED: A Novel Dataset for Romanian Emotion Detection from Tweets

no code implementations • RANLP 2021 • Alexandra Ciobotaru, Liviu P. Dinu

In this article we present some features of our novel dataset, and create a benchmark to achieve the first supervised machine learning model for automatic Emotion Detection in Romanian short texts.

BIG-bench Machine Learning Opinion Mining +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.