Search Results for author: Verginica Barbu Mititelu

Found 29 papers, 0 papers with code

It Takes Two to Tango – Towards a Multilingual MWE Resource

no code implementations CLIB 2020 Svetlozara Leseva, Verginica Barbu Mititelu, Ivelina Stoyanova

Mature wordnets offer the opportunity of digging out interesting linguistic information otherwise not explicitly marked in the network.

Vocal Bursts Valence Prediction

A Customizable WordNet Editor

no code implementations CLIB 2020 Andrei-Marius Avram, Verginica Barbu Mititelu

This paper presents an open-source wordnet editor that has been developed to ensure further expansion of the Romanian wordnet.

Aligning the Romanian Reference Treebank and the Valence Lexicon of Romanian Verbs

no code implementations LREC 2022 Ana-Maria Barbu, Verginica Barbu Mititelu, Cătălin Mititelu

We present here the efforts of aligning two language resources for Romanian: the Romanian Reference Treebank and the Valence Lexicon of Romanian Verbs: for each occurrence of those verbs in the treebank that were included as entries in the lexicon, a set of valence frames is automatically assigned, then manually validated by two linguists and, when necessary, corrected.

A Romanian Treebank Annotated with Verbal Multiword Expressions

no code implementations CLIB 2022 Verginica Barbu Mititelu, Mihaela Cristescu, Maria Mitrofan, Bianca-Mădălina Zgreabăn, Elena-Andreea Bărbulescu

In this paper we present a new version of the Romanian journalistic treebank annotated with verbal multiword expressions of four types: idioms, light verb constructions, reflexive verbs and inherently adpositional verbs, the last type being recently added to the corpus.

Challenges in Creating a Representative Corpus of Romanian Micro-Blogging Text

no code implementations CMLC (LREC) 2022 Vasile Pais, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Roxana Micu, Carol Luca Gasan

Following the successful creation of a national representative corpus of contemporary Romanian language, we turned our attention to the social media text, as present in micro-blogging platforms.

Use Case: Romanian Language Resources in the LOD Paradigm

no code implementations LDL (ACL) 2022 Verginica Barbu Mititelu, Elena Irimia, Vasile Pais, Andrei-Marius Avram, Maria Mitrofan

In this paper, we report on (i) the conversion of Romanian language resources to the Linked Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with other language resources (for Romanian or for other languages).

Word Embeddings

Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation

no code implementations17 Jun 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Vasile Păiş, Dumitru-Clementin Cercel, Ştefan Trăuşan-Matu

Correctly identifying multiword expressions (MWEs) is an important task for most natural language processing systems since their misidentification can result in ambiguity and misunderstanding of the underlying text.

Domain Adaptation

Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition

no code implementations22 Apr 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Dumitru-Clementin Cercel

Multiword expressions are a key ingredient for developing large-scale and linguistically sound natural language processing technology.

An Open-Domain QA System for e-Governance

no code implementations CLIB 2022 Radu Ion, Andrei-Marius Avram, Vasile Păiş, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Valentin Badea

The paper will present the QA system and its integration with the Romanian language technologies portal RELATE, the COVID-19 data set and different evaluations of the QA performance.

Open-Domain Question Answering

Human-Machine Interaction Speech Corpus from the ROBIN project

no code implementations22 Nov 2021 Vasile Păiş, Radu Ion, Andrei-Marius Avram, Elena Irimia, Verginica Barbu Mititelu, Maria Mitrofan

The paper contains a detailed description of the acquisition process, corpus statistics as well as an evaluation of the corpus influence on a low-latency ASR system as well as a dialogue component.

Hear about Verbal Multiword Expressions in the Bulgarian and the Romanian Wordnets Straight from the Horse's Mouth

no code implementations WS 2019 Verginica Barbu Mititelu, Ivelina Stoyanova, Svetlozara Leseva, Maria Mitrofan, Tsvetana Dimitrova, Maria Todorova

The contribution of this work is in outlining essential features of the description and classification of VMWEs and the cross-language comparison at the lexical level, which is essential for the understanding of the need for uniform annotation guidelines and a viable procedure for validation of the annotation.

Classification General Classification

MoNERo: a Biomedical Gold Standard Corpus for the Romanian Language

no code implementations WS 2019 Maria Mitrofan, Verginica Barbu Mititelu, Grigorina Mitrofan

In an era when large amounts of data are generated daily in various fields, the biomedical field among others, linguistic resources can be exploited for various tasks of Natural Language Processing.

The Romanian Corpus Annotated with Verbal Multiword Expressions

no code implementations WS 2019 Verginica Barbu Mititelu, Mihaela Cristescu, Mihaela Onofrei

This paper reports on the Romanian journalistic corpus annotated with verbal multiword expressions following the PARSEME guidelines.

Sentence

A hybrid pipeline of rules and machine learning to filter web-crawled parallel corpora

no code implementations WS 2018 Eduard Barbu, Verginica Barbu Mititelu

A hybrid pipeline comprising rules and machine learning is used to filter a noisy web English-German parallel corpus for the Parallel Corpus Filtering task.

BIG-bench Machine Learning Machine Translation +3

Adding Morpho-semantic Relations to the Romanian Wordnet

no code implementations LREC 2012 Verginica Barbu Mititelu

Keeping pace with other wordnets development, we present the challenges raised by the Romanian derivational system and our methodology for identifying derived words and their stems in the Romanian Wordnet.

Information Retrieval Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.