Search Results for author: Andrei-Marius Avram

Found 25 papers, 8 papers with code

Exploring the Power of Romanian BERT for Dialect Identification

no code implementations VarDial (COLING) 2020 George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea

Dialect identification represents a key aspect for improving a series of tasks, for example, opinion mining, considering that the location of the speaker can greatly influence the attitude towards a subject.

Dialect Identification Opinion Mining

A Customizable WordNet Editor

no code implementations CLIB 2020 Andrei-Marius Avram, Verginica Barbu Mititelu

This paper presents an open-source wordnet editor that has been developed to ensure further expansion of the Romanian wordnet.

Use Case: Romanian Language Resources in the LOD Paradigm

no code implementations LDL (ACL) 2022 Verginica Barbu Mititelu, Elena Irimia, Vasile Pais, Andrei-Marius Avram, Maria Mitrofan

In this paper, we report on (i) the conversion of Romanian language resources to the Linked Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with other language resources (for Romanian or for other languages).

Word Embeddings

RACAI@SMM4H’22: Tweets Disease Mention Detection Using a Neural Lateral Inhibitory Mechanism

no code implementations SMM4H (COLING) 2022 Andrei-Marius Avram, Vasile Pais, Maria Mitrofan

This paper presents our system employed for the Social Media Mining for Health (SMM4H) 2022 competition Task 10 - SocialDisNER.

Romanian Language Translation in the RELATE Platform

no code implementations LoResMT (COLING) 2022 Vasile Pais, Maria Mitrofan, Andrei-Marius Avram

This paper presents the usage of the RELATE platform for translation tasks involving the Romanian language.

Translation

Approaching SMM4H 2020 with Ensembles of BERT Flavours

no code implementations SMM4H (COLING) 2020 George-Andrei Dima, Andrei-Marius Avram, Dumitru-Clementin Cercel

This paper describes our solutions submitted to the Social Media Mining for Health Applications (#SMM4H) Shared Task 2020.

Task 2

End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition

no code implementations 7 Oct 2023 Emilian-Claudiu Mănescu, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Lip reading or visual speech recognition has gained significant attention in recent years, particularly because of hardware development and innovations in computer vision.

Domain Adaptation Lip Reading +2

Towards Improving the Performance of Pre-Trained Speech Models for Low-Resource Languages Through Lateral Inhibition

no code implementations 30 Jun 2023 Andrei-Marius Avram, Răzvan-Alexandru Smădu, Vasile Păiş, Dumitru-Clementin Cercel, Radu Ion, Dan Tufiş

With the rise of bidirectional encoder representations from Transformer models in natural language processing, the speech community has adopted some of their development methodologies.

Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation

no code implementations 17 Jun 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Vasile Păiş, Dumitru-Clementin Cercel, Ştefan Trăuşan-Matu

Correctly identifying multiword expressions (MWEs) is an important task for most natural language processing systems since their misidentification can result in ambiguity and misunderstanding of the underlying text.

Domain Adaptation

Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis

no code implementations 13 Jun 2023 Sebastian-Vasile Echim, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Satire detection and sentiment analysis are intensively explored natural language processing (NLP) tasks that study the identification of the satirical tone from texts and extracting sentiments in relationship with their targets.

Satire Detection Sentiment Analysis

RoBERTweet: A BERT Language Model for Romanian Tweets

no code implementations 11 Jun 2023 Iulian-Marius Tăiatu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Developing natural language processing (NLP) systems for social media analysis remains an important topic in artificial intelligence research.

Language Identification Language Modelling +2

Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition

no code implementations 22 Apr 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Dumitru-Clementin Cercel

Multiword expressions are a key ingredient for developing large-scale and linguistically sound natural language processing technology.

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

no code implementations 30 Dec 2022 Răzvan-Alexandru Smădu, George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Mihai Dascalu, Florin Pop

Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic.

Domain Adaptation Information Retrieval +3

An Open-Domain QA System for e-Governance

no code implementations CLIB 2022 Radu Ion, Andrei-Marius Avram, Vasile Păiş, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Valentin Badea

The paper will present the QA system and its integration with the Romanian language technologies portal RELATE, the COVID-19 data set and different evaluations of the QA performance.

Open-Domain Question Answering

Distilling the Knowledge of Romanian BERTs Using Multiple Teachers

1 code implementation LREC 2022 Andrei-Marius Avram, Darius Catrina, Dumitru-Clementin Cercel, Mihai Dascălu, Traian Rebedea, Vasile Păiş, Dan Tufiş

In this work, we introduce three light and fast versions of distilled BERT models for the Romanian language: Distil-BERT-base-ro, Distil-RoBERT-base, and DistilMulti-BERT-base-ro.

Dialect Identification Knowledge Distillation +9

Romanian Speech Recognition Experiments from the ROBIN Project

1 code implementation 23 Nov 2021 Andrei-Marius Avram, Vasile Păiş, Dan Tufiş

One of the fundamental functionalities for accepting a socially assistive robot is its communication capabilities with other agents in the environment.

Language Modelling speech-recognition +1

Human-Machine Interaction Speech Corpus from the ROBIN project

no code implementations 22 Nov 2021 Vasile Păiş, Radu Ion, Andrei-Marius Avram, Elena Irimia, Verginica Barbu Mititelu, Maria Mitrofan

The paper contains a detailed description of the acquisition process, corpus statistics, and an evaluation of the corpus influence on a low-latency ASR system as well as a dialogue component.

PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors

2 code implementations RANLP 2021 Andrei-Marius Avram, Vasile Pais, Dan Tufis

EuroVoc is a multilingual thesaurus that was built for organizing the legislative documentary of the European Union institutions.

Document Classification Specificity
