Search Results for author: Dumitru-Clementin Cercel

Found 33 papers, 5 papers with code

Approaching SMM4H 2020 with Ensembles of BERT Flavours

no code implementations SMM4H (COLING) 2020 George-Andrei Dima, Andrei-Marius Avram, Dumitru-Clementin Cercel

This paper describes our solutions submitted to the Social Media Mining for Health Applications (#SMM4H) Shared Task 2020.

Task 2

Exploring the Power of Romanian BERT for Dialect Identification

no code implementations VarDial (COLING) 2020 George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea

Dialect identification represents a key aspect for improving a series of tasks, for example, opinion mining, considering that the location of the speaker can greatly influence the attitude towards a subject.

Dialect Identification Opinion Mining

Evaluating Data Augmentation Techniques for Coffee Leaf Disease Classification

no code implementations11 Jan 2024 Adrian Gheorghiu, Iulian-Marius Tăiatu, Dumitru-Clementin Cercel, Iuliana Marin, Florin Pop

As the RoCoLe dataset is imbalanced and does not have many samples, fine-tuning of pre-trained models and multiple augmentation techniques need to be used.

Classification Data Augmentation +2

Explainability-Driven Leaf Disease Classification Using Adversarial Training and Knowledge Distillation

no code implementations30 Dec 2023 Sebastian-Vasile Echim, Iulian-Marius Tăiatu, Dumitru-Clementin Cercel, Florin Pop

Through our experiments, we determine that on a benchmark dataset, the robustness can be the price of the classification accuracy with performance reductions of 3%-20% for regular tests and gains of 50%-70% for adversarial attack tests.

Adversarial Attack Classification +4

End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition

no code implementations7 Oct 2023 Emilian-Claudiu Mănescu, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Lip reading or visual speech recognition has gained significant attention in recent years, particularly because of hardware development and innovations in computer vision.

Domain Adaptation Lip Reading +2

SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification

1 code implementation16 Aug 2023 Vlad-Constantin Lungu-Stan, Dumitru-Clementin Cercel, Florin Pop

By adding classification heads at each level of the transformer and employing a cascading distillation process, we improve the balanced multi-class accuracy of the base model by 2. 1%, while creating a range of models of various sizes but comparable performance.

Classification Knowledge Distillation +3

From Fake to Hyperpartisan News Detection Using Domain Adaptation

no code implementations4 Aug 2023 Răzvan-Alexandru Smădu, Sebastian-Vasile Echim, Dumitru-Clementin Cercel, Iuliana Marin, Florin Pop

In the current work, we explore the effects of various unsupervised domain adaptation techniques between two text classification tasks: fake and hyperpartisan news detection.

Clustering Contrastive Learning +5

Towards Improving the Performance of Pre-Trained Speech Models for Low-Resource Languages Through Lateral Inhibition

no code implementations30 Jun 2023 Andrei-Marius Avram, Răzvan-Alexandru Smădu, Vasile Păiş, Dumitru-Clementin Cercel, Radu Ion, Dan Tufiş

With the rise of bidirectional encoder representations from Transformer models in natural language processing, the speech community has adopted some of their development methodologies.

Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation

no code implementations17 Jun 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Vasile Păiş, Dumitru-Clementin Cercel, Ştefan Trăuşan-Matu

Correctly identifying multiword expressions (MWEs) is an important task for most natural language processing systems since their misidentification can result in ambiguity and misunderstanding of the underlying text.

Domain Adaptation

Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis

no code implementations13 Jun 2023 Sebastian-Vasile Echim, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Satire detection and sentiment analysis are intensively explored natural language processing (NLP) tasks that study the identification of the satirical tone from texts and extracting sentiments in relationship with their targets.

Satire Detection Sentiment Analysis

RoBERTweet: A BERT Language Model for Romanian Tweets

no code implementations11 Jun 2023 Iulian-Marius Tăiatu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Developing natural language processing (NLP) systems for social media analysis remains an important topic in artificial intelligence research.

Language Identification Language Modelling +2

Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition

no code implementations22 Apr 2023 Andrei-Marius Avram, Verginica Barbu Mititelu, Dumitru-Clementin Cercel

Multiword expressions are a key ingredient for developing large-scale and linguistically sound natural language processing technology.

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

no code implementations30 Dec 2022 Răzvan-Alexandru Smădu, George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Mihai Dascalu, Florin Pop

Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic.

Domain Adaptation Information Retrieval +3

UPB at SemEval-2022 Task 5: Enhancing UNITER with Image Sentiment and Graph Convolutional Networks for Multimedia Automatic Misogyny Identification

1 code implementation SemEval (NAACL) 2022 Andrei Paraschiv, Mihai Dascalu, Dumitru-Clementin Cercel

In recent times, the detection of hate-speech, offensive, or abusive language in online media has become an important topic in NLP research due to the exponential growth of social media and the propagation of such messages, as well as their impact.

Abusive Language Hate Speech Detection

Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification

no code implementations ACL 2022 George-Eduard Zaharia, Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel, Mihai Dascalu

Our model obtains a boost of up to 2. 42% in terms of Pearson Correlation Coefficients in contrast to vanilla training techniques, when considering the CompLex from the Lexical Complexity Prediction 2021 dataset.

Complex Word Identification Domain Adaptation +2

Distilling the Knowledge of Romanian BERTs Using Multiple Teachers

1 code implementation LREC 2022 Andrei-Marius Avram, Darius Catrina, Dumitru-Clementin Cercel, Mihai Dascălu, Traian Rebedea, Vasile Păiş, Dan Tufiş

In this work, we introduce three light and fast versions of distilled BERT models for the Romanian language: Distil-BERT-base-ro, Distil-RoBERT-base, and DistilMulti-BERT-base-ro.

Dialect Identification Knowledge Distillation +9

UPB at SemEval-2021 Task 5: Virtual Adversarial Training for Toxic Spans Detection

no code implementations SEMEVAL 2021 Andrei Paraschiv, Dumitru-Clementin Cercel, Mihai Dascalu

The real-world impact of polarization and toxicity in the online sphere marked the end of 2020 and the beginning of this year in a negative way.

Toxic Spans Detection

UPB at SemEval-2021 Task 1: Combining Deep Learning and Hand-Crafted Features for Lexical Complexity Prediction

no code implementations SEMEVAL 2021 George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu

Our models are applicable on both subtasks and achieve good performance results, with a MAE below 0. 07 and a Person correlation of . 73 for single word identification, as well as a MAE below 0. 08 and a Person correlation of . 79 for multiple word targets.

Lexical Complexity Prediction Word Embeddings

UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for Detecting and Rating Humor and Offense

no code implementations SEMEVAL 2021 Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel, Mihai Dascalu

Detecting humor is a challenging task since words might share multiple valences and, depending on the context, the same words can be even used in offensive expressions.

Multi-Task Learning text-classification +1

UPB at SemEval-2020 Task 12: Multilingual Offensive Language Detection on Social Media by Fine-tuning a Variety of BERT-based Models

no code implementations SEMEVAL 2020 Mircea-Adrian Tanase, Dumitru-Clementin Cercel, Costin-Gabriel Chiru

Offensive language detection is one of the most challenging problem in the natural language processing field, being imposed by the rising presence of this phenomenon in online social media.

Cross-Lingual Transfer Learning for Complex Word Identification

no code implementations2 Oct 2020 George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu

Our aim is to provide evidence that the proposed models can learn the characteristics of complex words in a multilingual environment by relying on the CWI shared task 2018 dataset available for four different languages (i. e., English, German, Spanish, and also French).

Complex Word Identification Cross-Lingual Transfer +3

UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT

no code implementations SEMEVAL 2020 Andrei Paraschiv, Dumitru-Clementin Cercel, Mihai Dascalu

Manipulative and misleading news have become a commodity for some online news outlets and these news have gained a significant impact on the global mindset of people.

Propaganda span identification

Cannot find the paper you are looking for? You can Submit a new open access paper.