Search Results for author: Erion Çano

Found 24 papers, 8 papers with code

AlbNews: A Corpus of Headlines for Topic Modeling in Albanian

1 code implementation | 6 Feb 2024 | Erion Çano, Dario Lamaj

The scarcity of available text corpora for low-resource languages like Albanian is a serious hurdle for research in natural language processing tasks.

Ensemble Learning

AlbNER: A Corpus for Named Entity Recognition in Albanian

no code implementations | 15 Sep 2023 | Erion Çano

Scarcity of resources such as annotated text corpora for under-resourced languages like Albanian is a serious impediment in computational linguistics and natural language processing research.

Named Entity Recognition +1

CSREU: A Novel Dataset about Corporate Social Responsibility and Performance Indicators

no code implementations | 16 Jun 2023 | Erion Çano, Xhesilda Vogli

Corporate Social Responsibility (CSR) has become an important topic that is gaining academic interest.

AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian

2 code implementations | 14 Jun 2023 | Erion Çano

Lack of available resources such as text corpora for low-resource languages seriously hinders research on natural language processing and computational linguistics.

Sentiment Analysis

MemeGraphs: Linking Memes to Knowledge Graphs

1 code implementation | 28 May 2023 | Vasiliki Kougia, Simon Fetzel, Thomas Kirchmair, Erion Çano, Sina Moayed Baharlou, Sahand Sharifzadeh, Benjamin Roth

In this work, we propose to use scene graphs, which express images in terms of objects and their visual relations, and knowledge graphs as structured representations for meme classification with a Transformer-based architecture.

Entity Linking, Knowledge Graphs +1

CSRCZ: A Dataset About Corporate Social Responsibility in Czech Republic

1 code implementation | 5 Jan 2023 | Xhesilda Vogli, Erion Çano

As stakeholders' pressure on corporates for disclosing their corporate social responsibility operations grows, it is crucial to understand how efficient corporate disclosure systems are in bridging the gap between corporate social responsibility reports and their actual practice.

Batch Layer Normalization, A new normalization layer for CNNs and RNN

1 code implementation | 19 Sep 2022 | Amir Ziaee, Erion Çano

As a combined version of batch and layer normalization, BLN adaptively puts appropriate weight on mini-batch and feature normalization based on the inverse size of mini-batches to normalize the input to a layer during the learning process.

Topic Segmentation of Research Article Collections

no code implementations | 18 May 2022 | Erion Çano, Benjamin Roth

In this work, we perform topic segmentation of a paper data collection that we crawled and produce a multitopic dataset of roughly seven million paper data records.

Named Entity Recognition +3

Focused Contrastive Training for Test-based Constituency Analysis

no code implementations | 30 Sep 2021 | Benjamin Roth, Erion Çano

We propose a scheme for self-training of grammaticality models for constituency analysis based on linguistic tests.

Language Modelling, Position +1

How Many Pages? Paper Length Prediction from the Metadata

1 code implementation | 29 Oct 2020 | Erion Çano, Ondřej Bojar

Being able to predict the length of a scientific paper may be helpful in numerous situations.

BIG-bench Machine Learning, regression

A Data-driven Neural Network Architecture for Sentiment Analysis

no code implementations | 30 Jun 2020 | Erion Çano, Maurizio Morisio

Our results indicate that parallel convolutions of filter lengths up to three are usually enough for capturing relevant text features.

Sentiment Analysis

Mood-based On-Car Music Recommendations

no code implementations | 25 Jun 2020 | Erion Çano, Riccardo Coppola, Eleonora Gargiulo, Marco Marengo, Maurizio Morisio

Driving and listening to music are two inseparable everyday activities for millions of people around the world.

Recommendation Systems

Automating Text Naturalness Evaluation of NLG Systems

no code implementations | 23 Jun 2020 | Erion Çano, Ondřej Bojar

Instead of relying on human participants for scoring or labeling the text samples, we propose to automate the process by using a human likeliness metric we define and a discrimination procedure based on large pretrained language models with their probability distributions.

Text Generation

Human or Machine: Automating Human Likeliness Evaluation of NLG Texts

no code implementations | 5 Jun 2020 | Erion Çano, Ondřej Bojar

Automatic evaluation of various text quality criteria produced by data-driven intelligent methods is very common and useful because it is cheap, fast, and usually yields repeatable results.

Text Generation

Quality of Word Embeddings on Sentiment Analysis Tasks

no code implementations | 6 Mar 2020 | Erion Çano, Maurizio Morisio

The quality of word embeddings and the performance of their applications depend on several factors, such as training method, corpus size, and relevance.

Machine Translation, Sentiment Analysis +2

Two Huge Title and Keyword Generation Corpora of Research Articles

no code implementations | 11 Feb 2020 | Erion Çano, Ondřej Bojar

Recent developments in sequence-to-sequence learning with neural networks have considerably improved the quality of automatically generated text summaries and document keywords, stimulating the need for even bigger training corpora.

Text Summarization, Vocal Bursts Valence Prediction

Keyphrase Generation: A Multi-Aspect Survey

no code implementations | 11 Oct 2019 | Erion Çano, Ondřej Bojar

In this survey, we examine various aspects of the extractive keyphrase generation methods and focus mostly on the more recent abstractive methods that are based on neural networks.

Keyphrase Generation, Text Summarization

Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study

no code implementations | 14 Sep 2019 | Erion Çano, Ondřej Bojar

Using data-driven models for solving text summarization or similar tasks has become very common in recent years.

Text Summarization

Keyphrase Generation: A Text Summarization Struggle

no code implementations | 29 Mar 2019 | Erion Çano, Ondřej Bojar

Most of the proposed supervised and unsupervised methods for keyphrase generation are unable to produce terms that are valuable but do not appear in the text.

Keyphrase Generation, Text Summarization

Word Embeddings for Sentiment Analysis: A Comprehensive Empirical Survey

no code implementations | 2 Feb 2019 | Erion Çano, Maurizio Morisio

This work investigates the role of factors like training method, training corpus size and thematic relevance of texts in the performance of word embedding features on sentiment analysis of tweets, song lyrics, movie reviews and item reviews.

Sentiment Analysis, Word Embeddings

Hybrid Recommender Systems: A Systematic Literature Review

1 code implementation | 12 Jan 2019 | Erion Çano, Maurizio Morisio

Also, cold-start and data sparsity are the two traditional and top problems, addressed in 23 and 22 studies respectively, while movies and movie datasets are still widely used by most of the authors.

Collaborative Filtering, Recommendation Systems

Sentiment Analysis of Czech Texts: An Algorithmic Survey

no code implementations | 9 Jan 2019 | Erion Çano, Ondřej Bojar

In the area of online communication, commerce and transactions, analyzing sentiment polarity of texts written in various natural languages has become crucial.

Sentiment Analysis

Text-based Sentiment Analysis and Music Emotion Recognition

no code implementations | 6 Oct 2018 | Erion Çano

Second, there are various uncertainties regarding the use of word embedding vectors: should they be generated from the same data set that is used to train the model, or is it better to source them from big and popular collections?

Emotion Recognition, Music Emotion Recognition +3
