Search Results for author: Boshko Koloski

Found 19 papers, 5 papers with code

E8-IJS@LT-EDI-ACL2022 - BERT, AutoML and Knowledge-graph backed Detection of Depression

no code implementations LTEDI (ACL) 2022 Ilija Tavchioski, Boshko Koloski, Blaž Škrlj, Senja Pollak

Depression is a mental illness that negatively affects a person’s well-being and can, if left untreated, lead to serious consequences such as suicide.

AutoML

Interesting cross-border news discovery using cross-lingual article linking and document similarity

no code implementations EACL (Hackashop) 2021 Boshko Koloski, Elaine Zosa, Timen Stepišnik-Perdih, Blaž Škrlj, Tarmo Paju, Senja Pollak

Team Name: team-8 Embeddia Tool: Cross-Lingual Document Retrieval Zosa et al. Dataset: Estonian and Latvian news datasets abstract: Contemporary news media face increasing amounts of available data that can be of use when prioritizing, selecting and discovering new news.

Retrieval

SEKE: Specialised Experts for Keyword Extraction

1 code implementation18 Dec 2024 Matej Martinc, Hanh Thi Hong Tran, Senja Pollak, Boshko Koloski

Relying on the insight that real-world keyword detection often requires handling of diverse content, we propose a novel supervised keyword extraction approach based on the mixture of experts (MoE) technique.

Descriptive Keyword Extraction

Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis

no code implementations30 Sep 2024 Luka Andrenšek, Boshko Koloski, Andraž Pelicon, Nada Lavrač, Senja Pollak, Matthew Purver

We investigate zero-shot cross-lingual news sentiment detection, aiming to develop robust sentiment classifiers that can be deployed across multiple languages without target-language training data.

Cross-Lingual Transfer In-Context Learning +2

ICML Topological Deep Learning Challenge 2024: Beyond the Graph Domain

no code implementations8 Sep 2024 Guillermo Bernárdez, Lev Telyatnikov, Marco Montagna, Federica Baccini, Mathilde Papillon, Miquel Ferriol-Galmés, Mustafa Hajij, Theodore Papamarkou, Maria Sofia Bucarelli, Olga Zaghen, Johan Mathe, Audun Myers, Scott Mahan, Hansen Lillemark, Sharvaree Vadgama, Erik Bekkers, Tim Doster, Tegan Emerson, Henry Kvinge, Katrina Agate, Nesreen K Ahmed, Pengfei Bai, Michael Banf, Claudio Battiloro, Maxim Beketov, Paul Bogdan, Martin Carrasco, Andrea Cavallo, Yun Young Choi, George Dasoulas, Matouš Elphick, Giordan Escalona, Dominik Filipiak, Halley Fritze, Thomas Gebhart, Manel Gil-Sorribes, Salvish Goomanee, Victor Guallar, Liliya Imasheva, Andrei Irimia, Hongwei Jin, Graham Johnson, Nikos Kanakaris, Boshko Koloski, Veljko Kovač, Manuel Lecha, Minho Lee, Pierrick Leroy, Theodore Long, German Magai, Alvaro Martinez, Marissa Masden, Sebastian Mežnar, Bertran Miquel-Oliver, Alexis Molina, Alexander Nikitin, Marco Nurisso, Matt Piekenbrock, Yu Qin, Patryk Rygiel, Alessandro Salatiello, Max Schattauer, Pavel Snopov, Julian Suk, Valentina Sánchez, Mauricio Tec, Francesco Vaccarino, Jonas Verhellen, Frederic Wantiez, Alexander Weers, Patrik Zajec, Blaž Škrlj, Nina Miolane

This paper describes the 2nd edition of the ICML Topological Deep Learning Challenge that was hosted within the ICML 2024 ELLIS Workshop on Geometry-grounded Representation Learning and Generative Modeling (GRaM).

Deep Learning Representation Learning

AutoML-guided Fusion of Entity and LLM-based Representations for Document Classification

no code implementations19 Aug 2024 Boshko Koloski, Senja Pollak, Roberto Navigli, Blaž Škrlj

This work demonstrates that injecting embedded information from knowledge bases can augment the performance of contemporary Large Language Model (LLM)-based representations for the task of text classification.

AutoML Document Classification +5

Multi-Task Learning for Features Extraction in Financial Annual Reports

1 code implementation8 Apr 2024 Syrielle Montariol, Matej Martinc, Andraž Pelicon, Senja Pollak, Boshko Koloski, Igor Lončarski, Aljoša Valentinčič

For assessing various performance indicators of companies, the focus is shifting from strictly financial (quantitative) publicly disclosed information to qualitative (textual) information.

Multi-Task Learning Sentence +2

AHAM: Adapt, Help, Ask, Model -- Harvesting LLMs for literature mining

no code implementations25 Dec 2023 Boshko Koloski, Nada Lavrač, Bojan Cestnik, Senja Pollak, Blaž Škrlj, Andrej Kastrin

Our system aims to reduce both the ratio of outlier topics to the total number of topics and the similarity between topic definitions.

Domain Adaptation Language Modelling +6

Latent Graphs for Semi-Supervised Learning on Biomedical Tabular Data

no code implementations27 Sep 2023 Boshko Koloski, Nada Lavrač, Senja Pollak, Blaž Škrlj

In the domain of semi-supervised learning, the current approaches insufficiently exploit the potential of considering inter-instance relationships among (un)labeled data.

Measuring Catastrophic Forgetting in Cross-Lingual Transfer Paradigms: Exploring Tuning Strategies

no code implementations12 Sep 2023 Boshko Koloski, Blaž Škrlj, Marko Robnik-Šikonja, Senja Pollak

As cross-lingual transfer strategies, we compare the intermediate-training (\textit{IT}) that uses each language sequentially and cross-lingual validation (\textit{CLV}) that uses a target language already in the validation phase of fine-tuning.

Cross-Lingual Transfer Hate Speech Detection

Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised?

no code implementations LREC 2022 Boshko Koloski, Senja Pollak, Blaž Škrlj, Matej Martinc

We find that the pretrained models fine-tuned on a multilingual corpus covering languages that do not appear in the test set (i. e. in a zero-shot setting), consistently outscore unsupervised models in all six languages.

Keyword Extraction Pretrained Multilingual Language Models

Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles

2 code implementations20 Oct 2021 Boshko Koloski, Timen Stepišnik-Perdih, Marko Robnik-Šikonja, Senja Pollak, Blaž Škrlj

Increasing amounts of freely available data both in textual and relational form offers exploration of richer document representations, potentially improving the model performance and robustness.

Classification Fake News Detection +5

Extending Neural Keyword Extraction with TF-IDF tagset matching

1 code implementation EACL (Hackashop) 2021 Boshko Koloski, Senja Pollak, Blaž Škrlj, Matej Martinc

Keyword extraction is the task of identifying words (or multi-word expressions) that best describe a given document and serve in news portals to link articles of similar topics.

Keyword Extraction

Identification of COVID-19 related Fake News via Neural Stacking

no code implementations11 Jan 2021 Boshko Koloski, Timen Stepišnik Perdih, Senja Pollak, Blaž Škrlj

Identification of Fake News plays a prominent role in the ongoing pandemic, impacting multiple aspects of day-to-day life.

Fake News Detection General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.