no code implementations • 12 Mar 2025 • Gorjan Radevski, Teodora Popordanoska, Matthew B. Blaschko, Tinne Tuytelaars
Audio-visual understanding is a rapidly evolving field that seeks to integrate and interpret information from both auditory and visual modalities.
no code implementations • 14 Dec 2023 • Teodora Popordanoska, Gorjan Radevski, Tinne Tuytelaars, Matthew B. Blaschko
In the face of dataset shift, model calibration plays a pivotal role in ensuring the reliability of machine learning systems.
1 code implementation • 23 Oct 2023 • Gorjan Radevski, Kiril Gashteovski, Chia-Chien Hung, Carolin Lawrence, Goran Glavaš
Open Information Extraction (OIE) methods extract facts from natural language text in the form of ("subject"; "relation"; "object") triples.
1 code implementation • ICCV 2023 • Gorjan Radevski, Dusan Grujicic, Marie-Francine Moens, Matthew Blaschko, Tinne Tuytelaars
The goal of this work is to retain the performance of such a multimodal approach, while using only the RGB frames as input at inference time.
no code implementations • 9 Oct 2022 • Gorjan Radevski, Dusan Grujicic, Matthew Blaschko, Marie-Francine Moens, Tinne Tuytelaars
Our approach is based on multimodal knowledge distillation, featuring a multimodal teacher (in the current experiments trained only using object detections, optical flow and RGB frames) and a unimodal student (using only RGB frames as input).
1 code implementation • 2 Nov 2021 • Gorjan Radevski, Marie-Francine Moens, Tinne Tuytelaars
Recognizing human actions is fundamentally a spatio-temporal reasoning problem, and should be, at least to some extent, invariant to the appearance of the human and the objects involved.
Ranked #36 on
Action Classification
on Charades
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Gorjan Radevski, Guillem Collell, Marie-Francine Moens, Tinne Tuytelaars
We address the problem of multimodal spatial understanding by decoding a set of language-expressed spatial relations to a set of 2D spatial arrangements in a multi-object and multi-relationship setting.
1 code implementation • CONLL 2020 • Dusan Grujicic, Gorjan Radevski, Tinne Tuytelaars, Matthew Blaschko
In this paper, we develop a method for grounding medical text into a physically meaningful and interpretable space corresponding to a human atlas.
1 code implementation • ACL 2020 • Dusan Grujicic, Gorjan Radevski, Tinne Tuytelaars, Matthew Blaschko
In this paper, we aim to develop a self-supervised grounding of Covid-related medical text based on the actual spatial relationships between the referred anatomical concepts.