Search Results for author: Ioannis Kakogeorgiou

Found 9 papers, 7 papers with code

Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers

1 code implementation14 Jan 2025 Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris, Nikos Komodakis

Our approach incorporates a multimodal masked visual modeling objective and a novel masking mechanism designed for multimodal training.

Future prediction Prediction +1

$\texttt{DINO-Foresight}$: Looking into the Future with DINO

1 code implementation16 Dec 2024 Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris, Nikos Komodakis

Our approach trains a masked feature transformer in a self-supervised manner to predict the evolution of VFM features over time.

Autonomous Driving Scene Understanding

Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?

1 code implementation ICCV 2023 Bill Psomas, Ioannis Kakogeorgiou, Konstantinos Karantzalos, Yannis Avrithis

By discussing the properties of each group of methods, we derive SimPool, a simple attention-based pooling mechanism as a replacement of the default one for both convolutional and transformer encoders.

Image Classification Image Retrieval +5

What to Hide from Your Students: Attention-Guided Masked Image Modeling

1 code implementation23 Mar 2022 Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, Nikos Komodakis

In this work, we argue that image token masking differs from token masking in text, due to the amount and correlation of tokens in an image.

Language Modeling Language Modelling +2

MARIDA: A benchmark for Marine Debris detection from Sentinel-2 remote sensing data

1 code implementation Plos one journal 2022 Katerina Kikaki, Ioannis Kakogeorgiou, Paraskevi Mikeli, Dionysios E. Raitsos, Konstantinos Karantzalos

Currently, a significant amount of research is focused on detecting Marine Debris and assessing its spectral behaviour via remote sensing, ultimately aiming at new operational monitoring solutions.

Image Segmentation Multi-Label Classification +4

Evaluating explainable artificial intelligence methods for multi-label deep learning classification tasks in remote sensing

no code implementations3 Apr 2021 Ioannis Kakogeorgiou, Konstantinos Karantzalos

Although deep neural networks hold the state-of-the-art in several remote sensing tasks, their black-box operation hinders the understanding of their decisions, concealing any bias and other shortcomings in datasets and model performance.

Explainable artificial intelligence Explainable Artificial Intelligence (XAI) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.