no code implementations • 27 Nov 2023 • Sotiris Karapiperis, Markos Diomataris, Vassilis Pitsikalis
Visual relations are complex, multimodal concepts that play an important role in the way humans perceive the world.
1 code implementation • 8 Nov 2023 • Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis
We present a novel self-supervised approach for representation learning, particularly for the task of Visual Relationship Detection (VRD).
no code implementations • 7 Sep 2023 • Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis
Transformer-based architectures have recently demonstrated remarkable performance in the Visual Question Answering (VQA) task.
no code implementations • 1 Sep 2022 • Petros Katsileros, Nikiforos Mandilaras, Dimitrios Mallis, Vassilis Pitsikalis, Stavros Theodorakis, Gil Chamiel
In this work we introduce an incremental learning framework for Click-Through-Rate (CTR) prediction and demonstrate its effectiveness for Taboola's massive-scale recommendation service.
1 code implementation • ICCV 2021 • Markos Diomataris, Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos
Scene Graph Generators (SGGs) are models that, given an image, build a directed graph where each edge represents a predicted subject predicate object triplet.
1 code implementation • 15 Feb 2019 • Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Koutras, Athanasia Zlatintsi, Petros Maragos
Detecting visual relationships, i. e. <Subject, Predicate, Object> triplets, is a challenging Scene Understanding task approached in the past via linguistic priors or spatial information in a single feature branch.