no code implementations • CVPR 2024 • Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black
We introduce WANDR, a data-driven model that takes an avatar's initial pose and a goal's 3D position and generates natural human motions that place the end effector (wrist) on the goal location.
no code implementations • 27 Nov 2023 • Sotiris Karapiperis, Markos Diomataris, Vassilis Pitsikalis
Visual relations are complex, multimodal concepts that play an important role in the way humans perceive the world.
1 code implementation • 8 Nov 2023 • Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis
We present a novel self-supervised approach for representation learning, particularly for the task of Visual Relationship Detection (VRD).
no code implementations • 7 Sep 2023 • Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis
Transformer-based architectures have recently demonstrated remarkable performance in the Visual Question Answering (VQA) task.
1 code implementation • ICCV 2021 • Markos Diomataris, Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos
Scene Graph Generators (SGGs) are models that, given an image, build a directed graph where each edge represents a predicted subject-predicate-object triplet.