Search Results for author: Maria Parelli

Found 4 papers, 2 papers with code

HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video

1 code implementation30 Nov 2023 Zicong Fan, Maria Parelli, Maria Eleni Kadoglou, Muhammed Kocabas, Xu Chen, Michael J. Black, Otmar Hilliges

Since humans interact with diverse objects every day, the holistic 3D capture of these interactions is important to understand and model human behaviour.

3D Reconstruction Object +1

Interpretable Visual Question Answering via Reasoning Supervision

no code implementations7 Sep 2023 Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis

Transformer-based architectures have recently demonstrated remarkable performance in the Visual Question Answering (VQA) task.

Common Sense Reasoning Question Answering +2

Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes

no code implementations4 Jun 2023 Alexandros Delitzas, Maria Parelli, Nikolas Hars, Georgios Vlassis, Sotirios Anagnostidis, Gregor Bachmann, Thomas Hofmann

Training models to apply common-sense linguistic knowledge and visual concepts from 2D images to 3D scene understanding is a promising direction that researchers have only recently started to explore.

Common Sense Reasoning Question Answering +2

CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes

1 code implementation12 Apr 2023 Maria Parelli, Alexandros Delitzas, Nikolas Hars, Georgios Vlassis, Sotirios Anagnostidis, Gregor Bachmann, Thomas Hofmann

Training models to apply linguistic knowledge and visual concepts from 2D images to 3D world understanding is a promising direction that researchers have only recently started to explore.

Question Answering Visual Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.