Search Results for author: Adnen Abdessaied

Found 4 papers, 1 paper with code

Video Language Co-Attention with Multimodal Fast-Learning Feature Fusion for VideoQA

no code implementations • RepL4NLP (ACL) 2022 • Adnen Abdessaied, Ekta Sood, Andreas Bulling

We propose the Video Language Co-Attention Network (VLCN) – a novel memory-enhanced model for Video Question Answering (VideoQA).

Question Answering, Video Question Answering

OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog

no code implementations • 20 Feb 2024 • Adnen Abdessaied, Manuel von Hochmeister, Andreas Bulling

OLViT addresses these challenges by maintaining a global dialog state based on the output of an Object State Tracker (OST) and a Language State Tracker (LST): while the OST attends to the most important objects within the video, the LST keeps track of the most important linguistic co-references to previous dialog turns.

Object, Object Tracking, +2

$\mathbb{VD}$-$\mathbb{GR}$: Boosting $\mathbb{V}$isual $\mathbb{D}$ialog with Cascaded Spatial-Temporal Multi-Modal $\mathbb{GR}$aphs

no code implementations • 25 Oct 2023 • Adnen Abdessaied, Lei Shi, Andreas Bulling

We propose $\mathbb{VD}$-$\mathbb{GR}$, a novel visual dialog model that combines pre-trained language models (LMs) with graph neural networks (GNNs).

Visual Dialog

Neuro-Symbolic Visual Dialog

1 code implementation • COLING 2022 • Adnen Abdessaied, Mihai Bâce, Andreas Bulling

We propose Neuro-Symbolic Visual Dialog (NSVD), the first method to combine deep learning and symbolic program execution for multi-round visually-grounded reasoning.

Question Answering
