Search Results for author: Kai Norman Clasen

Found 4 papers, 1 papers with code

Transformer-based Multi-Modal Learning for Multi Label Remote Sensing Image Classification

no code implementations2 Jun 2023 David Hoffmann, Kai Norman Clasen, Begüm Demir

In this paper, we introduce a novel Synchronized Class Token Fusion (SCT Fusion) architecture in the framework of multi-modal multi-label classification (MLC) of remote sensing (RS) images.

Image Classification Multi-Label Classification +1

LiT-4-RSVQA: Lightweight Transformer-based Visual Question Answering in Remote Sensing

no code implementations1 Jun 2023 Leonard Hackel, Kai Norman Clasen, Mahdyar Ravanbakhsh, Begüm Demir

Visual question answering (VQA) methods in remote sensing (RS) aim to answer natural language questions with respect to an RS image.

Question Answering Visual Question Answering

Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing

no code implementations10 Oct 2022 Tim Siebert, Kai Norman Clasen, Mahdyar Ravanbakhsh, Begüm Demir

To make the intrinsic information of each RS image easily accessible, visual question answering (VQA) has been introduced in RS.

Question Answering Representation Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.