Search Results for author: Marcel Worring

Found 50 papers, 18 papers with code

GO4Align: Group Optimization for Multi-Task Alignment

1 code implementation • 9 Apr 2024 • Jiayi Shen, Cheems Wang, Zehao Xiao, Nanne van Noord, Marcel Worring

This paper proposes \textit{GO4Align}, a multi-task optimization approach that tackles task imbalance by explicitly aligning the optimization across tasks.

Paper
Code

Conditional Modeling Based Automatic Video Summarization

no code implementations • 20 Nov 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring

The aim of video summarization is to shorten videos automatically while retaining the key information necessary to convey the overall story.

Video Summarization

Paper
Add Code

Episodic Multi-Task Learning with Heterogeneous Neural Processes

1 code implementation • NeurIPS 2023 • Jiayi Shen, XianTong Zhen, QI, Wang, Marcel Worring

This paper focuses on the data-insufficiency problem in multi-task learning within an episodic training setup.

Meta-Learning Multi-Task Learning

Paper
Code

Self-Supervised Open-Ended Classification with Small Visual Language Models

no code implementations • 30 Sep 2023 • Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G. M. Snoek, Marcel Worring, Yuki M. Asano

We present Self-Context Adaptation (SeCAt), a self-supervised approach that unlocks few-shot abilities for open-ended classification with small visual language models.

Few-Shot Learning Image Captioning

Paper
Add Code

Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks

no code implementations • 22 Sep 2023 • Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

The variety and complexity of relations in multimedia data lead to Heterogeneous Information Networks (HINs).

Node Classification

Paper
Add Code

ProtoExplorer: Interpretable Forensic Analysis of Deepfake Videos using Prototype Exploration and Refinement

no code implementations • 20 Sep 2023 • Merel de Leeuw den Bouter, Javier Lloret Pardo, Zeno Geradts, Marcel Worring

This paper proposes a Visual Analytics process model for prototype learning, and, based on this, presents ProtoExplorer, a Visual Analytics system for the exploration and refinement of prototype-based deepfake detection models.

DeepFake Detection Face Swapping

Paper
Add Code

Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports

no code implementations • 2 Sep 2023 • Tom van Sonsbeek, XianTong Zhen, Marcel Worring

We show the use of this embedding on two tasks namely disease classification of X-ray reports and image classification.

Classification Image Classification +2

Paper
Add Code

Multimodal Temporal Fusion Transformers Are Good Product Demand Forecasters

no code implementations • 5 Jul 2023 • Maarten Sukel, Stevan Rudinac, Marcel Worring

Traditional approaches to demand forecasting rely on historical demand, product categories, and additional contextual information such as seasonality and events.

Paper
Add Code

Query-based Video Summarization with Pseudo Label Supervision

no code implementations • 4 Jul 2023 • Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Existing datasets for manually labelled query-based video summarization are costly and thus small, limiting the performance of supervised deep video summarization models.

Pseudo Label Video Summarization

Paper
Add Code

Causal Video Summarizer for Video Exploration

no code implementations • 4 Jul 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring

Multi-modal video summarization has a video input and a text-based query input.

Video Summarization

Paper
Add Code

Causalainer: Causal Explainer for Automatic Video Summarization

no code implementations • 30 Apr 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring

In this work, a Causal Explainer, dubbed Causalainer, is proposed to address this issue.

Video Summarization

Paper
Add Code

Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions

no code implementations • 6 Apr 2023 • Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring

This work proposes a new method that utilizes semantically related questions, referred to as basic questions, acting as noise to evaluate the robustness of VQA models.

In-Context Learning Question Answering +1

Paper
Add Code

Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models

1 code implementation • 10 Mar 2023 • Tom van Sonsbeek, Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G. M. Snoek, Marcel Worring

Most existing methods approach it as a multi-class classification problem, which restricts the outcome to a predefined closed-set of curated answers.

Ranked #1 on Medical Visual Question Answering on OVQA

Language Modelling Medical Visual Question Answering +3

Paper
Code

Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning

1 code implementation • 28 Feb 2023 • Ivona Najdenkoska, XianTong Zhen, Marcel Worring

Existing methods are trying to communicate visual concepts as prompts to frozen language models, but rely on hand-engineered task induction to reduce the hypothesis space.

Few-Shot Learning

Paper
Code

X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval Augmentation

no code implementations • 22 Feb 2023 • Tom van Sonsbeek, Marcel Worring

In this paper we mimic this ability by using multi-modal retrieval augmentation and apply it to several tasks in chest X-ray analysis.

Cross-Modal Retrieval Retrieval

Paper
Add Code

An Analytics of Culture: Modeling Subjectivity, Scalability, Contextuality, and Temporality

no code implementations • 14 Nov 2022 • Nanne van Noord, Melvin Wevers, Tobias Blanke, Julia Noordegraaf, Marcel Worring

We believe that possible implementations of these aspects into AI research leads to AI that better captures the complexities of culture.

Cultural Vocal Bursts Intensity Prediction

Paper
Add Code

Probabilistic Integration of Object Level Annotations in Chest X-ray Classification

no code implementations • 13 Oct 2022 • Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring

This shows how two-stage learning of labels from coarse to fine-grained, in particular with object level annotations, is an effective method for more optimal annotation usage.

Knowledge Distillation Variational Inference

Paper
Add Code

Association Graph Learning for Multi-Task Classification with Category Shifts

1 code implementation • 10 Oct 2022 • Jiayi Shen, Zehao Xiao, XianTong Zhen, Cees G. M. Snoek, Marcel Worring

To generalize to such test data, it is crucial for individual tasks to leverage knowledge from related tasks.

General Classification Graph Learning +3

Paper
Code

PanorAMS: Automatic Annotation for Detecting Objects in Urban Context

no code implementations • 30 Aug 2022 • Inske Groenen, Stevan Rudinac, Marcel Worring

With the PanorAMS framework we introduce a method to automatically generate bounding box annotations for panoramic images based on urban context information.

Image Classification Object +2

Paper
Add Code

LifeLonger: A Benchmark for Continual Disease Classification

1 code implementation • 12 Apr 2022 • Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring, Cees G. M. Snoek

Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch, while cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.

Classification Class Incremental Learning +1

Paper
Code

Inside Out Visual Place Recognition

1 code implementation • 26 Nov 2021 • Sarah Ibrahimi, Nanne van Noord, Tim Alpherts, Marcel Worring

Additionally, we introduce a new training protocol Inside Out Data Augmentation to adapt Visual Place Recognition methods for localizing indoor images, demonstrating the potential of Inside Out Visual Place Recognition.

Data Augmentation Visual Place Recognition

Paper
Code

Multi-Task Neural Processes

no code implementations • 10 Nov 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Our multi-task neural processes methodologically expand the scope of vanilla neural processes and provide a new way of exploring task relatedness in function spaces for multi-task learning.

Bayesian Inference Brain Image Segmentation +4

Paper
Add Code

Variational Multi-Task Learning with Gumbel-Softmax Priors

1 code implementation • NeurIPS 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Multi-task learning aims to explore task relatedness to improve individual tasks, which is of particular significance in the challenging scenario that only limited data is available for each task.

Bayesian Inference Multi-Task Learning

Paper
Code

The Dawn of Quantum Natural Language Processing

2 code implementations • 13 Oct 2021 • Riccardo Di Sipio, Jia-Hong Huang, Samuel Yen-Chi Chen, Stefano Mangini, Marcel Worring

In this paper, we discuss the initial attempts at boosting understanding human language based on deep-learning models with quantum computing.

Sentiment Analysis

Paper
Code

Adaptive Neural Message Passing for Inductive Learning on Hypergraphs

no code implementations • 22 Sep 2021 • Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring

Most hypergraph learning approaches convert the hypergraph structure to that of a graph and then deploy existing geometric deep learning methods.

Paper
Add Code

Variational Topic Inference for Chest X-Ray Report Generation

no code implementations • 15 Jul 2021 • Ivona Najdenkoska, XianTong Zhen, Marcel Worring, Ling Shao

The topics are inferred in a conditional variational inference framework, with each topic governing the generation of a sentence in the report.

Sentence Text Generation +1

Paper
Add Code

Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"

no code implementations • 30 May 2021 • Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring

Automatically generating medical reports for retinal images is one of the promising ways to help ophthalmologists reduce their workload and improve work efficiency.

Avg Image Captioning +1

Paper
Add Code

Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings

no code implementations • 17 May 2021 • Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg

We propose ArtSAGENet, a novel multimodal architecture that integrates Graph Neural Networks (GNNs) and Convolutional Neural Networks (CNNs), to jointly learn visual and semantic-based artistic representations.

Art Analysis Multi-Task Learning +1

Paper
Add Code

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization

2 code implementations • 26 Apr 2021 • Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Traditional video summarization methods generate fixed video representations regardless of user interest.

Video Summarization

Paper
Code

Contextualized Keyword Representations for Multi-modal Retinal Image Captioning

no code implementations • 26 Apr 2021 • Jia-Hong Huang, Ting-Wei Wu, Marcel Worring

A traditional medical image captioning model creates a medical description only based on a single medical image input.

Avg Image Captioning

Paper
Add Code

NLQuAD: A Non-Factoid Long Question Answering Data Set

1 code implementation • EACL 2021 • Amir Soleimani, Christof Monz, Marcel Worring

We introduce NLQuAD, the first data set with baseline methods for non-factoid long question answering, a task requiring document-level language understanding.

Descriptive Position +2

Paper
Code

Variational Knowledge Distillation for Disease Classification in Chest X-Rays

no code implementations • 19 Mar 2021 • Tom van Sonsbeek, XianTong Zhen, Marcel Worring, Ling Shao

It is challenging to incorporate this information into disease classification due to the high reliance on clinician input in EHRs, limiting the possibility for automated diagnosis.

General Classification Image Classification +2

Paper
Add Code

Variational Multi-Task Learning

no code implementations • 1 Jan 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao

Multi-task learning aims to improve the overall performance of a set of tasks by leveraging their relatedness.

Bayesian Inference Inductive Bias +1

Paper
Add Code

DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation

1 code implementation • 1 Nov 2020 • Jia-Hong Huang, Chao-Han Huck Yang, Fangyu Liu, Meng Tian, Yi-Chieh Liu, Ting-Wei Wu, I-Hung Lin, Kang Wang, Hiromasa Morikawa, Hernghua Chang, Jesper Tegner, Marcel Worring

To train and validate the effectiveness of our DNN-based module, we propose a large-scale retinal disease image dataset.

Medical Report Generation

Paper
Code

HyperSAGE: Generalizing Inductive Representation Learning on Hypergraphs

no code implementations • 9 Oct 2020 • Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring

To model such complex relations, hypergraphs have proven to be a natural representation.

Representation Learning

Paper
Add Code

Detecting CNN-Generated Facial Images in Real-World Scenarios

no code implementations • 12 May 2020 • Nils Hulzebosch, Sarah Ibrahimi, Marcel Worring

Artificial, CNN-generated images are now of such high quality that humans have trouble distinguishing them from real images.

Paper
Add Code

II-20: Intelligent and pragmatic analytic categorization of image collections

1 code implementation • 5 May 2020 • Jan Zahálka, Marcel Worring, Jarke J. van Wijk

Analytic categorization, however, is not machine classification (the difference between the two is called the pragmatic gap): a human adds/redefines/deletes categories of relevance on the fly to build insight, whereas the machine classifier is rigid and non-adaptive.

Paper
Code

Query-controllable Video Summarization

1 code implementation • 7 Apr 2020 • Jia-Hong Huang, Marcel Worring

In this work, we introduce a method which takes a text-based query as input and generates a video summary corresponding to it.

Video Summarization

Paper
Code

Fusing Structural and Functional MRIs using Graph Convolutional Networks for Autism Classification

no code implementations • MIDL 2019 • Devanshu Arya, Richard Olij, Deepak K. Gupta, Ahmed El Gazzar, Guido van Wingen, Marcel Worring, Rajat Mani Thomas

We alleviate the use of such non-imaging metadata and propose a fully imaging-based approach where information from structural and functional Magnetic Resonance Imaging (MRI) data are fused to construct the edges and nodes of the graph.

Disease Prediction

Paper
Add Code

Assessing the Robustness of Visual Question Answering Models

no code implementations • 30 Nov 2019 • Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring

In this work, we propose a new method that uses semantically related questions, dubbed basic questions, acting as noise to evaluate the robustness of VQA models.

Question Answering Visual Question Answering

Paper
Add Code

4-Connected Shift Residual Networks

1 code implementation • 22 Oct 2019 • Andrew Brown, Pascal Mettes, Marcel Worring

Interestingly, when incorporating shifts to all point-wise convolutions in residual networks, 4-connected shifts outperform 8-connected shifts.

Paper
Code

BERT for Evidence Retrieval and Claim Verification

2 code implementations • 7 Oct 2019 • Amir Soleimani, Christof Monz, Marcel Worring

Motivated by the promising performance of pre-trained language models, we investigate BERT in an evidence retrieval and claim verification pipeline for the FEVER fact extraction and verification challenge.

Claim Verification Retrieval

170

Paper
Code

HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities

no code implementations • 19 Sep 2019 • Devanshu Arya, Stevan Rudinac, Marcel Worring

Encoding multimedia items into a continuous low-dimensional semantic space such that both types of relations are captured and preserved is extremely challenging, especially if the goal is a unified end-to-end learning framework.

Representation Learning

Paper
Add Code

Interactive Search and Exploration in Online Discussion Forums Using Multimodal Embeddings

no code implementations • 7 May 2019 • Iva Gornishka, Stevan Rudinac, Marcel Worring

In this paper we present a novel interactive multimodal learning system, which facilitates search and exploration in large networks of social multimedia users.

Paper
Add Code

Multimodal Classification of Urban Micro-Events

no code implementations • 30 Apr 2019 • Maarten Sukel, Stevan Rudinac, Marcel Worring

In this paper we explore several methods of creating such a classifier, including early, late, hybrid fusion and representation learning using multimodal graphs.

Classification General Classification +1

Paper
Add Code

Exquisitor: Interactive Learning at Large

no code implementations • 18 Apr 2019 • Björn Þór Jónsson, Omar Shahbaz Khan, Hanna Ragnarsdóttir, Þórhildur Þorleiksdóttir, Jan Zahálka, Stevan Rudinac, Gylfi Þór Guðmundsson, Laurent Amsaleg, Marcel Worring

Increasing scale is a dominant trend in today's multimedia collections, which especially impacts interactive applications.

Retrieval

Paper
Add Code

Learning Task Relatedness in Multi-Task Learning for Images in Context

no code implementations • 5 Apr 2019 • Gjorgji Strezoski, Nanne van Noord, Marcel Worring

When task relations are explicitly defined based on domain knowledge multi-task learning (MTL) offers such concurrent solutions, while exploiting relatedness between multiple tasks performed over the same dataset.

Multi-Task Learning

Paper
Add Code

Many Task Learning with Task Routing

1 code implementation • ICCV 2019 • Gjorgji Strezoski, Nanne van Noord, Marcel Worring

Typical multi-task learning (MTL) methods rely on architectural adjustments and a large trainable parameter set to jointly optimize over several tasks.

Multi-Task Learning

Paper
Code

OmniArt: Multi-task Deep Learning for Artistic Data Analysis

no code implementations • 2 Aug 2017 • Gjorgji Strezoski, Marcel Worring

Vast amounts of artistic data is scattered on-line from both museums and art applications.

Multi-Task Learning

Paper
Add Code

Unsupervised, Efficient and Semantic Expertise Retrieval

1 code implementation • 23 Aug 2016 • Christophe Van Gysel, Maarten de Rijke, Marcel Worring

We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches.

Feature Engineering Retrieval

109

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.