no code implementations • 6 Aug 2024 • Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring
To that end, we show that our Set2Seq Transformer can leverage visual set and temporal position-aware representations for modelling visual artists' oeuvres for predicting artistic success.
no code implementations • 23 May 2024 • Yijia Zheng, Marcel Worring
Existing solutions for this task are based on message passing and model within-edge and within-node interactions as multi-input single-output functions.
no code implementations • 22 May 2024 • Shuai Wang, David W. Zhang, Jia-Hong Huang, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring
Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks.
1 code implementation • 9 Apr 2024 • Jiayi Shen, Cheems Wang, Zehao Xiao, Nanne van Noord, Marcel Worring
This paper proposes \textit{GO4Align}, a multi-task optimization approach that tackles task imbalance by explicitly aligning the optimization across tasks.
no code implementations • 20 Nov 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring
The aim of video summarization is to shorten videos automatically while retaining the key information necessary to convey the overall story.
1 code implementation • NeurIPS 2023 • Jiayi Shen, XianTong Zhen, QI, Wang, Marcel Worring
This paper focuses on the data-insufficiency problem in multi-task learning within an episodic training setup.
no code implementations • 30 Sep 2023 • Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G. M. Snoek, Marcel Worring, Yuki M. Asano
We present Self-Context Adaptation (SeCAt), a self-supervised approach that unlocks few-shot abilities for open-ended classification with small visual language models.
no code implementations • 22 Sep 2023 • Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring
The variety and complexity of relations in multimedia data lead to Heterogeneous Information Networks (HINs).
no code implementations • 20 Sep 2023 • Merel de Leeuw den Bouter, Javier Lloret Pardo, Zeno Geradts, Marcel Worring
This paper proposes a Visual Analytics process model for prototype learning, and, based on this, presents ProtoExplorer, a Visual Analytics system for the exploration and refinement of prototype-based deepfake detection models.
no code implementations • 2 Sep 2023 • Tom van Sonsbeek, XianTong Zhen, Marcel Worring
We show the use of this embedding on two tasks namely disease classification of X-ray reports and image classification.
no code implementations • 5 Jul 2023 • Maarten Sukel, Stevan Rudinac, Marcel Worring
Traditional approaches to demand forecasting rely on historical demand, product categories, and additional contextual information such as seasonality and events.
no code implementations • 4 Jul 2023 • Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring
Existing datasets for manually labelled query-based video summarization are costly and thus small, limiting the performance of supervised deep video summarization models.
no code implementations • 4 Jul 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring
Multi-modal video summarization has a video input and a text-based query input.
no code implementations • 30 Apr 2023 • Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring
In this work, a Causal Explainer, dubbed Causalainer, is proposed to address this issue.
no code implementations • 6 Apr 2023 • Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring
This work proposes a new method that utilizes semantically related questions, referred to as basic questions, acting as noise to evaluate the robustness of VQA models.
1 code implementation • 10 Mar 2023 • Tom van Sonsbeek, Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G. M. Snoek, Marcel Worring
Most existing methods approach it as a multi-class classification problem, which restricts the outcome to a predefined closed-set of curated answers.
Ranked #1 on Medical Visual Question Answering on OVQA
1 code implementation • 28 Feb 2023 • Ivona Najdenkoska, XianTong Zhen, Marcel Worring
Existing methods are trying to communicate visual concepts as prompts to frozen language models, but rely on hand-engineered task induction to reduce the hypothesis space.
no code implementations • 22 Feb 2023 • Tom van Sonsbeek, Marcel Worring
In this paper we mimic this ability by using multi-modal retrieval augmentation and apply it to several tasks in chest X-ray analysis.
no code implementations • 14 Nov 2022 • Nanne van Noord, Melvin Wevers, Tobias Blanke, Julia Noordegraaf, Marcel Worring
We believe that possible implementations of these aspects into AI research leads to AI that better captures the complexities of culture.
no code implementations • 13 Oct 2022 • Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring
This shows how two-stage learning of labels from coarse to fine-grained, in particular with object level annotations, is an effective method for more optimal annotation usage.
1 code implementation • 10 Oct 2022 • Jiayi Shen, Zehao Xiao, XianTong Zhen, Cees G. M. Snoek, Marcel Worring
To generalize to such test data, it is crucial for individual tasks to leverage knowledge from related tasks.
no code implementations • 30 Aug 2022 • Inske Groenen, Stevan Rudinac, Marcel Worring
With the PanorAMS framework we introduce a method to automatically generate bounding box annotations for panoramic images based on urban context information.
1 code implementation • 12 Apr 2022 • Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Tom van Sonsbeek, XianTong Zhen, Dwarikanath Mahapatra, Marcel Worring, Cees G. M. Snoek
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch, while cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
1 code implementation • 26 Nov 2021 • Sarah Ibrahimi, Nanne van Noord, Tim Alpherts, Marcel Worring
Additionally, we introduce a new training protocol Inside Out Data Augmentation to adapt Visual Place Recognition methods for localizing indoor images, demonstrating the potential of Inside Out Visual Place Recognition.
no code implementations • 10 Nov 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao
Our multi-task neural processes methodologically expand the scope of vanilla neural processes and provide a new way of exploring task relatedness in function spaces for multi-task learning.
1 code implementation • NeurIPS 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao
Multi-task learning aims to explore task relatedness to improve individual tasks, which is of particular significance in the challenging scenario that only limited data is available for each task.
2 code implementations • 13 Oct 2021 • Riccardo Di Sipio, Jia-Hong Huang, Samuel Yen-Chi Chen, Stefano Mangini, Marcel Worring
In this paper, we discuss the initial attempts at boosting understanding human language based on deep-learning models with quantum computing.
no code implementations • 22 Sep 2021 • Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring
Most hypergraph learning approaches convert the hypergraph structure to that of a graph and then deploy existing geometric deep learning methods.
no code implementations • 15 Jul 2021 • Ivona Najdenkoska, XianTong Zhen, Marcel Worring, Ling Shao
The topics are inferred in a conditional variational inference framework, with each topic governing the generation of a sentence in the report.
no code implementations • 30 May 2021 • Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring
Automatically generating medical reports for retinal images is one of the promising ways to help ophthalmologists reduce their workload and improve work efficiency.
no code implementations • 17 May 2021 • Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg
We propose ArtSAGENet, a novel multimodal architecture that integrates Graph Neural Networks (GNNs) and Convolutional Neural Networks (CNNs), to jointly learn visual and semantic-based artistic representations.
3 code implementations • 26 Apr 2021 • Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring
Traditional video summarization methods generate fixed video representations regardless of user interest.
no code implementations • 26 Apr 2021 • Jia-Hong Huang, Ting-Wei Wu, Marcel Worring
A traditional medical image captioning model creates a medical description only based on a single medical image input.
1 code implementation • EACL 2021 • Amir Soleimani, Christof Monz, Marcel Worring
We introduce NLQuAD, the first data set with baseline methods for non-factoid long question answering, a task requiring document-level language understanding.
no code implementations • 19 Mar 2021 • Tom van Sonsbeek, XianTong Zhen, Marcel Worring, Ling Shao
It is challenging to incorporate this information into disease classification due to the high reliance on clinician input in EHRs, limiting the possibility for automated diagnosis.
no code implementations • 1 Jan 2021 • Jiayi Shen, XianTong Zhen, Marcel Worring, Ling Shao
Multi-task learning aims to improve the overall performance of a set of tasks by leveraging their relatedness.
1 code implementation • 1 Nov 2020 • Jia-Hong Huang, Chao-Han Huck Yang, Fangyu Liu, Meng Tian, Yi-Chieh Liu, Ting-Wei Wu, I-Hung Lin, Kang Wang, Hiromasa Morikawa, Hernghua Chang, Jesper Tegner, Marcel Worring
To train and validate the effectiveness of our DNN-based module, we propose a large-scale retinal disease image dataset.
no code implementations • 9 Oct 2020 • Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring
To model such complex relations, hypergraphs have proven to be a natural representation.
no code implementations • 12 May 2020 • Nils Hulzebosch, Sarah Ibrahimi, Marcel Worring
Artificial, CNN-generated images are now of such high quality that humans have trouble distinguishing them from real images.
1 code implementation • 5 May 2020 • Jan Zahálka, Marcel Worring, Jarke J. van Wijk
Analytic categorization, however, is not machine classification (the difference between the two is called the pragmatic gap): a human adds/redefines/deletes categories of relevance on the fly to build insight, whereas the machine classifier is rigid and non-adaptive.
1 code implementation • 7 Apr 2020 • Jia-Hong Huang, Marcel Worring
In this work, we introduce a method which takes a text-based query as input and generates a video summary corresponding to it.
no code implementations • MIDL 2019 • Devanshu Arya, Richard Olij, Deepak K. Gupta, Ahmed El Gazzar, Guido van Wingen, Marcel Worring, Rajat Mani Thomas
We alleviate the use of such non-imaging metadata and propose a fully imaging-based approach where information from structural and functional Magnetic Resonance Imaging (MRI) data are fused to construct the edges and nodes of the graph.
no code implementations • 30 Nov 2019 • Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring
In this work, we propose a new method that uses semantically related questions, dubbed basic questions, acting as noise to evaluate the robustness of VQA models.
1 code implementation • 22 Oct 2019 • Andrew Brown, Pascal Mettes, Marcel Worring
Interestingly, when incorporating shifts to all point-wise convolutions in residual networks, 4-connected shifts outperform 8-connected shifts.
2 code implementations • 7 Oct 2019 • Amir Soleimani, Christof Monz, Marcel Worring
Motivated by the promising performance of pre-trained language models, we investigate BERT in an evidence retrieval and claim verification pipeline for the FEVER fact extraction and verification challenge.
no code implementations • 19 Sep 2019 • Devanshu Arya, Stevan Rudinac, Marcel Worring
Encoding multimedia items into a continuous low-dimensional semantic space such that both types of relations are captured and preserved is extremely challenging, especially if the goal is a unified end-to-end learning framework.
no code implementations • 7 May 2019 • Iva Gornishka, Stevan Rudinac, Marcel Worring
In this paper we present a novel interactive multimodal learning system, which facilitates search and exploration in large networks of social multimedia users.
no code implementations • 30 Apr 2019 • Maarten Sukel, Stevan Rudinac, Marcel Worring
In this paper we explore several methods of creating such a classifier, including early, late, hybrid fusion and representation learning using multimodal graphs.
no code implementations • 18 Apr 2019 • Björn Þór Jónsson, Omar Shahbaz Khan, Hanna Ragnarsdóttir, Þórhildur Þorleiksdóttir, Jan Zahálka, Stevan Rudinac, Gylfi Þór Guðmundsson, Laurent Amsaleg, Marcel Worring
Increasing scale is a dominant trend in today's multimedia collections, which especially impacts interactive applications.
no code implementations • 5 Apr 2019 • Gjorgji Strezoski, Nanne van Noord, Marcel Worring
When task relations are explicitly defined based on domain knowledge multi-task learning (MTL) offers such concurrent solutions, while exploiting relatedness between multiple tasks performed over the same dataset.
1 code implementation • ICCV 2019 • Gjorgji Strezoski, Nanne van Noord, Marcel Worring
Typical multi-task learning (MTL) methods rely on architectural adjustments and a large trainable parameter set to jointly optimize over several tasks.
no code implementations • 2 Aug 2017 • Gjorgji Strezoski, Marcel Worring
Vast amounts of artistic data is scattered on-line from both museums and art applications.
Ranked #1 on Period Estimation on OmniArt
1 code implementation • 23 Aug 2016 • Christophe Van Gysel, Maarten de Rijke, Marcel Worring
We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches.