1 code implementation • 25 Sep 2024 • Tommie Kerssies, Daan de Geus, Gijs Dubbelman
In this report, we present our solution for Track 1 of the 2024 BRAVO Challenge, where a model is trained on Cityscapes and its robustness is evaluated on several out-of-distribution datasets.
1 code implementation • 23 Sep 2024 • Tuan-Hung Vu, Eduardo Valle, Andrei Bursuc, Tommie Kerssies, Daan de Geus, Gijs Dubbelman, Long Qian, Bingke Zhu, Yingying Chen, Ming Tang, Jinqiao Wang, Tomáš Vojíř, Jan Šochman, Jiří Matas, Michael Smith, Frank Ferrie, Shamik Basu, Christos Sakaridis, Luc van Gool
We propose the unified BRAVO challenge to benchmark the reliability of semantic segmentation models under realistic perturbations and unknown out-of-distribution (OOD) scenarios.
1 code implementation • CVPR 2024 • Daan de Geus, Gijs Dubbelman
However, their part-level predictions are not linked to individual parent objects.
1 code implementation • 14 Jun 2024 • Brunó B. Englert, Fabrizio J. Piva, Tommie Kerssies, Daan de Geus, Gijs Dubbelman
These results underscore the significant benefits of combining VFMs with UDA, setting new standards and baselines for Unsupervised Domain Adaptation in semantic segmentation.
1 code implementation • CVPR 2024 • Narges Norouzi, Svetlana Orlova, Daan de Geus, Gijs Dubbelman
This work presents Adaptive Local-then-Global Merging (ALGM), a token reduction method for semantic segmentation networks that use plain Vision Transformers.
2 code implementations • 18 Apr 2024 • Tommie Kerssies, Daan de Geus, Gijs Dubbelman
The benchmarking setup recommended in this paper enables a performance analysis of VFMs for semantic segmentation.
1 code implementation • CVPR 2023 • Chenyang Lu, Daan de Geus, Gijs Dubbelman
This paper introduces Content-aware Token Sharing (CTS), a token reduction approach that improves the computational efficiency of semantic segmentation networks that use Vision Transformers (ViTs).
2 code implementations • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023 • Daan de Geus, Gijs Dubbelman
Unified panoptic segmentation methods are achieving state-of-the-art results on several datasets.
Ranked #5 on Panoptic Segmentation on Mapillary val
no code implementations • 4 Apr 2023 • Ariyan Bighashdel, Daan de Geus, Pavol Jancura, Gijs Dubbelman
Learning anticipation in Multi-Agent Reinforcement Learning (MARL) is a reasoning paradigm where agents anticipate the learning steps of other agents to improve cooperation among themselves.
no code implementations • 3 Mar 2023 • Kurt Stolle, Gijs Dubbelman
Furthermore, we show that our tracking strategies are effective for long-term object association on KITTI-STEP, achieving an STQ of 59. 1 which exceeded the performance of state-of-the-art methods that employ the same backbone network.
Depth-aware Video Panoptic Segmentation Scene Understanding +1
no code implementations • 18 Jan 2023 • Panagiotis Meletis, Gijs Dubbelman
We explore semantic segmentation beyond the conventional, single-dataset homogeneous training and bring forward the problem of Heterogeneous Training of Semantic Segmentation (HTSS).
1 code implementation • 21 Mar 2022 • Chenyang Lu, Gijs Dubbelman
Aiming for higher-level scene understanding, this work presents a neural network approach that takes a road-layout map in bird's-eye-view as input, and predicts a human-interpretable graph that represents the road's topological layout.
1 code implementation • 14 Jul 2021 • Ariyan Bighashdel, Panagiotis Meletis, Pavol Jancura, Gijs Dubbelman
This paper presents a deep Inverse Reinforcement Learning (IRL) framework that can learn an a priori unknown number of nonlinear reward functions from unlabeled experts' demonstrations.
no code implementations • 13 Jul 2021 • Fabrizio J. Piva, Gijs Dubbelman
To exploit the advantage of using multiple image translations, we propose an ensemble learning approach, where three classifiers calculate their prediction by taking as input features of different image translations, making each classifier learn independently, with the purpose of combining their outputs by sparse Multinomial Logistic Regression.
Ranked #1 on Domain Generalization on WildDash
1 code implementation • CVPR 2021 • Daan de Geus, Panagiotis Meletis, Chenyang Lu, Xiaoxiao Wen, Gijs Dubbelman
In this work, we introduce the new scene understanding task of Part-aware Panoptic Segmentation (PPS), which aims to understand a scene at multiple levels of abstraction, and unifies the tasks of scene parsing and part parsing.
Ranked #2 on Image Segmentation on Pascal Panoptic Parts
no code implementations • 10 Dec 2020 • Chenyang Lu, Gijs Dubbelman
To overcome this, we are the first to present a self-supervised approach based on a fully-differentiable auto-encoder in which the bottleneck encodes the graph's nodes and edges.
4 code implementations • 16 Apr 2020 • Panagiotis Meletis, Xiaoxiao Wen, Chenyang Lu, Daan de Geus, Gijs Dubbelman
In this technical report, we present two novel datasets for image scene understanding.
no code implementations • 9 Oct 2019 • Daan de Geus, Panagiotis Meletis, Gijs Dubbelman
For lower resolutions of the Cityscapes dataset and for the Pascal VOC dataset, FPSNet runs at 22 and 35 frames per second, respectively.
1 code implementation • 10 Sep 2019 • Chenyang Lu, Gijs Dubbelman
Our approach is inherently more efficient than the previous two-stage state-of-the-art method, and outperforms it by a margin of 3% IoU for the inpainted foreground regions on Cityscapes.
no code implementations • 23 Jul 2019 • Chenyang Lu, Gijs Dubbelman
We propose a novel single-step training strategy that allows convolutional encoder-decoder networks that use skip connections, to complete partially observed data by means of hallucination.
no code implementations • 16 Jul 2019 • Panagiotis Meletis, Rob Romijnders, Gijs Dubbelman
Training convolutional networks for semantic segmentation with strong (per-pixel) and weak (per-bounding-box) supervision requires a large amount of weakly labeled data.
1 code implementation • 8 Mar 2019 • Panagiotis Meletis, Gijs Dubbelman
We collect street scene images and weak labels from the immense Open Images dataset to generate the OpenScapes dataset, and we use this novel dataset to increase segmentation performance on two established per-pixel labeled datasets, Cityscapes and Vistas.
1 code implementation • 7 Feb 2019 • Daan de Geus, Panagiotis Meletis, Gijs Dubbelman
Our network is evaluated on two street scene datasets: Cityscapes and Mapillary Vistas.
no code implementations • 14 Sep 2018 • Rob Romijnders, Panagiotis Meletis, Gijs Dubbelman
We show that conventional normalization layers worsen the performance of current Unsupervised Adversarial Domain Adaption (UADA), which is a method to improve network performance on unlabeled datasets and the focus of our research.
no code implementations • CoRR 2019 • Daan de Geus, Panagiotis Meletis, Gijs Dubbelman
For instance segmentation, a Mask R-CNN type of architecture is used, while the semantic segmentation branch is augmented with a Pyramid Pooling Module.
Ranked #11 on Panoptic Segmentation on Mapillary val
no code implementations • 6 Apr 2018 • Chenyang Lu, Marinus Jacobus Gerardus van de Molengraft, Gijs Dubbelman
In this work, we research and evaluate end-to-end learning of monocular semantic-metric occupancy grid mapping from weak binocular ground truth.
Ranked #2 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU veh - 224x480 - No vis filter - 100x50 at 0.25 metric)
2 code implementations • 15 Mar 2018 • Panagiotis Meletis, Gijs Dubbelman
We propose a convolutional network with hierarchical classifiers for per-pixel semantic segmentation, which is able to be trained on multiple, heterogeneous datasets and exploit their semantic hierarchy.
Ranked #5 on Semantic Segmentation on KITTI Semantic Segmentation
no code implementations • 8 Apr 2016 • Willem P. Sanberg, Gijs Dubbelman, Peter H. N. de With
Experiments show that the online training boosts performance with 5% when compared to offline training, both for Fmax and AP.