no code implementations • 1 Jun 2023 • Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult, Francis Engelmann, Bastian Leibe, Konrad Schindler, Theodora Kontogianni
We introduce AGILE3D, an efficient, attention-based model that (1) supports simultaneous segmentation of multiple 3D objects, (2) yields more accurate segmentation masks with fewer user clicks, and (3) offers faster inference.
no code implementations • 13 Apr 2023 • Amit Kumar Rana, Sabarinath Mahadevan, Alexander Hermans, Bastian Leibe
We introduce a more efficient approach, called DynaMITe, in which we represent user interactions as spatio-temporal queries to a Transformer decoder with a potential to segment multiple object instances in a single iteration.
1 code implementation • 29 Mar 2023 • Karim Abou Zeid, Jonas Schult, Alexander Hermans, Bastian Leibe
Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach.
Ranked #2 on
Few-Shot 3D Point Cloud Classification
on ModelNet40 10-way (20-shot)
(using extra training data)
3D Part Segmentation
Few-Shot 3D Point Cloud Classification
+3
1 code implementation • CVPR 2023 • Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe
A single TarViS model can be trained jointly on a collection of datasets spanning different tasks, and can hot-swap between tasks during inference without any task-specific retraining.
Ranked #1 on
Video Panoptic Segmentation
on KITTI-STEP
(using extra training data)
no code implementations • 29 Dec 2022 • István Sárándi, Alexander Hermans, Bastian Leibe
Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset.
no code implementations • 1 Dec 2022 • Ayça Takmaz, Jonas Schult, Irem Kaftan, Mertcan Akçay, Bastian Leibe, Robert Sumner, Francis Engelmann, Siyu Tang
Our analysis of different training schemes using a combination of synthetic and realistic data shows that synthetic data for pre-training improves performance in a wide variety of segmentation tasks and models.
1 code implementation • 6 Oct 2022 • Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe
Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques.
Ranked #1 on
3D Instance Segmentation
on ScanNet200
1 code implementation • 29 Sep 2022 • Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe
Our voting-based tracklet generation method followed by geometric feature-based aggregation generates significantly improved panoptic LiDAR segmentation quality when compared to modeling the entire 4D volume using Gaussian probability distributions.
1 code implementation • 25 Sep 2022 • Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan
Multiple existing benchmarks involve tracking and segmenting objects in video e. g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e. g. J&F, mAP, sMOTSA).
Multi-Object Tracking
Multi-Object Tracking and Segmentation
+4
no code implementations • 7 Aug 2022 • Dan Jia, Alexander Hermans, Bastian Leibe
For the 3D object detection task, GHA improves the CenterPoint baseline by +0. 5% mAP on the nuScenes dataset, and the 3DETR baseline by +2. 1% mAP25 and +1. 5% mAP50 on ScanNet.
1 code implementation • 3 Aug 2022 • Diego Paez-Granados, Yujie He, David Gonon, Dan Jia, Bastian Leibe, Kenji Suzuki, Aude Billard
Autonomous navigation in highly populated areas remains a challenging task for robots because of the difficulty in guaranteeing safe interactions with pedestrians in unstructured situations.
1 code implementation • 1 Jun 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
Recently, "Masked Attention" was proposed in which a given object representation only attends to those image pixel features for which the segmentation mask of that object is active.
no code implementations • CVPR 2022 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé
A benchmark that would allow us to perform an apple-to-apple comparison of existing efforts is a crucial first step towards advancing this important research field.
1 code implementation • CVPR 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
Existing state-of-the-art methods for Video Object Segmentation (VOS) learn low-level pixel-to-pixel correspondences between frames to propagate object masks across video.
1 code implementation • WACV 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe
We further show that D2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.
Multi-Object Tracking and Segmentation
Semantic Segmentation
+4
1 code implementation • 15 Nov 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe
We further show that D^2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.
Semantic Segmentation
Unsupervised Video Object Segmentation
+2
3 code implementations • 5 Oct 2021 • Alexey Nekrasov, Jonas Schult, Or Litany, Bastian Leibe, Francis Engelmann
Since scene context helps reasoning about object semantics, current works focus on models with large capacity and receptive fields that can fully capture the global context of an input 3D scene.
Ranked #8 on
Semantic Segmentation
on ScanNet
1 code implementation • 3 Jul 2021 • Dan Jia, Bastian Leibe
In this preliminary work we attempt to apply submanifold sparse convolution to the task of 3D person detection.
no code implementations • 21 Jun 2021 • Dan Jia, Alexander Hermans, Bastian Leibe
Person detection is a crucial task for mobile robots navigating in human-populated environments.
no code implementations • 22 Apr 2021 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé
We hope to open a new front in multi-object tracking research that will hopefully bring us a step closer to intelligent systems that can operate safely in the real world.
1 code implementation • 23 Feb 2021 • Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen
The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation.
no code implementations • CVPR 2021 • Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari
We propose a method to detect and reconstruct multiple 3D objects from a single RGB image.
1 code implementation • 16 Dec 2020 • Dan Jia, Mats Steinweg, Alexander Hermans, Bastian Leibe
Through experiments on the JackRabbot dataset with two detector models, DROW3 and DR-SPAAM, we show that self-supervised detectors, trained or fine-tuned with pseudo-labels, outperform detectors trained only on a different dataset.
no code implementations • 2 Nov 2020 • Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe
We use a deep convolutional network to automatically create pseudo-labels on a pixel level from much cheaper bounding box annotations and investigate how far such pseudo-labels can carry us for training state-of-the-art VOS approaches.
3 code implementations • 16 Sep 2020 • Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe
Multi-Object Tracking (MOT) has been notoriously difficult to evaluate.
1 code implementation • 26 Aug 2020 • Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe
On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involving dense per-pixel interpretation of videos compared to their 2D convolutional counterparts and lag behind the aforementioned networks in terms of performance.
Semantic Segmentation
Unsupervised Video Object Segmentation
+4
1 code implementation • 12 Jul 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Heatmap representations have formed the basis of human pose estimation systems for many years, and their extension to 3D has been a fruitful line of recent research.
Ranked #1 on
3D Human Pose Estimation
on 3D Poses in the Wild Challenge
(MPJPE metric)
1 code implementation • 8 Jun 2020 • Markus Knoche, István Sárándi, Bastian Leibe
We address the problem of reposing an image of a human into any desired novel pose.
1 code implementation • 2 May 2020 • Francis Engelmann, Jörg Stückler, Bastian Leibe
In this paper, we propose to use 3D shape and motion priors to regularize the estimation of the trajectory and the shape of vehicles in sequences of stereo images.
2 code implementations • 29 Apr 2020 • Dan Jia, Alexander Hermans, Bastian Leibe
Detecting persons using a 2D LiDAR is a challenging task due to the low information content of 2D range data.
no code implementations • ECCV 2020 • Umer Rafi, Andreas Doering, Bastian Leibe, Juergen Gall
Instead of training the network for estimating keypoint correspondences on video data, it is trained on a large scale image datasets for human pose estimation using self-supervision.
Multi-Person Pose Estimation
Multi-Person Pose Estimation and Tracking
+1
1 code implementation • CVPR 2020 • Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe
That is, the convolutional kernel weights are mapped to the local surface of a given mesh.
1 code implementation • 30 Mar 2020 • Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner
We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.
Ranked #1 on
3D Semantic Instance Segmentation
on ScanNetV2
1 code implementation • ECCV 2020 • Ali Athar, Sabarinath Mahadevan, Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe
In this paper, we pro-pose a different approach that is well-suited to a variety of tasks involvinginstance segmentation in videos.
Ranked #4 on
Unsupervised Video Object Segmentation
on DAVIS 2017 (val)
(using extra training data)
1 code implementation • 5 Mar 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Furthermore, as the image space is decoupled from the heatmap space, the network can learn to reason about joints beyond the image boundary.
Ranked #125 on
3D Human Pose Estimation
on Human3.6M
1 code implementation • 15 Jan 2020 • Jonathon Luiten, Idil Esen Zulfikar, Bastian Leibe
UnOVOST even performs competitively with many semi-supervised video object segmentation algorithms even though it is not given any input as to which objects should be tracked and segmented.
Semantic Segmentation
Semi-Supervised Video Object Segmentation
+2
1 code implementation • CVPR 2020 • Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe
We present Siam R-CNN, a Siamese re-detection architecture which unleashes the full power of two-stage object detection approaches for visual object tracking.
Ranked #15 on
Visual Object Tracking
on TrackingNet
no code implementations • 2 Nov 2019 • Mark Weber, Jonathon Luiten, Bastian Leibe
We present a novel end-to-end single-shot method that segments countable object instances (things) as well as background regions (stuff) into a non-overlapping panoptic segmentation at almost video frame rate.
1 code implementation • 10 Oct 2019 • Johannes Groß, Aljosa Osep, Bastian Leibe
In this work, we focus on precise 3D track state estimation and propose a learning-based approach for object-centric relative motion estimation of partially observed objects.
1 code implementation • 30 Sep 2019 • Jonathon Luiten, Tobias Fischer, Bastian Leibe
Object tracking and 3D reconstruction are often performed together, with tracking used as input for reconstruction.
1 code implementation • 28 Jul 2019 • Francis Engelmann, Theodora Kontogianni, Bastian Leibe
In a thorough ablation study, we show that the receptive field size is directly related to the performance of 3D point cloud processing tasks, including semantic segmentation and object classification.
Ranked #27 on
Semantic Segmentation
on S3DIS Area5
no code implementations • 7 Jun 2019 • Kilian Pfeiffer, Alexander Hermans, István Sárándi, Mark Weber, Bastian Leibe
We address the problem of learning a single model for person re-identification, attribute classification, body part segmentation, and pose estimation.
no code implementations • 9 Apr 2019 • Paul Voigtlaender, Jonathon Luiten, Bastian Leibe
Following this paradigm, we present BoLTVOS (Box-Level Tracking for VOS), which consists of an R-CNN detector conditioned on the first-frame bounding box to detect the object of interest, a temporal consistency rescoring algorithm, and a Box2Seg network that converts bounding boxes to segmentation masks.
One-shot visual object segmentation
Semantic Segmentation
+2
no code implementations • 3 Apr 2019 • Cathrin Elich, Francis Engelmann, Theodora Kontogianni, Bastian Leibe
A lot of progress was made in the field of object classification and semantic segmentation.
Ranked #4 on
3D Semantic Instance Segmentation
on ScanNetV2
3D Instance Segmentation
3D Semantic Instance Segmentation
+2
no code implementations • 28 Feb 2019 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
This paper addresses the problem of object discovery from unlabeled driving videos captured in a realistic automotive setting.
3 code implementations • CVPR 2019 • Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen
Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use.
Ranked #1 on
Semi-Supervised Video Object Segmentation
on YouTube
Semantic Segmentation
Semi-Supervised Video Object Segmentation
+1
no code implementations • CVPR 2019 • Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe
This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS).
Ranked #6 on
Multi-Object Tracking
on MOTS20
Multi-Object Tracking
Multi-Object Tracking and Segmentation
+1
1 code implementation • 26 Jan 2019 • Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe
Many high-level video understanding methods require input in the form of object proposals.
no code implementations • 2 Oct 2018 • Francis Engelmann, Theodora Kontogianni, Jonas Schult, Bastian Leibe
In this paper, we present a deep learning architecture which addresses the problem of 3D semantic segmentation of unstructured point clouds.
no code implementations • 19 Sep 2018 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
We propose to leverage a generic object tracker in order to perform object mining in large-scale unlabeled videos, captured in a realistic automotive setting.
no code implementations • 19 Sep 2018 • Aljosa Osep, Wolfgang Mehner, Markus Mathias, Bastian Leibe
Most of the current vision-based tracking methods perform tracking in the image domain.
Ranked #21 on
Multiple Object Tracking
on KITTI Tracking test
1 code implementation • 13 Sep 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
In this paper we present our winning entry at the 2018 ECCV PoseTrack Challenge on 3D human pose estimation.
Ranked #178 on
3D Human Pose Estimation
on Human3.6M
1 code implementation • 28 Aug 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Occlusion is commonplace in realistic human-robot shared environments, yet its effects are not considered in standard 3D human pose estimation benchmarks.
Ranked #188 on
3D Human Pose Estimation
on Human3.6M
5 code implementations • 24 Jul 2018 • Jonathon Luiten, Paul Voigtlaender, Bastian Leibe
We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.
One-shot visual object segmentation
Semantic Segmentation
+1
1 code implementation • 11 May 2018 • Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe
Deep learning requires large amounts of training data to be effective.
1 code implementation • 26 Apr 2018 • Stefan Breuers, Lucas Beyer, Umer Rafi, Bastian Leibe
In the past decade many robots were deployed in the wild, and people detection and tracking is an important component of such deployments.
1 code implementation • 6 Apr 2018 • Lucas Beyer, Alexander Hermans, Timm Linder, Kai O. Arras, Bastian Leibe
Detecting humans is a key skill for mobile robots and intelligent vehicles in a large variety of applications.
1 code implementation • 5 Feb 2018 • Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe
The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured point clouds, achieving encouraging segmentation results.
1 code implementation • 23 Dec 2017 • Aljoša Ošep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
We explore object discovery and detector adaptation based on unlabeled video sequences captured from a mobile platform.
no code implementations • 21 Dec 2017 • Aljoša Ošep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe
In this paper, we propose a model-free multi-object tracking approach that uses a category-agnostic image segmentation method to track objects.
no code implementations • 28 Jun 2017 • Paul Voigtlaender, Bastian Leibe
We tackle the task of semi-supervised video object segmentation, i. e. segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame.
Ranked #2 on
Visual Object Tracking
on YouTube-VOS 2018
Semantic Segmentation
Semi-Supervised Video Object Segmentation
+2
2 code implementations • 31 May 2017 • Vitaly Kurin, Sebastian Nowozin, Katja Hofmann, Lucas Beyer, Bastian Leibe
Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce.
2 code implementations • 12 May 2017 • Lucas Beyer, Stefan Breuers, Vitaly Kurin, Bastian Leibe
With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong.
31 code implementations • 22 Mar 2017 • Alexander Hermans, Lucas Beyer, Bastian Leibe
In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning.
Ranked #3 on
Person Re-Identification
on CUHK03
(Rank-5 metric)
no code implementations • CVPR 2017 • Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe
Supervised deep learning often suffers from the lack of sufficient training data.
no code implementations • 7 Feb 2017 • Anton Kasyanov, Francis Engelmann, Jörg Stückler, Bastian Leibe
Our visual-inertial SLAM system is based on a real-time capable visual-inertial odometry method that provides locally consistent trajectory and map estimates.
2 code implementations • 6 Dec 2016 • David Stutz, Alexander Hermans, Bastian Leibe
As such, and due to their quick adoption in a wide range of applications, appropriate benchmarks are crucial for algorithm selection and comparison.
4 code implementations • CVPR 2017 • Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe
Therefore, additional processing steps have to be performed in order to obtain pixel-accurate segmentation masks at the full image resolution.
Ranked #24 on
Thermal Image Segmentation
on MFN Dataset
no code implementations • CVPR 2016 • Theodora Kontogianni, Markus Mathias, Bastian Leibe
Abstract In this paper, we address the problem of object discovery in time-varying, large-scale image collections.
no code implementations • 8 Mar 2016 • Lucas Beyer, Alexander Hermans, Bastian Leibe
We propose a Convolutional Neural Network (CNN) based detector for this task.
no code implementations • 18 Sep 2014 • Tobias Weyand, Bastian Leibe
We evaluate how different choices of methods and parameters for the individual pipeline steps affect overall system performance and examine their effects for different query categories such as buildings, paintings or sculptures.
no code implementations • CVPR 2014 • Ilya Kostrikov, Esther Horbert, Bastian Leibe
In this paper, we propose a novel labeling cost for multi- view reconstruction.
no code implementations • CVPR 2013 • Tobias Baumgartner, Dennis Mitzel, Bastian Leibe
Current pedestrian tracking approaches ignore important aspects of human behavior.