1 code implementation • 7 Feb 2025 • Steffen Eger, Yong Cao, Jennifer D'Souza, Andreas Geiger, Christian Greisinger, Stephanie Gross, Yufang Hou, Brigitte Krenn, Anne Lauscher, Yizhi Li, Chenghua Lin, Nafise Sadat Moosavi, Wei Zhao, Tristan Miller
With the advent of large multimodal language models, science is now at a threshold of an AI-based technological transformation.
no code implementations • 6 Feb 2025 • Eyvaz Najafli, Marius Kästingschäfer, Sebastian Bernhard, Thomas Brox, Andreas Geiger
We propose sshELF, a fast, single-shot pipeline for sparse-view 3D scene reconstruction via hierarchal extrapolation of latent features.
no code implementations • 24 Jan 2025 • Shaofei Wang, Tomas Simon, Igor Santesteban, Timur Bagautdinov, Junxuan Li, Vasu Agrawal, Fabian Prada, Shoou-I Yu, Pace Nalbone, Matt Gramlich, Roman Lubachersky, Chenglei Wu, Javier Romero, Jason Saragih, Michael Zollhoefer, Andreas Geiger, Siyu Tang, Shunsuke Saito
This allows us to learn diffuse radiance transfer in a local coordinate frame, which disentangles the local radiance transfer from the articulation of the body.
no code implementations • 30 Dec 2024 • Yuanbo Yang, Jiahao Shao, Xinyang Li, Yujun Shen, Andreas Geiger, Yiyi Liao
In this work, we introduce Prometheus, a 3D-aware latent diffusion model for text-to-3D generation at both object and scene levels in seconds.
no code implementations • 18 Dec 2024 • Huan Lei, Hongdong Li, Andreas Geiger, Anthony Dick
3D shape analysis has been largely focused on traditional 3D representations of point clouds and meshes, but the discrete nature of these data makes the analysis susceptible to variations in input resolutions.
1 code implementation • 12 Dec 2024 • Julian Zimmerlin, Jens Beißwenger, Bernhard Jaeger, Andreas Geiger, Kashyap Chitta
End-to-end driving systems have made rapid progress, but have so far not been applied to the challenging new CARLA Leaderboard 2. 0.
no code implementations • 2 Dec 2024 • HongYu Zhou, Longzhong Lin, Jiabao Wang, Yichong Lu, Dongfeng Bai, Bingbing Liu, Yue Wang, Andreas Geiger, Yiyi Liao
In the past few decades, autonomous driving algorithms have made significant progress in perception, planning, and control.
no code implementations • 28 Nov 2024 • Yichong Lu, Yichi Cai, Shangzhan Zhang, HongYu Zhou, Haoji Hu, Huimin Yu, Andreas Geiger, Yiyi Liao
In this work, we introduce UrbanCAD, a framework that pushes the frontier of the photorealism-controllability trade-off by generating highly controllable and photorealistic 3D vehicle digital twins from a single urban image and a collection of free 3D CAD models and handcrafted materials.
no code implementations • 12 Nov 2024 • Niklas Hanselmann, Simon Doll, Marius Cordts, Hendrik P. A. Lensch, Andreas Geiger
To handle the complexities of real-world traffic, learning planners for self-driving from data is a promising direction.
1 code implementation • 17 Oct 2024 • Takeru Miyato, Sindy Löwe, Andreas Geiger, Max Welling
It has long been known in both neuroscience and AI that ``binding'' between neurons leads to a form of competitive learning where representations are compressed in order to represent more abstract concepts in deeper layers of the network.
2 code implementations • 17 Oct 2024 • Haofei Xu, Songyou Peng, Fangjinhua Wang, Hermann Blum, Daniel Barath, Andreas Geiger, Marc Pollefeys
Gaussian splatting and single/multi-view depth estimation are typically studied in isolation.
no code implementations • 24 Sep 2024 • Chuqiao Li, Julian Chibane, Yannan He, Naama Pearl, Andreas Geiger, Gerard Pons-Moll
In contrast, Unimotion allows to control motion with global text, or local frame-level text, or both at once, providing more flexible control for users.
no code implementations • 4 Sep 2024 • Stefano Esposito, Anpei Chen, Christian Reiser, Samuel Rota Bulò, Lorenzo Porzi, Katja Schwarz, Christian Richardt, Michael Zollhöfer, Peter Kontschieder, Andreas Geiger
High-quality real-time view synthesis methods are based on volume rendering, splatting, or surface rendering.
no code implementations • 17 Jul 2024 • Sheng Miao, Jiaxin Huang, Dongfeng Bai, Weichao Qiu, Bingbing Liu, Andreas Geiger, Yiyi Liao
Recent advances in implicit scene representation enable high-fidelity street view novel view synthesis.
no code implementations • 11 Jul 2024 • Haoyu He, Markus Flicke, Jan Buchmann, Iryna Gurevych, Andreas Geiger
We address the technical challenge of implementing HDT's sample-dependent hierarchical attention pattern by developing a novel sparse attention kernel that considers the hierarchical structure of documents.
1 code implementation • 5 Jul 2024 • Anpei Chen, Haofei Xu, Stefano Esposito, Siyu Tang, Andreas Geiger
Radiance field methods have achieved photorealistic novel view synthesis and geometry reconstruction.
2 code implementations • 21 Jun 2024 • Daniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta
On a large set of challenging scenarios, we observe that simple methods with moderate compute requirements such as TransFuser can match recent large-scale end-to-end driving architectures such as UniAD.
2 code implementations • 27 May 2024 • Shenyuan Gao, Jiazhi Yang, Li Chen, Kashyap Chitta, Yihang Qiu, Andreas Geiger, Jun Zhang, Hongyang Li
In this paper, we present Vista, a generalizable driving world model with high fidelity and versatile controllability.
2 code implementations • 16 Apr 2024 • Zehao Yu, Torsten Sattler, Andreas Geiger
Recently, 3D Gaussian Splatting (3DGS) has demonstrated impressive novel view synthesis results, while allowing the rendering of high-resolution images in real-time.
2 code implementations • 26 Mar 2024 • Binbin Huang, Zehao Yu, Anpei Chen, Andreas Geiger, Shenghua Gao
3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking.
1 code implementation • 26 Mar 2024 • Kashyap Chitta, Daniel Dauner, Andreas Geiger
SLEDGE is the first generative simulator for vehicle motion planning trained on real-world driving logs.
1 code implementation • 21 Mar 2024 • Yuedong Chen, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, Jianfei Cai
We introduce MVSplat, an efficient model that, given sparse multi-view images as input, predicts clean feed-forward 3D Gaussians.
Ranked #1 on
Generalizable Novel View Synthesis
on ACID
no code implementations • CVPR 2024 • HongYu Zhou, Jiahao Shao, Lu Xu, Dongfeng Bai, Weichao Qiu, Bingbing Liu, Yue Wang, Andreas Geiger, Yiyi Liao
Holistic understanding of urban scenes based on RGB images is a challenging yet important problem.
no code implementations • 14 Mar 2024 • Haiwen Huang, Songyou Peng, Dan Zhang, Andreas Geiger
Names are essential to both human cognition and vision-language models.
3 code implementations • CVPR 2024 • Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li
In this paper, we introduce the first large-scale video prediction model in the autonomous driving discipline.
1 code implementation • 11 Mar 2024 • Stefan Baur, Frank Moosmann, Andreas Geiger
3D object detection is one of the most important components in any Self-Driving stack, but current state-of-the-art (SOTA) lidar object detectors require costly & slow manual annotation of 3D bounding boxes to perform well.
no code implementations • 19 Feb 2024 • Christian Reiser, Stephan Garbin, Pratul P. Srinivasan, Dor Verbin, Richard Szeliski, Ben Mildenhall, Jonathan T. Barron, Peter Hedman, Andreas Geiger
Third, we minimize the binary entropy of the opacity values, which facilitates the extraction of surface geometry by encouraging opacity values to binarize towards the end of training.
2 code implementations • 21 Dec 2023 • Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Jens Beißwenger, Ping Luo, Andreas Geiger, Hongyang Li
The experiments demonstrate that Graph VQA provides a simple, principled framework for reasoning about a driving scene, and DriveLM-Data provides a challenging benchmark for this task.
no code implementations • CVPR 2024 • Zinuo You, Andreas Geiger, Anpei Chen
In contrast to previous fast reconstruction methods that represent the 3D scene globally, we model the light field of a scene as a set of local light field feature probes, parameterized with position and multi-channel 2D feature maps.
1 code implementation • CVPR 2024 • Zhiyin Qian, Shaofei Wang, Marko Mihajlovic, Andreas Geiger, Siyu Tang
In this paper, we use 3D Gaussian Splatting and learn a non-rigid deformation network to reconstruct animatable clothed human avatars that can be trained within 30 minutes and rendered at real-time frame rates (50+ FPS).
no code implementations • 13 Dec 2023 • Bernhard Jaeger, Andreas Geiger
These networks can be optimized with supervised learning, if the target objective is differentiable.
1 code implementation • CVPR 2024 • Shaofei Wang, Božidar Antić, Andreas Geiger, Siyu Tang
We present IntrinsicAvatar, a novel approach to recovering the intrinsic properties of clothed human avatars including geometry, albedo, material, and environment lighting from only monocular videos.
1 code implementation • CVPR 2024 • Haofei Xu, Anpei Chen, Yuedong Chen, Christos Sakaridis, Yulun Zhang, Marc Pollefeys, Andreas Geiger, Fisher Yu
We present Multi-Baseline Radiance Fields (MuRF), a general feed-forward approach to solving sparse view synthesis under multiple different baseline settings (small and large baselines, and different number of input views).
no code implementations • CVPR 2024 • Gege Gao, Weiyang Liu, Anpei Chen, Andreas Geiger, Bernhard Schölkopf
As pretrained text-to-image diffusion models become increasingly powerful, recent efforts have been made to distill knowledge from these text-to-image pretrained models for optimizing a text-guided 3D model.
1 code implementation • CVPR 2024 • Zehao Yu, Anpei Chen, Binbin Huang, Torsten Sattler, Andreas Geiger
Recently, 3D Gaussian Splatting has demonstrated impressive novel view synthesis results, reaching high fidelity and efficiency.
no code implementations • 22 Nov 2023 • Katja Schwarz, Seung Wook Kim, Jun Gao, Sanja Fidler, Andreas Geiger, Karsten Kreis
Then, we train a diffusion model in the 3D-aware latent space, thereby enabling synthesis of high-quality 3D-consistent image samples, outperforming recent state-of-the-art GAN-based methods.
1 code implementation • 16 Oct 2023 • Takeru Miyato, Bernhard Jaeger, Max Welling, Andreas Geiger
As transformers are equivariant to the permutation of input tokens, encoding the positional information of tokens is necessary for many tasks.
1 code implementation • 19 Sep 2023 • Xiao Fu, Tianrun Chen, Yichong Lu, Xiaowei Zhou, Andreas Geiger, Yiyi Liao
Our key insight lies in exploiting the complementarity of 3D and 2D priors to mutually enhance geometry and semantics.
no code implementations • 24 Aug 2023 • Tim Schreier, Katrin Renz, Andreas Geiger, Kashyap Chitta
Prior work in 3D object detection evaluates models using offline metrics like average precision since closed-loop online evaluation on the downstream driving task is costly.
1 code implementation • 29 Jun 2023 • Li Chen, Penghao Wu, Kashyap Chitta, Bernhard Jaeger, Andreas Geiger, Hongyang Li
The autonomous driving community has witnessed a rapid growth in approaches that embrace an end-to-end algorithm framework, utilizing raw sensor input to generate vehicle motion plans, instead of concentrating on individual tasks such as detection and motion prediction.
2 code implementations • 13 Jun 2023 • Daniel Dauner, Marcel Hallgarten, Andreas Geiger, Kashyap Chitta
The release of nuPlan marks a new era in vehicle motion planning research, offering the first large-scale real-world dataset and evaluation schemes requiring both precise short-term planning and long-horizon ego-forecasting.
1 code implementation • ICCV 2023 • Bernhard Jaeger, Kashyap Chitta, Andreas Geiger
End-to-end driving systems have recently made rapid progress, in particular on CARLA.
Ranked #1 on
CARLA longest6
on CARLA
no code implementations • 6 Jun 2023 • Carolin Schmitt, Božidar Antić, Andrei Neculai, Joo Ho Lee, Andreas Geiger
In this paper, we propose a novel method for joint recovery of camera pose, object geometry and spatially-varying Bidirectional Reflectance Distribution Function (svBRDF) of 3D scenes that exceed object-scale and hence cannot be captured with stationary light stages.
no code implementations • ICCV 2023 • Zijian Dong, Xu Chen, Jinlong Yang, Michael J. Black, Otmar Hilliges, Andreas Geiger
The key to progress is hence to learn generative models of 3D avatars from abundant unstructured 2D image collections.
no code implementations • 23 Feb 2023 • Christian Reiser, Richard Szeliski, Dor Verbin, Pratul P. Srinivasan, Ben Mildenhall, Andreas Geiger, Jonathan T. Barron, Peter Hedman
We design a lossless procedure for baking the parameterization used during training into a model that achieves real-time rendering while still preserving the photorealistic view synthesis quality of a volumetric radiance field.
no code implementations • 7 Feb 2023 • Zihan Zhu, Songyou Peng, Viktor Larsson, Zhaopeng Cui, Martin R. Oswald, Andreas Geiger, Marc Pollefeys
Neural implicit representations have recently become popular in simultaneous localization and mapping (SLAM), especially in dense visual SLAM.
1 code implementation • 2 Feb 2023 • Anpei Chen, Zexiang Xu, Xinyue Wei, Siyu Tang, Hao Su, Andreas Geiger
Our experiments show that DiF leads to improvements in approximation quality, compactness, and training time when compared to previous fast reconstruction methods.
1 code implementation • 23 Jan 2023 • Axel Sauer, Tero Karras, Samuli Laine, Andreas Geiger, Timo Aila
Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models.
Ranked #18 on
Text-to-Image Generation
on MS COCO
1 code implementation • 22 Dec 2022 • Haiwen Huang, Andreas Geiger, Dan Zhang
We address the task of open-world class-agnostic object detection, i. e., detecting every object in an image by learning from a limited number of base object classes.
Ranked #1 on
Open World Object Detection
on COCO VOC to non-VOC
1 code implementation • 28 Nov 2022 • Xu Chen, Tianjian Jiang, Jie Song, Max Rietmann, Andreas Geiger, Michael J. Black, Otmar Hilliges
A key challenge in making such methods applicable to articulated objects, such as the human body, is to model the deformation of 3D locations between the rest pose (a canonical space) and the deformed space.
1 code implementation • 10 Nov 2022 • Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Fisher Yu, DaCheng Tao, Andreas Geiger
We present a unified formulation and model for three motion and 3D perception tasks: optical flow, rectified stereo matching and unrectified stereo depth estimation from posed images.
Ranked #1 on
Optical Flow Estimation
on Sintel-clean
no code implementations • 28 Oct 2022 • Liangchen Song, Anpei Chen, Zhong Li, Zhang Chen, Lele Chen, Junsong Yuan, Yi Xu, Andreas Geiger
Visually exploring in a real-world 4D spatiotemporal space freely in VR has been a long-term quest.
1 code implementation • 27 Oct 2022 • Zifan Shi, Sida Peng, Yinghao Xu, Andreas Geiger, Yiyi Liao, Yujun Shen
In this survey, we thoroughly review the ongoing developments of 3D generative models, including methods that employ 2D and 3D supervision.
1 code implementation • 25 Oct 2022 • Katrin Renz, Kashyap Chitta, Otniel-Bogdan Mercea, A. Sophia Koepke, Zeynep Akata, Andreas Geiger
Planning an optimal route in a complex environment requires efficient reasoning about the surrounding scene.
Ranked #6 on
CARLA longest6
on CARLA
no code implementations • 18 Oct 2022 • Shaofei Wang, Katja Schwarz, Andreas Geiger, Siyu Tang
We demonstrate that our proposed pipeline can generate clothed avatars with high-quality pose-dependent geometry and appearance from a sparse set of multi-view RGB videos.
1 code implementation • 15 Jun 2022 • Katja Schwarz, Axel Sauer, Michael Niemeyer, Yiyi Liao, Andreas Geiger
State-of-the-art 3D-aware generative models rely on coordinate-based MLPs to parameterize 3D radiance fields.
1 code implementation • 1 Jun 2022 • Zehao Yu, Songyou Peng, Michael Niemeyer, Torsten Sattler, Andreas Geiger
Motivated by recent advances in the area of monocular geometry prediction, we systematically explore the utility these cues provide for improving neural implicit surface reconstruction.
3 code implementations • 31 May 2022 • Kashyap Chitta, Aditya Prakash, Bernhard Jaeger, Zehao Yu, Katrin Renz, Andreas Geiger
At the time of submission, TransFuser outperforms all prior work on the CARLA leaderboard in terms of driving score by a large margin.
Ranked #6 on
Autonomous Driving
on CARLA Leaderboard
1 code implementation • 28 Apr 2022 • Niklas Hanselmann, Katrin Renz, Kashyap Chitta, Apratim Bhattacharyya, Andreas Geiger
Simulators offer the possibility of safe, low-cost development of self-driving systems.
1 code implementation • 29 Mar 2022 • Xiao Fu, Shangzhan Zhang, Tianrun Chen, Yichong Lu, Lanyun Zhu, Xiaowei Zhou, Andreas Geiger, Yiyi Liao
In this work, we present a novel 3D-to-2D label transfer method, Panoptic NeRF, which aims for obtaining per-pixel 2D semantic and instance labels from easy-to-obtain coarse 3D bounding primitives.
2 code implementations • 17 Mar 2022 • Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su
We demonstrate that applying traditional CP decomposition -- that factorizes tensors into rank-one components with compact vectors -- in our framework leads to improvements over vanilla NeRF.
Ranked #3 on
Novel View Synthesis
on X3D
no code implementations • CVPR 2022 • Zijian Dong, Chen Guo, Jie Song, Xu Chen, Andreas Geiger, Otmar Hilliges
We present a novel method to learn Personalized Implicit Neural Avatars (PINA) from a short RGB-D sequence.
2 code implementations • 1 Feb 2022 • Axel Sauer, Katja Schwarz, Andreas Geiger
StyleGAN in particular sets new standards for generative modeling regarding image quality and controllability.
Ranked #1 on
Image Generation
on Pokemon 256x256
no code implementations • CVPR 2022 • Xu Chen, Tianjian Jiang, Jie Song, Jinlong Yang, Michael J. Black, Andreas Geiger, Otmar Hilliges
Furthermore, we show that our method can be used on the task of fitting human models to raw scans, outperforming the previous state-of-the-art.
1 code implementation • CVPR 2022 • Michael Niemeyer, Jonathan T. Barron, Ben Mildenhall, Mehdi S. M. Sajjadi, Andreas Geiger, Noha Radwan
We observe that the majority of artifacts in sparse input scenarios are caused by errors in the estimated scene geometry, and by divergent behavior at the start of training.
1 code implementation • NeurIPS 2021 • Katja Schwarz, Yiyi Liao, Andreas Geiger
2) Checkerboard artifacts introduced by upsampling cannot explain the spectral discrepancies alone as the generator is able to compensate for these artifacts.
3 code implementations • NeurIPS 2021 • Axel Sauer, Kashyap Chitta, Jens Müller, Andreas Geiger
Generative Adversarial Networks (GANs) produce high-quality images but are challenging to train.
Ranked #1 on
Image Generation
on Stanford Cars
1 code implementation • NeurIPS 2021 • Despoina Paschalidou, Amlan Kar, Maria Shugrina, Karsten Kreis, Andreas Geiger, Sanja Fidler
The ability to synthesize realistic and diverse indoor furniture layouts automatically or based on partial input, unlocks many applications, from better interactive 3D tools to data synthesis for training and simulation.
Ranked #3 on
Indoor Scene Synthesis
on PRO-teXt
2D Semantic Segmentation task 1 (8 classes)
3D Semantic Scene Completion
+1
2 code implementations • 28 Sep 2021 • Yiyi Liao, Jun Xie, Andreas Geiger
For the last few decades, several major subfields of artificial intelligence including computer vision, graphics, and robotics have progressed largely independently from each other.
1 code implementation • ICCV 2021 • Kashyap Chitta, Aditya Prakash, Andreas Geiger
Efficient reasoning about the semantic, spatial, and temporal structure of a scene is a crucial prerequisite for autonomous driving.
Ranked #4 on
Novel View Synthesis
on X3D
no code implementations • 9 Jul 2021 • Niklas Hanselmann, Nick Schneider, Benedikt Ortelt, Andreas Geiger
In order to handle the challenges of autonomous driving, deep learning has proven to be crucial in tackling increasingly complex tasks, such as 3D detection or instance segmentation.
1 code implementation • NeurIPS 2021 • Shaofei Wang, Marko Mihajlovic, Qianli Ma, Andreas Geiger, Siyu Tang
In contrast, we propose an approach that can quickly generate realistic clothed human avatars, represented as controllable neural SDFs, given only monocular depth images.
2 code implementations • NeurIPS 2021 • Songyou Peng, Chiyu "Max" Jiang, Yiyi Liao, Michael Niemeyer, Marc Pollefeys, Andreas Geiger
However, the implicit nature of neural implicit representations results in slow inference time and requires careful initialization.
2 code implementations • ICCV 2021 • Michael Oechsle, Songyou Peng, Andreas Geiger
At the same time, neural radiance fields have revolutionized novel view synthesis.
2 code implementations • CVPR 2021 • Aditya Prakash, Kashyap Chitta, Andreas Geiger
How should representations from complementary sensors be integrated for autonomous driving?
Ranked #1 on
Autonomous Driving
on Town05 Short
no code implementations • CVPR 2021 • Shaofei Wang, Andreas Geiger, Siyu Tang
We combine PTF with multi-class occupancy networks, obtaining a novel learning-based framework that learns to simultaneously predict shape and per-point correspondences between the posed space and the canonical space for clothed human.
2 code implementations • CVPR 2021 • Fabio Tosi, Yiyi Liao, Carolin Schmitt, Andreas Geiger
Despite stereo matching accuracy has greatly improved by deep learning in the last few years, recovering sharp boundaries and high-resolution outputs efficiently remains challenging.
1 code implementation • ICCV 2021 • Xu Chen, Yufeng Zheng, Michael J. Black, Otmar Hilliges, Andreas Geiger
However, this is problematic since the backward warp field is pose dependent and thus requires large amounts of data to learn.
Ranked #3 on
3D Human Reconstruction
on 4D-DRESS
no code implementations • 31 Mar 2021 • Michael Niemeyer, Andreas Geiger
At test time, our model generates images with explicit control over the camera as well as the shape and appearance of the scene.
4 code implementations • ICCV 2021 • Christian Reiser, Songyou Peng, Yiyi Liao, Andreas Geiger
NeRF synthesizes novel views of a scene with unprecedented quality by fitting a neural radiance field to RGB images.
1 code implementation • CVPR 2021 • Despoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler
The INN allows us to compute the inverse mapping of the homeomorphism, which in turn, enables the efficient computation of both the implicit surface function of a primitive and its mesh, without any additional post-processing.
1 code implementation • 23 Feb 2021 • Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen
The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation.
1 code implementation • ICLR 2021 • Axel Sauer, Andreas Geiger
Prior works on image classification show that instead of learning a connection to object shape, deep classifiers tend to exploit spurious correlations with low-level texture or the background for solving the classification task.
no code implementations • ICCV 2021 • Stefan Andreas Baur, David Josef Emmerichs, Frank Moosmann, Peter Pinggera, Bjorn Ommer, Andreas Geiger
Recently, several frameworks for self-supervised learning of 3D scene flow on point clouds have emerged.
1 code implementation • CVPR 2021 • Michael Niemeyer, Andreas Geiger
While several recent works investigate how to disentangle underlying factors of variation in the data, most of them operate in 2D and hence ignore that our world is three-dimensional.
5 code implementations • 16 Sep 2020 • Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe
Multi-Object Tracking (MOT) has been notoriously difficult to evaluate.
no code implementations • ECCV 2020 • Xu Chen, Zijian Dong, Jie Song, Andreas Geiger, Otmar Hilliges
Many object pose estimation algorithms rely on the analysis-by-synthesis framework which requires explicit representations of individual object instances.
1 code implementation • NeurIPS 2020 • Katja Schwarz, Yiyi Liao, Michael Niemeyer, Andreas Geiger
In contrast to voxel-based representations, radiance fields are not confined to a coarse discretization of the 3D space, yet allow for disentangling camera and scene properties while degrading gracefully in the presence of reconstruction ambiguity.
Ranked #2 on
Scene Generation
on VizDoom
no code implementations • 29 Jun 2020 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Justus Thies, Varun Jampani, Matthias Nießner, Andreas Geiger, Carsten Rother
Neural rendering techniques promise efficient photo-realistic image synthesis while at the same time providing rich control over scene parameters by learning the physical image formation process.
1 code implementation • 12 Jun 2020 • Marissa A. Weis, Kashyap Chitta, Yash Sharma, Wieland Brendel, Matthias Bethge, Andreas Geiger, Alexander S. Ecker
Perceiving the world in terms of objects and tracking them through time is a crucial prerequisite for reasoning and scene understanding.
no code implementations • 5 Jun 2020 • Paul Sanzenbacher, Lars Mescheder, Andreas Geiger
In recent years, deep generative models have gained significance due to their ability to synthesize natural-looking images with applications ranging from virtual reality to data augmentation for training computer vision models.
3 code implementations • 20 May 2020 • Aseem Behl, Kashyap Chitta, Aditya Prakash, Eshed Ohn-Bar, Andreas Geiger
Beyond label efficiency, we find several additional training benefits when leveraging visual abstractions, such as a significant reduction in the variance of the learned policy when compared to state-of-the-art end-to-end driving models.
1 code implementation • CVPR 2020 • Despoina Paschalidou, Luc van Gool, Andreas Geiger
Humans perceive the 3D world as a set of distinct objects that are characterized by various low-level (geometry, reflectance) and high-level (connectivity, adjacency, symmetry) properties.
3 code implementations • 27 Mar 2020 • Michael Oechsle, Michael Niemeyer, Lars Mescheder, Thilo Strauss, Andreas Geiger
In this work, we propose a novel implicit representation for capturing the visual appearance of an object in terms of its surface light field.
6 code implementations • ECCV 2020 • Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger
Recently, implicit neural representations have gained popularity for learning-based 3D reconstruction.
1 code implementation • 10 Feb 2020 • Peidong Liu, Joel Janai, Marc Pollefeys, Torsten Sattler, Andreas Geiger
Motion blurry images challenge many computer vision algorithms, e. g, feature detection, motion estimation, or object recognition.
1 code implementation • CVPR 2020 • Michael Niemeyer, Lars Mescheder, Michael Oechsle, Andreas Geiger
In this work, we propose a differentiable rendering formulation for implicit shape and texture representations.
1 code implementation • CVPR 2020 • Yiyi Liao, Katja Schwarz, Lars Mescheder, Andreas Geiger
We define the new task of 3D controllable image synthesis and propose an approach for solving it by reasoning both in 3D space and in the 2D image domain.
1 code implementation • ICCV 2019 • Anurag Ranjan, Joel Janai, Andreas Geiger, Michael J. Black
In this paper, we extend adversarial patch attacks to optical flow networks and show that such attacks can compromise their performance.
no code implementations • ICCV 2019 • Michael Oechsle, Lars Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger
A major reason for these limitations is that common representations of texture are inefficient or hard to interface for modern deep learning techniques.
no code implementations • 7 May 2019 • Ioan Andrei Bârsan, Peidong Liu, Marc Pollefeys, Andreas Geiger
We use both instance-aware semantic segmentation and sparse scene flow to classify objects as either background, moving, or potentially moving, thereby ensuring that the system is able to model objects with the potential to transition from static to dynamic, such as parked cars.
1 code implementation • CVPR 2019 • Despoina Paschalidou, Ali Osman Ulusoy, Andreas Geiger
Abstracting complex 3D shapes with parsimonious part-based representations has been a long standing goal in computer vision.
no code implementations • CVPR 2019 • Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe
This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS).
Ranked #6 on
Multi-Object Tracking
on MOTS20
Multi-Object Tracking
Multi-Object Tracking and Segmentation
+3
1 code implementation • CVPR 2018 • Despoina Paschalidou, Ali Osman Ulusoy, Carolin Schmitt, Luc van Gool, Andreas Geiger
RayNet integrates a CNN that learns view-invariant feature representations with an MRF that explicitly encodes the physics of perspective projection and occlusion.
1 code implementation • CVPR 2019 • Zhaoyang Lv, Frank Dellaert, James M. Rehg, Andreas Geiger
In this paper, we provide a modern synthesis of the classic inverse compositional algorithm for dense image alignment.
7 code implementations • CVPR 2019 • Lars Mescheder, Michael Oechsle, Michael Niemeyer, Sebastian Nowozin, Andreas Geiger
With the advent of deep neural networks, learning-based approaches for 3D reconstruction have gained popularity.
no code implementations • 12 Sep 2018 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Andreas Geiger, Carsten Rother
The task of generating natural images from 3D scenes has been a long standing goal in computer graphics.
no code implementations • ECCV 2018 • Benjamin Coors, Alexandru Paul Condurache, Andreas Geiger
Omnidirectional cameras offer great benefits over classical cameras wherever a wide field of view is essential, such as in virtual reality applications or in autonomous robots.
no code implementations • ECCV 2018 • Ian Cherabier, Johannes L. Schonberger, Martin R. Oswald, Marc Pollefeys, Andreas Geiger
In contrast to existing variational methods for semantic 3D reconstruction, our model is end-to-end trainable and captures more complex dependencies between the semantic labels and the 3D geometry.
no code implementations • ECCV 2018 • Joel Janai, Fatma Guney, Anurag Ranjan, Michael Black, Andreas Geiger
In this paper, we propose a framework for unsupervised learning of optical flow and occlusions over multiple frames.
1 code implementation • 18 Jun 2018 • Axel Sauer, Nikolay Savinov, Andreas Geiger
Most existing approaches to autonomous driving fall into one of two categories: modular pipelines, that build an extensive model of the environment, and imitation learning approaches, that map images directly to control outputs.
no code implementations • CVPR 2019 • Aseem Behl, Despoina Paschalidou, Simon Donné, Andreas Geiger
In this paper, we propose to estimate 3D motion from such unstructured point clouds using a deep neural network.
1 code implementation • CVPR 2018 • Yiyi Liao, Simon Donné, Andreas Geiger
Existing learning based solutions to 3D surface prediction cannot be trained end-to-end as they operate on intermediate representations (e. g., TSDF) from which 3D surface meshes must be extracted in a post-processing step (e. g., via the marching cubes algorithm).
1 code implementation • CVPR 2018 • David Stutz, Andreas Geiger
Learning-based approaches, in contrast, avoid the expensive optimization step and instead directly predict the complete shape from the incomplete observations using deep neural networks.
5 code implementations • 18 May 2018 • David Stutz, Andreas Geiger
We address the problem of 3D shape completion from sparse and noisy point clouds, a fundamental problem in computer vision and robotics.
9 code implementations • ICML 2018 • Lars Mescheder, Andreas Geiger, Sebastian Nowozin
In this paper, we show that the requirement of absolute continuity is necessary: we describe a simple yet prototypical counterexample showing that in the more realistic case of distributions that are not absolutely continuous, unregularized GAN training is not always convergent.
no code implementations • 22 Dec 2017 • Laura Sevilla-Lara, Yiyi Liao, Fatma Guney, Varun Jampani, Andreas Geiger, Michael J. Black
Here we take a deeper look at the combination of flow and action recognition, and investigate why optical flow is helpful, what makes a flow method good for action recognition, and how we can make it better.
no code implementations • CVPR 2018 • Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler
Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision.
no code implementations • ICCV 2017 • Aseem Behl, Omid Hosseini Jafari, Siva Karthik Mustikovela, Hassan Abu Alhaija, Carsten Rother, Andreas Geiger
Existing methods for 3D scene flow estimation often fail in the presence of large displacement or local ambiguities, e. g., at texture-less or reflective surfaces.
1 code implementation • 22 Aug 2017 • Jonas Uhrig, Nick Schneider, Lukas Schneider, Uwe Franke, Thomas Brox, Andreas Geiger
In this paper, we consider convolutional neural networks operating on sparse inputs with an application to depth upsampling from sparse laser scan data.
Ranked #16 on
Depth Completion
on KITTI Depth Completion
no code implementations • 4 Aug 2017 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Lars Mescheder, Andreas Geiger, Carsten Rother
Further, we demonstrate the utility of our approach on training standard deep models for semantic instance segmentation and object detection of cars in outdoor driving scenes.
no code implementations • CVPR 2017 • Joel Janai, Fatma Guney, Jonas Wulff, Michael J. Black, Andreas Geiger
Existing optical flow datasets are limited in size and variability due to the difficulty of capturing dense ground truth.
no code implementations • CVPR 2017 • Thomas Schops, Johannes L. Schonberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger
Motivated by the limitations of existing multi-view stereo benchmarks, we present a novel dataset for this task.
no code implementations • CVPR 2017 • Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, Marc Pollefeys
Adding the knowledge of direction of triangulation, we are able to approximate the position of the camera from two matches alone.
no code implementations • CVPR 2017 • Ali Osman Ulusoy, Michael J. Black, Andreas Geiger
Due to its probabilistic nature, the approach is able to cope with the approximate geometry of the 3D models as well as input shapes that are not present in the scene.
4 code implementations • NeurIPS 2017 • Lars Mescheder, Sebastian Nowozin, Andreas Geiger
In this paper, we analyze the numerics of common algorithms for training Generative Adversarial Networks (GANs).
no code implementations • 18 Apr 2017 • Joel Janai, Fatma Güney, Aseem Behl, Andreas Geiger
Towards this goal, we analyze the performance of the state of the art on several challenging benchmarking datasets, including KITTI, MOT, and Cityscapes.
1 code implementation • 4 Apr 2017 • Gernot Riegler, Ali Osman Ulusoy, Horst Bischof, Andreas Geiger
In this paper, we present a learning based approach to depth fusion, i. e., dense 3D reconstruction from multiple depth images.
1 code implementation • ICML 2017 • Lars Mescheder, Sebastian Nowozin, Andreas Geiger
We show that in the nonparametric limit our method yields an exact maximum-likelihood assignment for the parameters of the generative model, as well as the exact posterior distribution over the latent variables given an observation.
no code implementations • 21 Nov 2016 • Lars Mescheder, Sebastian Nowozin, Andreas Geiger
We present a new notion of probabilistic duality for random variables involving mixture distributions.
1 code implementation • CVPR 2017 • Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger
We present OctNet, a representation for deep learning with sparse 3D data.
no code implementations • CVPR 2016 • Ali Osman Ulusoy, Michael J. Black, Andreas Geiger
In this paper, we propose a non-local structured prior for volumetric multi-view 3D reconstruction.
no code implementations • ICCV 2015 • Chen Zhou, Fatma Guney, Yizhou Wang, Andreas Geiger
Despite recent progress, reconstructing outdoor scenes in 3D from movable platforms remains a highly difficult endeavour.
no code implementations • CVPR 2016 • Jun Xie, Martin Kiefel, Ming-Ting Sun, Andreas Geiger
Semantic annotations are vital for training models for object recognition, semantic segmentation or scene understanding.
no code implementations • CVPR 2015 • Moritz Menze, Andreas Geiger
We demonstrate the performance of our model on existing benchmarks as well as a novel realistic dataset with scene flow ground truth.
no code implementations • CVPR 2015 • Fatma Guney, Andreas Geiger
Stereo techniques have witnessed tremendous progress over the last decades, yet some aspects of the problem still remain challenging today.
no code implementations • ICCV 2015 • Philip Lenz, Andreas Geiger, Raquel Urtasun
One of the most popular approaches to multi-target tracking is tracking-by-detection.
Ranked #32 on
Multiple Object Tracking
on KITTI Test (Online Methods)
(MOTA metric)
no code implementations • CVPR 2013 • Marcus A. Brubaker, Andreas Geiger, Raquel Urtasun
In this paper we propose an affordable solution to selflocalization, which utilizes visual odometry and road maps as the only inputs.
no code implementations • NeurIPS 2011 • Andreas Geiger, Christian Wojek, Raquel Urtasun
We propose a novel generative model that is able to reason jointly about the 3D scene layout as well as the 3D location and orientation of objects in the scene.