1 code implementation • 8 Oct 2024 • M. Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogerio Feris, Leonid Karlinsky, James Glass
In each respective optimization step, the ranked prompts are fed as in-context examples (with their accuracies) to equip the LLM with the knowledge of the type of text prompts preferred by the downstream VLM.
no code implementations • 24 Sep 2024 • Alexander Prutsch, Horst Bischof, Horst Possegger
For efficient and safe autonomous driving, it is essential that autonomous vehicles can predict the motion of other traffic agents.
1 code implementation • 7 Aug 2024 • Christian Fruhwirth-Reisinger, Wei Lin, Dušan Malić, Horst Bischof, Horst Possegger
To overcome these limitations, we propose a vision-language-guided unsupervised 3D detection approach that operates exclusively on LiDAR point clouds.
1 code implementation • 12 Apr 2024 • Nadezda Kirillova, M. Jehanzeb Mirza, Horst Possegger, Horst Bischof
To address these limitations, we propose a pipeline for physic-based volumetric fog simulation in arbitrary real-world MOT dataset utilizing frame-by-frame monocular depth estimation and a fog formation optical model.
1 code implementation • CVPR 2024 • Jakub Micorek, Horst Possegger, Dominik Narnhofer, Horst Bischof, Mateusz Kozinski
We propose a novel approach to video anomaly detection: we treat feature vectors extracted from videos as realizations of a random variable with a fixed distribution and model this distribution with a neural network.
Ranked #1 on Anomaly Detection on CUHK Avenue
1 code implementation • 18 Mar 2024 • M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuehne, Horst Possegger
Prompt ensembling of Large Language Model (LLM) generated category-specific prompts has emerged as an effective method to enhance zero-shot recognition ability of Vision-Language Models (VLMs).
no code implementations • 16 Jan 2024 • Abhiram Kolli, Filippo Casamassima, Horst Possegger, Horst Bischof
Using neural networks for localization of key fob within and surrounding a car as a security feature for keyless entry is fast emerging.
1 code implementation • ICCV 2023 • David Schinagl, Georg Krispel, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof
Widely-used LiDAR-based 3D object detectors often neglect fundamental geometric information readily available from the object proposals in their confidence estimation.
1 code implementation • 13 Sep 2023 • M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogerio Feris, Horst Bischof
Vision and Language Models (VLMs), such as CLIP, have enabled visual recognition of a potentially unlimited set of categories described by text prompts.
1 code implementation • 30 May 2023 • Stefan Leitner, M. Jehanzeb Mirza, Wei Lin, Jakub Micorek, Marc Masana, Mateusz Kozinski, Horst Possegger, Horst Bischof
We propose to store these affine parameters as a memory bank for each weather condition and plug-in their weather-specific parameters during driving (i. e. test time) when the respective weather conditions are encountered.
1 code implementation • ICCV 2023 • Wei Lin, Leonid Karlinsky, Nina Shvetsova, Horst Possegger, Mateusz Kozinski, Rameswar Panda, Rogerio Feris, Hilde Kuehne, Horst Bischof
We adapt a VL model for zero-shot and few-shot action recognition using a collection of unlabeled videos and an unpaired action dictionary.
Ranked #3 on Zero-Shot Action Recognition on Charades
no code implementations • 9 Mar 2023 • Wei Lin, Anna Kukleva, Horst Possegger, Hilde Kuehne, Horst Bischof
Temporal action segmentation in untrimmed videos has gained increased attention recently.
no code implementations • 14 Dec 2022 • Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof
The sensing process of large-scale LiDAR point clouds inevitably causes large blind spots, i. e. regions not visible to the sensor.
no code implementations • 6 Dec 2022 • Bisheng Wang, Horst Possegger, Horst Bischof, Guo Cao
Existing Multiple Object Tracking (MOT) methods design complex architectures for better tracking performance.
1 code implementation • CVPR 2023 • Wei Lin, Muhammad Jehanzeb Mirza, Mateusz Kozinski, Horst Possegger, Hilde Kuehne, Horst Bischof
Our proposed method demonstrates a substantial performance gain over existing test-time adaptation approaches in both evaluations of a single distribution shift and the challenging case of random distribution shifts.
1 code implementation • CVPR 2023 • Muhammad Jehanzeb Mirza, Pol Jané Soneira, Wei Lin, Mateusz Kozinski, Horst Possegger, Horst Bischof
Test-Time-Training (TTT) is an approach to cope with out-of-distribution (OOD) data by adapting a trained model to distribution shifts occurring at test-time.
1 code implementation • ICCV 2023 • M. Jehanzeb Mirza, Inkyu Shin, Wei Lin, Andreas Schriebl, Kunyang Sun, Jaesung Choe, Horst Possegger, Mateusz Kozinski, In So Kweon, Kun-Jin Yoon, Horst Bischof
Our MATE is the first Test-Time-Training (TTT) method designed for 3D data, which makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data.
no code implementations • 10 Nov 2022 • Abhiram Kolli, Muhammad Jehanzeb Mirza, Horst Possegger, Horst Bischof
Keyless entry systems in cars are adopting neural networks for localizing its operators.
1 code implementation • 14 Oct 2022 • Dušan Malić, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof
State-of-the-art unsupervised domain adaptation approaches outsource methods to overcome the object size bias.
1 code implementation • 19 Apr 2022 • M. Jehanzeb Mirza, Marc Masana, Horst Possegger, Horst Bischof
This catastrophic forgetting is typically addressed via incremental learning approaches which usually re-train the model by either keeping a memory bank of training samples or keeping a copy of the entire model or model parameters for each scenario.
1 code implementation • CVPR 2022 • David Schinagl, Georg Krispel, Horst Possegger, Peter M. Roth, Horst Bischof
These maps indicate the importance of each 3D point in predicting the specific objects.
1 code implementation • 30 Mar 2022 • Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof
To address these challenges, we propose Cycle Domain Adaptation (CycDA), a cycle-based approach for unsupervised image-to-video domain adaptation by leveraging the joint spatial information in images and videos on the one hand and, on the other hand, training an independent spatio-temporal model to bridge the modality gap.
no code implementations • 20 Mar 2022 • Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof
In particular, this allows us to directly and explicitly encode the transformation between joints, resulting in a significantly more compact representation.
Ranked #65 on 3D Human Pose Estimation on Human3.6M
1 code implementation • CVPR 2022 • M. Jehanzeb Mirza, Jakub Micorek, Horst Possegger, Horst Bischof
This can be a hurdle in fields which require continuous dynamic adaptation or suffer from scarcity of data, e. g. autonomous driving in challenging weather conditions.
no code implementations • 18 Oct 2021 • Christian Fruhwirth-Reisinger, Michael Opitz, Horst Possegger, Horst Bischof
In the field of autonomous driving, self-training is widely applied to mitigate distribution shifts in LiDAR-based 3D object detectors.
no code implementations • 18 Dec 2019 • Georg Krispel, Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
We introduce a simple yet effective fusion method of LiDAR and RGB data to segment LiDAR point clouds.
no code implementations • 9 May 2018 • Georg Waltner, Michael Maurer, Thomas Holzmann, Patrick Ruprecht, Michael Opitz, Horst Possegger, Friedrich Fraundorfer, Horst Bischof
Furthermore due to the design of the network, at test time only the 2D camera images are required for classification which enables the usage in portable computer vision systems.
1 code implementation • 15 Jan 2018 • Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
To this end, we divide the last embedding layer of a deep network into an embedding ensemble and formulate training this ensemble as an online gradient boosting problem.
Ranked #13 on Image Retrieval on SOP
no code implementations • ICCV 2017 • Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
Learning similarity functions between image pairs with deep neural networks yields highly correlated activations of large embeddings.
no code implementations • 1 Sep 2016 • Michael Opitz, Georg Waltner, Georg Poier, Horst Possegger, Horst Bischof
Detection of partially occluded objects is a challenging computer vision problem.
no code implementations • CVPR 2015 • Thomas Mauthner, Horst Possegger, Georg Waltner, Horst Bischof
We present a novel video saliency detection method to support human activity recognition and weakly supervised training of activity detection algorithms.
no code implementations • CVPR 2015 • Horst Possegger, Thomas Mauthner, Horst Bischof
We evaluate our approach on recent online tracking benchmark datasets demonstrating state-of-the-art results.
no code implementations • CVPR 2014 • Horst Possegger, Thomas Mauthner, Peter M. Roth, Horst Bischof
Robust multi-object tracking-by-detection requires the correct assignment of noisy detection results to object trajectories.
no code implementations • CVPR 2013 • Horst Possegger, Sabine Sternig, Thomas Mauthner, Peter M. Roth, Horst Bischof
Combining foreground images from multiple views by projecting them onto a common ground-plane has been recently applied within many multi-object tracking approaches.