Search Results for author: Horst Possegger

Found 34 papers, 18 papers with code

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

1 code implementation8 Oct 2024 M. Jehanzeb Mirza, Mengjie Zhao, Zhuoyuan Mao, Sivan Doveh, Wei Lin, Paul Gavrikov, Michael Dorkenwald, Shiqi Yang, Saurav Jha, Hiromi Wakaki, Yuki Mitsufuji, Horst Possegger, Rogerio Feris, Leonid Karlinsky, James Glass

In each respective optimization step, the ranked prompts are fed as in-context examples (with their accuracies) to equip the LLM with the knowledge of the type of text prompts preferred by the downstream VLM.

Zero-Shot Learning

Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection

1 code implementation7 Aug 2024 Christian Fruhwirth-Reisinger, Wei Lin, Dušan Malić, Horst Bischof, Horst Possegger

To overcome these limitations, we propose a vision-language-guided unsupervised 3D detection approach that operates exclusively on LiDAR point clouds.

3D Object Detection Autonomous Driving +3

Into the Fog: Evaluating Multiple Object Tracking Robustness

1 code implementation12 Apr 2024 Nadezda Kirillova, M. Jehanzeb Mirza, Horst Possegger, Horst Bischof

To address these limitations, we propose a pipeline for physic-based volumetric fog simulation in arbitrary real-world MOT dataset utilizing frame-by-frame monocular depth estimation and a fog formation optical model.

Monocular Depth Estimation Multiple Object Tracking +1

MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection

1 code implementation CVPR 2024 Jakub Micorek, Horst Possegger, Dominik Narnhofer, Horst Bischof, Mateusz Kozinski

We propose a novel approach to video anomaly detection: we treat feature vectors extracted from videos as realizations of a random variable with a fixed distribution and model this distribution with a neural network.

Anomaly Detection In Surveillance Videos Denoising +2

Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs

1 code implementation18 Mar 2024 M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuehne, Horst Possegger

Prompt ensembling of Large Language Model (LLM) generated category-specific prompts has emerged as an effective method to enhance zero-shot recognition ability of Vision-Language Models (VLMs).

Language Modelling Large Language Model +1

Robust Localization of Key Fob Using Channel Impulse Response of Ultra Wide Band Sensors for Keyless Entry Systems

no code implementations16 Jan 2024 Abhiram Kolli, Filippo Casamassima, Horst Possegger, Horst Bischof

Using neural networks for localization of key fob within and surrounding a car as a security feature for keyless entry is fast emerging.

GACE: Geometry Aware Confidence Enhancement for Black-Box 3D Object Detectors on LiDAR-Data

1 code implementation ICCV 2023 David Schinagl, Georg Krispel, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof

Widely-used LiDAR-based 3D object detectors often neglect fundamental geometric information readily available from the object proposals in their confidence estimation.

Object

TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification

1 code implementation13 Sep 2023 M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogerio Feris, Horst Bischof

Vision and Language Models (VLMs), such as CLIP, have enabled visual recognition of a potentially unlimited set of categories described by text prompts.

Zero-Shot Learning

Sit Back and Relax: Learning to Drive Incrementally in All Weather Conditions

1 code implementation30 May 2023 Stefan Leitner, M. Jehanzeb Mirza, Wei Lin, Jakub Micorek, Marc Masana, Mateusz Kozinski, Horst Possegger, Horst Bischof

We propose to store these affine parameters as a memory bank for each weather condition and plug-in their weather-specific parameters during driving (i. e. test time) when the respective weather conditions are encountered.

Autonomous Driving Incremental Learning +2

MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds

no code implementations14 Dec 2022 Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof

The sensing process of large-scale LiDAR point clouds inevitably causes large blind spots, i. e. regions not visible to the sensor.

3D Object Detection Autonomous Driving +5

Sparse Message Passing Network with Feature Integration for Online Multiple Object Tracking

no code implementations6 Dec 2022 Bisheng Wang, Horst Possegger, Horst Bischof, Guo Cao

Existing Multiple Object Tracking (MOT) methods design complex architectures for better tracking performance.

Multiple Object Tracking

Video Test-Time Adaptation for Action Recognition

1 code implementation CVPR 2023 Wei Lin, Muhammad Jehanzeb Mirza, Mateusz Kozinski, Horst Possegger, Hilde Kuehne, Horst Bischof

Our proposed method demonstrates a substantial performance gain over existing test-time adaptation approaches in both evaluations of a single distribution shift and the challenging case of random distribution shifts.

Action Recognition Temporal Action Localization +1

ActMAD: Activation Matching to Align Distributions for Test-Time-Training

1 code implementation CVPR 2023 Muhammad Jehanzeb Mirza, Pol Jané Soneira, Wei Lin, Mateusz Kozinski, Horst Possegger, Horst Bischof

Test-Time-Training (TTT) is an approach to cope with out-of-distribution (OOD) data by adapting a trained model to distribution shifts occurring at test-time.

Image Classification

MATE: Masked Autoencoders are Online 3D Test-Time Learners

1 code implementation ICCV 2023 M. Jehanzeb Mirza, Inkyu Shin, Wei Lin, Andreas Schriebl, Kunyang Sun, Jaesung Choe, Horst Possegger, Mateusz Kozinski, In So Kweon, Kun-Jin Yoon, Horst Bischof

Our MATE is the first Test-Time-Training (TTT) method designed for 3D data, which makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data.

3D Object Classification Point Cloud Classification

An Efficient Domain-Incremental Learning Approach to Drive in All Weather Conditions

1 code implementation19 Apr 2022 M. Jehanzeb Mirza, Marc Masana, Horst Possegger, Horst Bischof

This catastrophic forgetting is typically addressed via incremental learning approaches which usually re-train the model by either keeping a memory bank of training samples or keeping a copy of the entire model or model parameters for each scenario.

Autonomous Driving Incremental Learning +2

CycDA: Unsupervised Cycle Domain Adaptation from Image to Video

1 code implementation30 Mar 2022 Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof

To address these challenges, we propose Cycle Domain Adaptation (CycDA), a cycle-based approach for unsupervised image-to-video domain adaptation by leveraging the joint spatial information in images and videos on the one hand and, on the other hand, training an independent spatio-temporal model to bridge the modality gap.

Action Recognition Domain Adaptation +1

3D Human Pose Estimation Using Möbius Graph Convolutional Networks

no code implementations20 Mar 2022 Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof

In particular, this allows us to directly and explicitly encode the transformation between joints, resulting in a significantly more compact representation.

3D Human Pose Estimation

The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization

1 code implementation CVPR 2022 M. Jehanzeb Mirza, Jakub Micorek, Horst Possegger, Horst Bischof

This can be a hurdle in fields which require continuous dynamic adaptation or suffer from scarcity of data, e. g. autonomous driving in challenging weather conditions.

Autonomous Driving object-detection +3

FAST3D: Flow-Aware Self-Training for 3D Object Detectors

no code implementations18 Oct 2021 Christian Fruhwirth-Reisinger, Michael Opitz, Horst Possegger, Horst Bischof

In the field of autonomous driving, self-training is widely applied to mitigate distribution shifts in LiDAR-based 3D object detectors.

Autonomous Driving Object +1

FuseSeg: LiDAR Point Cloud Segmentation Fusing Multi-Modal Data

no code implementations18 Dec 2019 Georg Krispel, Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof

We introduce a simple yet effective fusion method of LiDAR and RGB data to segment LiDAR point clouds.

Point Cloud Segmentation

Deep 2.5D Vehicle Classification with Sparse SfM Depth Prior for Automated Toll Systems

no code implementations9 May 2018 Georg Waltner, Michael Maurer, Thomas Holzmann, Patrick Ruprecht, Michael Opitz, Horst Possegger, Friedrich Fraundorfer, Horst Bischof

Furthermore due to the design of the network, at test time only the 2D camera images are required for classification which enables the usage in portable computer vision systems.

3D Object Classification Classification +2

Deep Metric Learning with BIER: Boosting Independent Embeddings Robustly

1 code implementation15 Jan 2018 Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof

To this end, we divide the last embedding layer of a deep network into an embedding ensemble and formulate training this ensemble as an online gradient boosting problem.

Diversity Image Retrieval +2

BIER - Boosting Independent Embeddings Robustly

no code implementations ICCV 2017 Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof

Learning similarity functions between image pairs with deep neural networks yields highly correlated activations of large embeddings.

Image Retrieval Metric Learning +1

Grid Loss: Detecting Occluded Faces

no code implementations1 Sep 2016 Michael Opitz, Georg Waltner, Georg Poier, Horst Possegger, Horst Bischof

Detection of partially occluded objects is a challenging computer vision problem.

Face Detection

Encoding Based Saliency Detection for Videos and Images

no code implementations CVPR 2015 Thomas Mauthner, Horst Possegger, Georg Waltner, Horst Bischof

We present a novel video saliency detection method to support human activity recognition and weakly supervised training of activity detection algorithms.

Activity Detection Human Activity Recognition +4

In Defense of Color-Based Model-Free Tracking

no code implementations CVPR 2015 Horst Possegger, Thomas Mauthner, Horst Bischof

We evaluate our approach on recent online tracking benchmark datasets demonstrating state-of-the-art results.

Object Object Tracking

Occlusion Geodesics for Online Multi-Object Tracking

no code implementations CVPR 2014 Horst Possegger, Thomas Mauthner, Peter M. Roth, Horst Bischof

Robust multi-object tracking-by-detection requires the correct assignment of noisy detection results to object trajectories.

motion prediction Multi-Object Tracking +2

Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities

no code implementations CVPR 2013 Horst Possegger, Sabine Sternig, Thomas Mauthner, Peter M. Roth, Horst Bischof

Combining foreground images from multiple views by projecting them onto a common ground-plane has been recently applied within many multi-object tracking approaches.

3D Object Tracking Multi-Object Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.