1 code implementation • 18 Mar 2025 • Bastian Pätzold, Jan Nogga, Sven Behnke
This paper introduces a novel approach that integrates vision-language models (VLMs) with established methods for open-vocabulary detection (OVD), instance segmentation, and tracking.
2 code implementations • 30 Oct 2024 • Jonas Bode, Bastian Pätzold, Raphael Memmesheimer, Sven Behnke
Recent advances in large language models (LLMs) have been instrumental in autonomous robot control and human-robot interaction, as their vast general knowledge and capabilities allow them to understand and reason across a wide range of tasks and scenarios.
1 code implementation • 15 Sep 2022 • Bastian Pätzold, Simon Bultmann, Sven Behnke
The person keypoint detections from multiple views are received at a central backend where they are synchronized, filtered, and assigned to person hypotheses.