Search Results for author: M. Saquib Sarfraz

Found 27 papers, 17 papers with code

Optimizing Small Language Models for In-Vehicle Function-Calling

no code implementations4 Jan 2025 Yahya Sowti Khiabani, Farris Atif, Chieh Hsu, Sven Stahlmann, Tobias Michels, Sebastian Kramer, Benedikt Heidrich, M. Saquib Sarfraz, Julian Merten, Faezeh Tafazzoli

Our results demonstrate the potential of SLMs to transform vehicle control systems, enabling more intuitive interactions between users and their vehicles for an enhanced driving experience.

Model Compression Quantization

Muscles in Time: Learning to Understand Human Motion by Simulating Muscle Activations

1 code implementation31 Oct 2024 David Schneider, Simon Reiß, Marco Kugler, Alexander Jaus, Kunyu Peng, Susanne Sutschet, M. Saquib Sarfraz, Sven Matthiesen, Rainer Stiefelhagen

In this work, we address this issue by establishing Muscles in Time (MinT), a large-scale synthetic muscle activation dataset.

Advancing Open-Set Domain Generalization Using Evidential Bi-Level Hardest Domain Scheduler

1 code implementation26 Sep 2024 Kunyu Peng, Di Wen, Kailun Yang, Ao Luo, Yufan Chen, Jia Fu, M. Saquib Sarfraz, Alina Roitberg, Rainer Stiefelhagen

In this paper, we observe that an adaptive domain scheduler benefits more in OSDG compared with prefixed sequential and random domain schedulers.

Data Augmentation Domain Generalization +1

Spacewalker: Traversing Representation Spaces for Fast Interactive Exploration and Annotation of Unstructured Data

1 code implementation25 Sep 2024 Lukas Heine, Fabian Hörst, Jana Fragemann, Gijs Luijten, Jan Egger, Fin Bahnsen, M. Saquib Sarfraz, Jens Kleesiek, Constantin Seibold

In industries such as healthcare, finance, and manufacturing, analysis of unstructured textual data presents significant challenges for analysis and decision making.

Decision Making

Referring Atomic Video Action Recognition

1 code implementation2 Jul 2024 Kunyu Peng, Jia Fu, Kailun Yang, Di Wen, Yufan Chen, Ruiping Liu, Junwei Zheng, Jiaming Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

Since these existing methods underperform on RAVAR, we introduce RefAtomNet -- a novel cross-stream attention-driven method specialized for the unique challenges of RAVAR: the need to interpret a textual referring expression for the targeted individual, utilize this reference to guide the spatial localization and harvest the prediction of the atomic actions for the referring person.

Action Recognition Question Answering +4

AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks

no code implementations22 May 2024 Omar Moured, Jiaming Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen

Chart summarization is a crucial task for blind and visually impaired individuals as it is their primary means of accessing and interpreting graphical data.

Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

1 code implementation4 May 2024 M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer, Kunyu Peng, Marios Koulakis

The current state of machine learning scholarship in Timeseries Anomaly Detection (TAD) is plagued by the persistent use of flawed evaluation metrics, inconsistent benchmarking practices, and a lack of proper justification for the choices made in novel deep learning-based model designs.

Anomaly Detection Benchmarking +3

Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

1 code implementation30 Jan 2024 Ruiping Liu, Jiaming Zhang, Kunyu Peng, Yufan Chen, Ke Cao, Junwei Zheng, M. Saquib Sarfraz, Kailun Yang, Rainer Stiefelhagen

Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework.

Autonomous Vehicles Scene Segmentation

Navigating Open Set Scenarios for Skeleton-based Action Recognition

1 code implementation11 Dec 2023 Kunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones.

cross-modal alignment Novelty Detection +4

Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning

no code implementations8 Nov 2023 Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

Deep learning based object detectors struggle generalizing to a new target domain bearing significant variations in object and background.

Object object-detection +1

Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains

2 code implementations15 May 2023 Kunyu Peng, Di Wen, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

In this work, we focus on Few-Shot Domain Adaptation for Activity Recognition (FSDA-AR), which leverages a very small amount of labeled target videos to achieve effective adaptation.

Action Recognition Unsupervised Domain Adaptation

Towards Improving Calibration in Object Detection Under Domain Shift

no code implementations15 Sep 2022 Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

To this end, we first propose a new, plug-and-play, train-time calibration loss for object detection (coined as TCD).

Decision Making Object +3

Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection

no code implementations1 Oct 2021 Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

In this paper, we propose to leverage model predictive uncertainty to strike the right balance between adversarial feature alignment and class-level alignment.

Object object-detection +1

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation

1 code implementation CVPR 2021 M. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc van Gool, Rainer Stiefelhagen

Action segmentation refers to inferring boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks.

Clustering Segmentation +2

Anchor-free Small-scale Multispectral Pedestrian Detection

1 code implementation19 Aug 2020 Alexander Wolpert, Michael Teutsch, M. Saquib Sarfraz, Rainer Stiefelhagen

In this way, we can both simplify the network architecture and achieve higher detection performance, especially for pedestrians under occlusion or at low object resolution.

Autonomous Driving Data Augmentation +3

Content and Colour Distillation for Learning Image Translations with the Spatial Profile Loss

1 code implementation1 Aug 2019 M. Saquib Sarfraz, Constantin Seibold, Haroon Khalid, Rainer Stiefelhagen

In this paper, we propose a novel method of computing the loss directly between the source and target images that enable proper distillation of shape/content and colour/style.

Image Super-Resolution Translation

Efficient Parameter-free Clustering Using First Neighbor Relations

1 code implementation28 Feb 2019 M. Saquib Sarfraz, Vivek Sharma, Rainer Stiefelhagen

We present a new clustering method in the form of a single clustering equation that is able to directly discover groupings in the data.

Clustering

A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking

2 code implementations CVPR 2018 M. Saquib Sarfraz, Arne Schumann, Andreas Eberle, Rainer Stiefelhagen

In contrast to the recent direction of explicitly modeling body parts or correcting for misalignment based on these, we show that a rather straightforward inclusion of acquired camera view and/or the detected joint locations into a convolutional neural network helps to learn a very effective representation.

Person Re-Identification Re-Ranking +1

Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model

no code implementations19 Jul 2017 M. Saquib Sarfraz, Arne Schumann, Yan Wang, Rainer Stiefelhagen

The visual cues hinting at attributes can be strongly localized and inference of person attributes such as hair, backpack, shorts, etc., are highly dependent on the acquired view of the pedestrian.

Attribute Multi-Label Image Classification +2

Deep Perceptual Mapping for Cross-Modal Face Recognition

no code implementations20 Jan 2016 M. Saquib Sarfraz, Rainer Stiefelhagen

Our method bridges the drop in performance due to the modality gap by more than 40\%.

Face Recognition

Deep Perceptual Mapping for Thermal to Visible Face Recognition

no code implementations10 Jul 2015 M. Saquib Sarfraz, Rainer Stiefelhagen

Cross modal face matching between the thermal and visible spectrum is a much de- sired capability for night-time surveillance and security applications.

Face Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.