Search Results for author: Gerhard Rigoll

Found 38 papers, 21 papers with code

Octuplet Loss: Make Face Recognition Robust to Image Resolution

1 code implementation14 Jul 2022 Martin Knoche, Mohamed Elkadeem, Stefan Hörmann, Gerhard Rigoll

To address this problem, we propose a novel combination of the popular triplet loss to improve robustness against image resolution via fine-tuning of existing face recognition models.

Face Recognition Face Verification

Wavelet Regularization Benefits Adversarial Training

1 code implementation8 Jun 2022 Jun Yan, Huilin Yin, Xiaoyang Deng, Ziming Zhao, Wancheng Ge, Hao Zhang, Gerhard Rigoll

Since adversarial vulnerability can be regarded as a high-frequency phenomenon, it is essential to regulate the adversarially-trained neural network models in the frequency domain.

Adversarial Robustness

Face Morphing: Fooling a Face Recognition System Is Simple!

no code implementations27 May 2022 Stefan Hörmann, Tianlin Kong, Torben Teepe, Fabian Herzog, Martin Knoche, Gerhard Rigoll

State-of-the-art face recognition (FR) approaches have shown remarkable results in predicting whether two faces belong to the same identity, yielding accuracies between 92% and 100% depending on the difficulty of the protocol.

Face Recognition

Towards a Deeper Understanding of Skeleton-based Gait Recognition

2 code implementations16 Apr 2022 Torben Teepe, Johannes Gilg, Fabian Herzog, Stefan Hörmann, Gerhard Rigoll

Gait recognition is a promising biometric with unique properties for identifying individuals from a long distance by their walking patterns.

Gait Recognition Pose Estimation

The Box Size Confidence Bias Harms Your Object Detector

1 code implementation3 Dec 2021 Johannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll

Recent work even suggests that detectors' confidence predictions are biased with respect to object size and position, but it is still unclear how this bias relates to the performance of the affected object detectors.

object-detection Object Detection

Cross-Quality LFW: A Database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments

1 code implementation23 Aug 2021 Martin Knoche, Stefan Hörmann, Gerhard Rigoll

Real-world face recognition applications often deal with suboptimal image quality or resolution due to different capturing conditions such as various subject-to-camera distances, poor camera settings, or motion blur.

Face Recognition

Image Resolution Susceptibility of Face Recognition Models

no code implementations8 Jul 2021 Martin Knoche, Stefan Hörmann, Gerhard Rigoll

In this work, we first analyze the impact of image resolutions on the face verification performance with a state-of-the-art face recognition model.

Face Recognition Face Verification

Attention-based Partial Face Recognition

1 code implementation11 Jun 2021 Stefan Hörmann, Zeyuan Zhang, Martin Knoche, Torben Teepe, Gerhard Rigoll

In this paper, we propose a novel approach to partial face recognition capable of recognizing faces with different occluded areas.

Face Recognition

How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild

1 code implementation ICCV 2021 Okan Köpüklü, Maja Taseska, Gerhard Rigoll

Successful active speaker detection requires a three-stage pipeline: (i) audio-visual encoding for all speakers in the clip, (ii) inter-speaker relation modeling between a reference speaker and the background speakers within each frame, and (iii) temporal modeling for the reference speaker.

Audio-Visual Active Speaker Detection

Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition

no code implementations3 Apr 2021 Lujun Li, Yikai Kang, Yuchen Shi, Ludwig Kürzinger, Tobias Watzel, Gerhard Rigoll

Inspired by the extensive applications of the generative adversarial networks (GANs) in speech enhancement and ASR tasks, we propose an adversarial joint training framework with the self-attention mechanism to boost the noise robustness of the ASR system.

Automatic Speech Recognition Speech Enhancement +1

GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition

2 code implementations27 Jan 2021 Torben Teepe, Ali Khan, Johannes Gilg, Fabian Herzog, Stefan Hörmann, Gerhard Rigoll

However, silhouette images can lose fine-grained spatial information, and most papers do not regard how to obtain these silhouettes in complex scenes.

Multiview Gait Recognition Pose Estimation

Lightweight Multi-Branch Network for Person Re-Identification

1 code implementation26 Jan 2021 Fabian Herzog, Xunbo Ji, Torben Teepe, Stefan Hörmann, Johannes Gilg, Gerhard Rigoll

Person Re-Identification aims to retrieve person identities from images captured by multiple cameras or the same cameras in different time instances and locations.

Person Re-Identification

Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions

no code implementations15 Oct 2020 Ludwig Kürzinger, Nicolas Lindae, Palle Klewitz, Gerhard Rigoll

For this, we propose Lightweight Sinc-Convolutions (LSC) that integrate Sinc-convolutions with depthwise convolutions as a low-parameter machine-learnable feature extraction for end-to-end ASR systems.

Automatic Speech Recognition speech-recognition

Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing

1 code implementation30 Sep 2020 Okan Köpüklü, Stefan Hörmann, Fabian Herzog, Hakan Cevikalp, Gerhard Rigoll

Convolutional Neural Networks with 3D kernels (3D-CNNs) currently achieve state-of-the-art results in video recognition tasks due to their supremacy in extracting spatiotemporal features within video frames.

Action Classification Video Recognition

Driver Anomaly Detection: A Dataset and Contrastive Learning Approach

1 code implementation30 Sep 2020 Okan Köpüklü, Jiapeng Zheng, Hang Xu, Gerhard Rigoll

For this task, we introduce a new video-based benchmark, the Driver Anomaly Detection (DAD) dataset, which contains normal driving videos together with a set of anomalous actions in its training set.

Anomaly Detection Contrastive Learning +1

MP3 Compression To Diminish Adversarial Noise in End-to-End Speech Recognition

1 code implementation25 Jul 2020 Iustina Andronic, Ludwig Kürzinger, Edgar Ricardo Chavez Rosas, Gerhard Rigoll, Bernhard U. Seeber

The present work proposes MP3 compression as a means to decrease the impact of Adversarial Noise (AN) in audio samples transcribed by ASR systems.

Audio and Speech Processing Cryptography and Security Sound

CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition

8 code implementations17 Jul 2020 Ludwig Kürzinger, Dominik Winkelbauer, Lujun Li, Tobias Watzel, Gerhard Rigoll

In this work, we combine freely available corpora for German speech recognition, including yet unlabeled speech data, to a big dataset of over $1700$h of speech data.

Ranked #3 on Speech Recognition on TUDA (using extra training data)

Speech Recognition Audio and Speech Processing

A Multi-Task Comparator Framework for Kinship Verification

no code implementations2 Jun 2020 Stefan Hörmann, Martin Knoche, Gerhard Rigoll

Approaches for kinship verification often rely on cosine distances between face identification features.

Face Identification

DriverMHG: A Multi-Modal Dataset for Dynamic Recognition of Driver Micro Hand Gestures and a Real-Time Recognition Framework

no code implementations2 Mar 2020 Okan Köpüklü, Thomas Ledwon, Yao Rong, Neslihan Kose, Gerhard Rigoll

In this work, we propose an HCI system for dynamic recognition of driver micro hand gestures, which can have a crucial impact in automotive sector especially for safety related issues.

Deep Attention Based Semi-Supervised 2D-Pose Estimation for Surgical Instruments

1 code implementation10 Dec 2019 Mert Kayhan, Okan Köpüklü, Mhd Hasan Sarhan, Mehmet Yigitsoy, Abouzar Eslami, Gerhard Rigoll

To this end, a lightweight network architecture is introduced and mean teacher, virtual adversarial training and pseudo-labeling algorithms are evaluated on 2D-pose estimation for surgical instruments.

Deep Attention Pose Estimation

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

4 code implementations15 Nov 2019 Okan Köpüklü, Xiangyu Wei, Gerhard Rigoll

YOWO is a single-stage architecture with two branches to extract temporal and spatial information concurrently and predict bounding boxes and action probabilities directly from video clips in one evaluation.

Action Recognition In Videos

Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos

1 code implementation arXiv preprint 2019 Okan Köpüklü, Fabian Herzog, Gerhard Rigoll

Understanding actions and gestures in video streams requires temporal reasoning of the spatial content from different time instants, i. e., spatiotemporal (ST) modeling.

Action Recognition Human-Object Interaction Detection

Real-Time Driver State Monitoring Using a CNN Based Spatio-Temporal Approach

no code implementations18 Jul 2019 Neslihan Kose, Okan Kopuklu, Alexander Unnervik, Gerhard Rigoll

Experiments show that our approach outperforms the state-of-the art results on the Distracted Driver Dataset (96. 31%), with an accuracy of 99. 10% for 10-class classification while providing real-time performance.

Action Recognition Autonomous Vehicles +1

On Flow Profile Image for Video Representation

no code implementations12 May 2019 Mohammadreza Babaee, David Full, Gerhard Rigoll

Video representation is a key challenge in many computer vision applications such as video classification, video captioning, and video surveillance.

Activity Recognition Optical Flow Estimation +2

Talking With Your Hands: Scaling Hand Gestures and Recognition With CNNs

no code implementations10 May 2019 Okan Köpüklü, Yao Rong, Gerhard Rigoll

The use of hand gestures provides a natural alternative to cumbersome interface devices for Human-Computer Interaction (HCI) systems.

Resource Efficient 3D Convolutional Neural Networks

2 code implementations4 Apr 2019 Okan Köpüklü, Neslihan Kose, Ahmet Gunduz, Gerhard Rigoll

Recently, convolutional neural networks with 3D kernels (3D CNNs) have been very popular in computer vision community as a result of their superior ability of extracting spatio-temporal features within video frames compared to 2D CNNs.

Action Recognition Transfer Learning

Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

5 code implementations29 Jan 2019 Okan Köpüklü, Ahmet Gunduz, Neslihan Kose, Gerhard Rigoll

We evaluate our architecture on two publicly available datasets - EgoGesture and NVIDIA Dynamic Hand Gesture Datasets - which require temporal detection and classification of the performed hand gestures.

Action Recognition General Classification +2

Convolutional Neural Networks with Layer Reuse

1 code implementation28 Jan 2019 Okan Köpüklü, Maryam Babaee, Stefan Hörmann, Gerhard Rigoll

In this paper, we propose a CNN architecture, Layer Reuse Network (LruNet), where the convolutional layers are used repeatedly without the need of introducing new layers to get a better performance.

Image Classification

Multiple People Tracking Using Hierarchical Deep Tracklet Re-identification

no code implementations9 Nov 2018 Maryam Babaee, Ali Athar, Gerhard Rigoll

To this end, tracklet re-identification is performed by utilizing a novel multi-stage deep network that can jointly reason about the visual appearance and spatio-temporal properties of a pair of tracklets, thereby providing a robust measure of affinity.

Multiple People Tracking Person Re-Identification

Robust Facial Landmark Detection via a Fully-Convolutional Local-Global Context Network

no code implementations CVPR 2018 Daniel Merget, Matthias Rock, Gerhard Rigoll

While fully-convolutional neural networks are very strong at modeling local features, they fail to aggregate global context due to their constrained receptive field.

Facial Landmark Detection

Person Identification from Partial Gait Cycle Using Fully Convolutional Neural Network

no code implementations23 Apr 2018 Maryam Babaee, Linwei Li, Gerhard Rigoll

In gait recognition, normally, gait feature such as Gait Energy Image (GEI) is extracted from one full gait cycle.

Gait Recognition Person Identification

A Deep Convolutional Neural Network for Background Subtraction

no code implementations6 Feb 2017 Mohammadreza Babaee, Duc Tung Dinh, Gerhard Rigoll

In this work, we present a novel background subtraction system that uses a deep Convolutional Neural Network (CNN) to perform the segmentation.

Change Detection Feature Engineering

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems

no code implementations15 Dec 2014 Felix Weninger, Björn Schuller, Florian Eyben, Martin Wöllmer, Gerhard Rigoll

Transcription of broadcast news is an interesting and challenging application for large-vocabulary continuous speech recognition (LVCSR).

speech-recognition Speech Recognition

Acoustic Gait-based Person Identification using Hidden Markov Models

no code implementations11 Jun 2014 Jürgen T. Geiger, Maximilian Kneißl, Björn Schuller, Gerhard Rigoll

The goal of the system is to analyse sounds emitted by walking persons (mostly the step sounds) and identify those persons.

Gait Recognition Person Identification

Cannot find the paper you are looking for? You can Submit a new open access paper.