Search Results for author: Okan Köpüklü

Found 15 papers, 10 papers with code

Convolutional Neural Networks with Layer Reuse

1 code implementation • 28 Jan 2019 • Okan Köpüklü, Maryam Babaee, Stefan Hörmann, Gerhard Rigoll

In this paper, we propose a CNN architecture, Layer Reuse Network (LruNet), in which convolutional layers are used repeatedly, improving performance without the need to introduce new layers.

Image Classification
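
Below is a minimal PyTorch sketch of the layer-reuse idea described in the abstract above: one shared convolutional block is applied several times instead of stacking new layers. The block design, the residual connection, and all hyperparameters are illustrative assumptions, not the actual LruNet architecture.

```python
import torch
import torch.nn as nn

class LayerReuseNet(nn.Module):
    """Illustrative layer-reuse CNN: one conv block applied N times with shared weights."""
    def __init__(self, channels=32, num_reuse=4, num_classes=10):
        super().__init__()
        self.stem = nn.Conv2d(3, channels, kernel_size=3, padding=1)
        # A single shared block; reusing it adds depth without adding parameters.
        self.shared_block = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )
        self.num_reuse = num_reuse
        self.head = nn.Linear(channels, num_classes)

    def forward(self, x):
        x = self.stem(x)
        for _ in range(self.num_reuse):     # same weights, applied repeatedly
            x = x + self.shared_block(x)    # residual connection (an assumption, for stability)
        x = x.mean(dim=(2, 3))              # global average pooling
        return self.head(x)

logits = LayerReuseNet()(torch.randn(2, 3, 32, 32))   # -> shape (2, 10)
```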

Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

5 code implementations • 29 Jan 2019 • Okan Köpüklü, Ahmet Gunduz, Neslihan Kose, Gerhard Rigoll

We evaluate our architecture on two publicly available datasets - EgoGesture and NVIDIA Dynamic Hand Gesture Datasets - which require temporal detection and classification of the performed hand gestures.

Action Recognition • General Classification +2

Resource Efficient 3D Convolutional Neural Networks

2 code implementations • 4 Apr 2019 • Okan Köpüklü, Neslihan Kose, Ahmet Gunduz, Gerhard Rigoll

Recently, convolutional neural networks with 3D kernels (3D CNNs) have become very popular in the computer vision community due to their superior ability to extract spatio-temporal features from video frames compared to 2D CNNs.

Action Recognition In Videos • Transfer Learning
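
Since the abstract contrasts 3D kernels with 2D ones, here is a minimal illustration of a 3D convolution sliding over both time and space of a video clip; the tensor shapes and layer sizes are arbitrary examples, not the resource-efficient models studied in the paper.

```python
import torch
import torch.nn as nn

# A video clip batch: (batch, channels, frames, height, width).
clip = torch.randn(2, 3, 16, 112, 112)

# A 3D kernel (3x3x3) convolves over time *and* space, capturing motion cues;
# a 2D kernel only sees each frame independently.
conv3d = nn.Conv3d(in_channels=3, out_channels=64, kernel_size=3, padding=1)
features = conv3d(clip)                     # -> (2, 64, 16, 112, 112)

# Compare: applying a 2D conv frame-by-frame ignores temporal structure.
conv2d = nn.Conv2d(3, 64, kernel_size=3, padding=1)
per_frame = torch.stack([conv2d(clip[:, :, t]) for t in range(clip.shape[2])], dim=2)
print(features.shape, per_frame.shape)
```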

Talking With Your Hands: Scaling Hand Gestures and Recognition With CNNs

no code implementations • 10 May 2019 • Okan Köpüklü, Yao Rong, Gerhard Rigoll

The use of hand gestures provides a natural alternative to cumbersome interface devices for Human-Computer Interaction (HCI) systems.

Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos

1 code implementation • arXiv preprint 2019 • Okan Köpüklü, Fabian Herzog, Gerhard Rigoll

Understanding actions and gestures in video streams requires temporal reasoning of the spatial content from different time instants, i.e., spatiotemporal (ST) modeling.

Action Recognition • Human-Object Interaction Detection
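
As a rough sketch of the general setup such a comparative analysis covers, the snippet below extracts per-frame spatial features with a small 2D CNN and aggregates them with a simple temporal module (an MLP over concatenated frame features). Both modules are stand-ins for illustration, not the specific ST-modeling techniques benchmarked in the paper.

```python
import torch
import torch.nn as nn

class SpatioTemporalModel(nn.Module):
    """Per-frame 2D CNN features + a temporal aggregator (here a simple MLP)."""
    def __init__(self, feat_dim=128, num_frames=8, num_classes=27):
        super().__init__()
        self.frame_encoder = nn.Sequential(          # spatial content per frame
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim),
        )
        self.temporal = nn.Sequential(               # reasoning across time instants
            nn.Linear(num_frames * feat_dim, 256), nn.ReLU(),
            nn.Linear(256, num_classes),
        )

    def forward(self, clip):                         # clip: (B, T, 3, H, W)
        b, t = clip.shape[:2]
        frame_feats = self.frame_encoder(clip.flatten(0, 1))   # (B*T, feat_dim)
        return self.temporal(frame_feats.view(b, -1))          # (B, num_classes)

out = SpatioTemporalModel()(torch.randn(2, 8, 3, 64, 64))       # -> (2, 27)
```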

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

4 code implementations • 15 Nov 2019 • Okan Köpüklü, Xiangyu Wei, Gerhard Rigoll

YOWO is a single-stage architecture with two branches to extract temporal and spatial information concurrently and predict bounding boxes and action probabilities directly from video clips in one evaluation.

Action Detection • Action Recognition +1
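
A schematic sketch of the two-branch idea the abstract describes: a 3D-CNN branch over the clip for temporal context, a 2D-CNN branch over the key frame for spatial detail, a simple channel-wise fusion, and a convolutional head that outputs boxes and action scores per grid cell. The backbones, fusion, and head layout are simplified placeholders rather than the actual YOWO implementation.

```python
import torch
import torch.nn as nn

class TwoBranchDetector(nn.Module):
    def __init__(self, num_actions=24, anchors=5):
        super().__init__()
        # Temporal branch: 3D convolution over the whole clip.
        self.branch_3d = nn.Sequential(
            nn.Conv3d(3, 64, kernel_size=3, stride=(1, 2, 2), padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d((1, 28, 28)),   # collapse time -> (B, 64, 1, 28, 28)
        )
        # Spatial branch: 2D convolution over the key frame only.
        self.branch_2d = nn.Sequential(
            nn.Conv2d(3, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((28, 28)),
        )
        # Head predicts, per grid cell and anchor: 4 box coords + objectness + action scores.
        self.head = nn.Conv2d(128, anchors * (5 + num_actions), kernel_size=1)

    def forward(self, clip):                      # clip: (B, 3, T, H, W)
        key_frame = clip[:, :, -1]                # last frame as the key frame
        f3d = self.branch_3d(clip).squeeze(2)     # (B, 64, 28, 28)
        f2d = self.branch_2d(key_frame)           # (B, 64, 28, 28)
        fused = torch.cat([f3d, f2d], dim=1)      # simple channel concatenation (placeholder fusion)
        return self.head(fused)                   # (B, anchors*(5+num_actions), 28, 28)

pred = TwoBranchDetector()(torch.randn(1, 3, 16, 224, 224))
```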

Deep Attention Based Semi-Supervised 2D-Pose Estimation for Surgical Instruments

1 code implementation • 10 Dec 2019 • Mert Kayhan, Okan Köpüklü, Mhd Hasan Sarhan, Mehmet Yigitsoy, Abouzar Eslami, Gerhard Rigoll

To this end, a lightweight network architecture is introduced, and mean teacher, virtual adversarial training, and pseudo-labeling algorithms are evaluated on 2D-pose estimation for surgical instruments.

2D Pose Estimation • Deep Attention +1
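
The abstract names three semi-supervised techniques; the sketch below shows only the mean-teacher ingredient (an EMA teacher plus a consistency loss on unlabeled frames). The toy model, EMA decay, and loss choice are illustrative assumptions, not the paper's exact training recipe.

```python
import copy
import torch
import torch.nn.functional as F

def ema_update(teacher, student, decay=0.99):
    """Teacher weights are an exponential moving average of the student's."""
    with torch.no_grad():
        for t_p, s_p in zip(teacher.parameters(), student.parameters()):
            t_p.mul_(decay).add_(s_p, alpha=1 - decay)

# Any heatmap-regression network could stand in for the pose estimator here.
student = torch.nn.Conv2d(3, 5, kernel_size=1)   # 5 keypoint heatmaps (toy model)
teacher = copy.deepcopy(student)

unlabeled = torch.randn(4, 3, 64, 64)
noisy_view = unlabeled + 0.1 * torch.randn_like(unlabeled)   # perturbed input

# Consistency loss: the student's prediction on a perturbed view
# should match the (detached) teacher prediction on the clean view.
consistency = F.mse_loss(student(noisy_view), teacher(unlabeled).detach())
consistency.backward()
ema_update(teacher, student)
```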

DriverMHG: A Multi-Modal Dataset for Dynamic Recognition of Driver Micro Hand Gestures and a Real-Time Recognition Framework

no code implementations • 2 Mar 2020 • Okan Köpüklü, Thomas Ledwon, Yao Rong, Neslihan Kose, Gerhard Rigoll

In this work, we propose an HCI system for dynamic recognition of driver micro hand gestures, which can have a crucial impact in the automotive sector, especially for safety-related issues.

Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing

1 code implementation • 30 Sep 2020 • Okan Köpüklü, Stefan Hörmann, Fabian Herzog, Hakan Cevikalp, Gerhard Rigoll

Convolutional Neural Networks with 3D kernels (3D-CNNs) currently achieve state-of-the-art results in video recognition tasks due to their superior ability to extract spatiotemporal features from video frames.

Action Classification • Video Recognition

Driver Anomaly Detection: A Dataset and Contrastive Learning Approach

1 code implementation • 30 Sep 2020 • Okan Köpüklü, Jiapeng Zheng, Hang Xu, Gerhard Rigoll

For this task, we introduce a new video-based benchmark, the Driver Anomaly Detection (DAD) dataset, which contains normal driving videos together with a set of anomalous actions in its training set.

Anomaly Detection • Contrastive Learning +1
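
As a generic illustration of the contrastive-learning angle in the title, the snippet below computes an NT-Xent-style loss between embeddings of two views; it is not necessarily the formulation the paper uses to separate normal driving from anomalous actions.

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.1):
    """Generic contrastive loss: matching views are positives, everything else negatives."""
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)        # (2N, D), unit norm
    sim = z @ z.t() / temperature                             # scaled cosine similarities
    n = z1.shape[0]
    sim.masked_fill_(torch.eye(2 * n, dtype=torch.bool), float('-inf'))  # drop self-similarity
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])    # index of each positive
    return F.cross_entropy(sim, targets)

# Toy embeddings of two augmented views of the same clips.
z_a, z_b = torch.randn(8, 128), torch.randn(8, 128)
print(nt_xent_loss(z_a, z_b).item())
```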

TRAT: Tracking by Attention Using Spatio-Temporal Features

no code implementations • 18 Nov 2020 • Hasan Saribas, Hakan Cevikalp, Okan Köpüklü, Bedirhan Uzun

Although motion provides distinctive and complementary information, especially for fast-moving objects, most recent tracking architectures focus primarily on the objects' appearance information.

Object Tracking

Deep Compact Polyhedral Conic Classifier for Open and Closed Set Recognition

no code implementations • 24 Feb 2021 • Hakan Cevikalp, Bedirhan Uzun, Okan Köpüklü, Gurkan Ozturk

In this paper, we propose a new deep neural network classifier that simultaneously maximizes the inter-class separation and minimizes the intra-class variation by using the polyhedral conic classification function.

Anomaly Detection • General Classification +1
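
For reference, the polyhedral conic function family that this line of work builds on (introduced in earlier polyhedral conic classifier papers by Cevikalp and colleagues) is commonly written as below, with weight vector w, vertex s, l1-term coefficient gamma, and offset b; whether the deep classifier uses exactly this parameterization is an assumption here.

```latex
f_{\mathbf{w},\gamma,\mathbf{s},b}(\mathbf{x})
  = \mathbf{w}^{\top}(\mathbf{x}-\mathbf{s})
  + \gamma\,\lVert \mathbf{x}-\mathbf{s} \rVert_{1}
  - b,
\qquad
\text{with the positive-class region typically taken as } \{\mathbf{x} : f(\mathbf{x}) \le 0\}.
```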

How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild

1 code implementation • ICCV 2021 • Okan Köpüklü, Maja Taseska, Gerhard Rigoll

Successful active speaker detection requires a three-stage pipeline: (i) audio-visual encoding for all speakers in the clip, (ii) inter-speaker relation modeling between a reference speaker and the background speakers within each frame, and (iii) temporal modeling for the reference speaker.

Audio-Visual Active Speaker Detection
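
A schematic PyTorch sketch of the three-stage pipeline the abstract spells out: (i) audio-visual encoding per speaker, (ii) inter-speaker relation modeling between the reference speaker and the background speakers in each frame, (iii) temporal modeling for the reference speaker. All module choices (the linear encoder, attention-based relation modeling, the GRU) are placeholders showing the data flow, not the architecture from the paper.

```python
import torch
import torch.nn as nn

class ThreeStageASD(nn.Module):
    def __init__(self, dim=128):
        super().__init__()
        self.av_encoder = nn.Linear(512 + 512, dim)          # (i) fuse audio + face features
        self.relation = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)  # (ii)
        self.temporal = nn.GRU(dim, dim, batch_first=True)   # (iii) model across frames
        self.classifier = nn.Linear(dim, 1)                  # speaking / not speaking

    def forward(self, audio_feats, face_feats):
        # audio_feats, face_feats: (frames, speakers, 512) precomputed per-speaker features
        x = self.av_encoder(torch.cat([audio_feats, face_feats], dim=-1))  # (T, S, dim)
        # Stage (ii): reference speaker (index 0) attends to all speakers within each frame.
        ref = x[:, :1]                                         # (T, 1, dim)
        ref, _ = self.relation(ref, x, x)                      # (T, 1, dim)
        # Stage (iii): temporal modeling of the reference speaker across frames.
        out, _ = self.temporal(ref.transpose(0, 1))            # (1, T, dim)
        return torch.sigmoid(self.classifier(out)).squeeze(0)  # (T, 1) speaking prob per frame

probs = ThreeStageASD()(torch.randn(25, 3, 512), torch.randn(25, 3, 512))
```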
