Search Results for author: Kalin Stefanov

Found 16 papers, 5 papers with code

Analysis of Behavior Classification in Motivational Interviewing

no code implementations • NAACL (CLPsych) 2021 • Leili Tavabi, Trang Tran, Kalin Stefanov, Brian Borsari, Joshua Woolley, Stefan Scherer, Mohammad Soleymani

Analysis of client and therapist behavior in counseling sessions can provide helpful insights for assessing the quality of the session and consequently, the client’s behavioral outcome.

Classification

Paper
Add Code

GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction

1 code implementation • 26 Mar 2024 • Hrishav Bakul Barua, Kalin Stefanov, KokSheik Wong, Abhinav Dhall, Ganesh Krishnasamy

High Dynamic Range (HDR) content (i. e., images and videos) has a broad range of applications.

3D Human Pose Estimation Image Reconstruction +1

Paper
Code

Human Brain Exhibits Distinct Patterns When Listening to Fake Versus Real Audio: Preliminary Evidence

no code implementations • 22 Feb 2024 • Mahsa Salehi, Kalin Stefanov, Ehsan Shareghi

In this paper we study the variations in human brain activity when listening to real and fake audio.

EEG Face Swapping

Paper
Add Code

HistoHDR-Net: Histogram Equalization for Single LDR to HDR Image Translation

no code implementations • 8 Feb 2024 • Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Abhinav Dhall, Kalin Stefanov

High Dynamic Range (HDR) imaging aims to replicate the high visual quality and clarity of real-world scenes.

Image Reconstruction

Paper
Add Code

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

1 code implementation • 26 Nov 2023 • Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Kalin Stefanov

The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets.

2k DeepFake Detection +2

Paper
Code

ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation

no code implementations • 7 Sep 2023 • Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall

High Dynamic Range (HDR) content creation has become an important topic for modern media and entertainment sectors, gaming and Augmented/Virtual Reality industries.

SSIM

Paper
Add Code

S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction

no code implementations • 13 Jul 2023 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi

We address the video prediction task by putting forth a novel model that combines (i) our recently proposed hierarchical residual vector quantized variational autoencoder (HR-VQVAE), and (ii) a novel spatiotemporal PixelCNN (ST-PixelCNN).

Video Prediction

Paper
Add Code

Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

1 code implementation • 3 May 2023 • Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat

The proposed baseline method, Boundary Aware Temporal Forgery Detection (BA-TFD), is a 3D Convolutional Neural Network-based architecture which effectively captures multimodal manipulations.

Ranked #1 on Temporal Forgery Localization on ForgeryNet

Binary Classification DeepFake Detection +2

Paper
Code

MARLIN: Masked Autoencoder for facial video Representation LearnINg

1 code implementation • CVPR 2023 • Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat

This paper proposes a self-supervised approach to learn universal facial representations from videos, that can transfer across a variety of facial analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS).

Ranked #1 on Emotion Classification on CMU-MOSEI

Action Classification Attribute +9

189

Paper
Code

Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation

no code implementations • 9 Aug 2022 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi

We propose a multi-layer variational autoencoder method, we call HR-VQVAE, that learns hierarchical discrete representations of the data.

Image Generation Image Reconstruction

Paper
Add Code

Visual Representations of Physiological Signals for Fake Video Detection

no code implementations • 18 Jul 2022 • Kalin Stefanov, Bhawna Paliwal, Abhinav Dhall

We investigate two strategies for combining the video and physiology modalities, either by augmenting the video with information from the physiology or by novelly learning the fusion of those two modalities with a proposed Graph Convolutional Network architecture.

Misinformation

Paper
Add Code

Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization

1 code implementation • 13 Apr 2022 • Zhixi Cai, Kalin Stefanov, Abhinav Dhall, Munawar Hayat

Our baseline method for benchmarking the proposed dataset is a 3DCNN model, termed as Boundary Aware Temporal Forgery Detection (BA-TFD), which is guided via contrastive, boundary matching, and frame classification loss functions.

Ranked #1 on DeepFake Detection on LAV-DF

Benchmarking DeepFake Detection +1

Paper
Code

Webcam-based Eye Gaze Tracking under Natural Head Movement

no code implementations • 29 Mar 2018 • Kalin Stefanov

Furthermore, we can report that the proposed tracker commits a mean error of (87. 18, 103. 86) pixels in x and y direction, respectively, under natural head movement.

Paper
Add Code

Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition

no code implementations • 24 Nov 2017 • Kalin Stefanov, Jonas Beskow, Giampiero Salvi

Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings.

Language Acquisition

Paper
Add Code

A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction

no code implementations • LREC 2016 • Kalin Stefanov, Jonas Beskow

This papers describes a data collection setup and a newly recorded dataset.

Paper
Add Code

The Tutorbot Corpus --- A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue

no code implementations • LREC 2014 • Maria Koutsombogera, Samer Al Moubayed, Bajibabu Bollepalli, Ahmed Hussen Abdelaziz, Martin Johansson, Jos{\'e} David Aguas Lopes, Jekaterina Novikova, Catharine Oertel, Kalin Stefanov, G{\"u}l Varol

The corpus is targeted and designed towards the development of a dialogue system platform to explore verbal and nonverbal tutoring strategies in multiparty spoken interactions.

Spoken Dialogue Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.