Search Results for author: Abhinav Dhall

Found 41 papers, 18 papers with code

Generation and Detection of Sign Language Deepfakes - A Linguistic and Visual Analysis

no code implementations • 1 Apr 2024 • Shahzeb Naeem, Muhammad Riyyan Khan, Usman Tariq, Abhinav Dhall, Carlos Ivan Colon, Hasan Al-Nashash

A question in the realm of deepfakes is slowly emerging pertaining to whether we can go beyond facial deepfakes and whether it would be beneficial to society.

Face Swapping

DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

no code implementations • 1 Jan 2024 • Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do

The task of Visual Relationship Recognition (VRR) aims to identify relationships between two interacting objects in an image and is particularly challenging due to the widely-spread and highly imbalanced distribution of <subject, relation, object> triplets.

Object Relation

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

1 code implementation • 26 Nov 2023 • Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Kalin Stefanov

The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets.

2k DeepFake Detection +2

MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings

1 code implementation • 19 Sep 2023 • Surbhi Madan, Rishabh Jain, Gulshan Sharma, Ramanathan Subramanian, Abhinav Dhall

Bodily behavioral language is an important social cue, and its automated analysis helps in enhancing the understanding of artificial intelligence systems.

Pose Estimation

ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation

no code implementations • 7 Sep 2023 • Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall

High Dynamic Range (HDR) content creation has become an important topic for modern media and entertainment sectors, gaming and Augmented/Virtual Reality industries.

SSIM

Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase

no code implementations • 10 May 2023 • Shreya Ghosh, Rakibul Hasan, Pradyumna Agrawal, Zhixi Cai, Susannah Soon, Abhinav Dhall, Tom Gedeon

To this end, we design a user interface to generate an automatic feedback mechanism that integrates Pavlok and a deep learning based model to detect certain behaviours via an integrated user interface, i.e., a mobile or desktop application.

Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

1 code implementation • 3 May 2023 • Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat

The proposed baseline method, Boundary Aware Temporal Forgery Detection (BA-TFD), is a 3D Convolutional Neural Network-based architecture which effectively captures multimodal manipulations.

Binary Classification, DeepFake Detection, +2

Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss

no code implementations • 20 Nov 2022 • Shailza Sharma, Abhinav Dhall, Vinay Kumar, Vivek Singh Bawa

In order to learn these fine spatio-temporal motion details, we propose a novel cross-modal audio-visual Video Face Hallucination Generative Adversarial Network (VFH-GAN).

Face Hallucination, Generative Adversarial Network, +2

MARLIN: Masked Autoencoder for facial video Representation LearnINg

1 code implementation • CVPR 2023 • Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat

This paper proposes a self-supervised approach to learn universal facial representations from videos, that can transfer across a variety of facial analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS).

Action Classification, Attribute, +9

RAZE: Region Guided Self-Supervised Gaze Representation Learning

no code implementations • 4 Aug 2022 • Neeru Dubey, Shreya Ghosh, Abhinav Dhall

Automatic eye gaze estimation is an important problem in vision based assistive technology with use cases in different emerging topics such as augmented reality, virtual reality and human-computer interaction.

Gaze Estimation, Representation Learning, +1

'Labelling the Gaps': A Weakly Supervised Automatic Eye Gaze Estimation

1 code implementation • 3 Aug 2022 • Shreya Ghosh, Abhinav Dhall, Jarrod Knibbe, Munawar Hayat

Our proposed method reduces the annotation effort to as low as 2.67%, with minimal impact on performance, indicating the potential of our model to enable gaze estimation in an 'in-the-wild' setup.

Gaze Estimation

Visual Representations of Physiological Signals for Fake Video Detection

no code implementations • 18 Jul 2022 • Kalin Stefanov, Bhawna Paliwal, Abhinav Dhall

We investigate two strategies for combining the video and physiology modalities, either by augmenting the video with information from the physiology or by novelly learning the fusion of those two modalities with a proposed Graph Convolutional Network architecture.

Misinformation

AV-Gaze: A Study on the Effectiveness of Audio Guided Visual Attention Estimation for Non-Profilic Faces

1 code implementation • 7 Jul 2022 • Shreya Ghosh, Abhinav Dhall, Munawar Hayat, Jarrod Knibbe

In challenging real-life conditions such as extreme head-pose, occlusions, and low-resolution images where the visual information fails to estimate visual attention/gaze direction, audio signals could provide important and complementary information.

Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization

1 code implementation • 13 Apr 2022 • Zhixi Cai, Kalin Stefanov, Abhinav Dhall, Munawar Hayat

Our baseline method for benchmarking the proposed dataset is a 3DCNN model, termed as Boundary Aware Temporal Forgery Detection (BA-TFD), which is guided via contrastive, boundary matching, and frame classification loss functions.

Benchmarking, DeepFake Detection, +1

MTGLS: Multi-Task Gaze Estimation with Limited Supervision

no code implementations • 23 Oct 2021 • Shreya Ghosh, Munawar Hayat, Abhinav Dhall, Jarrod Knibbe

Our proposed framework outperforms the unsupervised state-of-the-art on CAVE (by 6.43%) and even supervised state-of-the-art methods on Gaze360 (by 6.59%) datasets.

Gaze Estimation

Frequency Aware Face Hallucination Generative Adversarial Network with Semantic Structural Constraint

no code implementations • 5 Oct 2021 • Shailza Sharma, Abhinav Dhall, Vinay Kumar

To explicitly encode the high frequency components, an auto encoder is proposed to generate high resolution coefficients of Discrete Cosine Transform (DCT).

Face Hallucination, Generative Adversarial Network, +1

Automatic Gaze Analysis: A Survey of Deep Learning based Approaches

1 code implementation • 12 Aug 2021 • Shreya Ghosh, Abhinav Dhall, Munawar Hayat, Jarrod Knibbe, Qiang Ji

Eye gaze analysis is an important research problem in the field of Computer Vision and Human-Computer Interaction.

Gaze Estimation

Self-Supervised Approach for Facial Movement Based Optical Flow

no code implementations • 4 May 2021 • Muhannad Alkaddour, Usman Tariq, Abhinav Dhall

The aim of this work is threefold: (1) exploring self-supervised techniques to generate optical flow ground truth for face images; (2) computing baseline results on the effects of using face data to train Convolutional Neural Networks (CNN) for predicting optical flow; and (3) using the learned optical flow in micro-expression recognition to demonstrate its effectiveness.

Micro Expression Recognition, Micro-Expression Recognition, +1

FakeBuster: A DeepFakes Detection Tool for Video Conferencing Scenarios

no code implementations • 9 Jan 2021 • Vineet Mehta, Parul Gupta, Ramanathan Subramanian, Abhinav Dhall

This paper proposes a new DeepFake detector FakeBuster for detecting impostors during video conferencing and manipulated faces on social media.

Face Swapping

Hyperrealistic Image Inpainting with Hypergraphs

1 code implementation • 5 Nov 2020 • Gourav Wadhwa, Abhinav Dhall, Subrahmanyam Murala, Usman Tariq

In this paper, we introduce hypergraph convolution on spatial features to learn the complex relationship among the data.

Image Inpainting

The eyes know it: FakeET -- An Eye-tracking Database to Understand Deepfake Perception

no code implementations • 12 Jun 2020 • Parul Gupta, Komal Chugh, Abhinav Dhall, Ramanathan Subramanian

We present FakeET, an eye-tracking database to understand human visual perception of deepfake videos.

EEG, Face Swapping

Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset

no code implementations • 13 Apr 2020 • Shreya Ghosh, Abhinav Dhall, Garima Sharma, Sarthak Gupta, Nicu Sebe

In this paper, a fully automatic technique for labelling an image based gaze behavior dataset for driver gaze zone estimation is proposed.

Gaze Prediction

Predicting Group Cohesiveness in Images

no code implementations • 31 Dec 2018 • Shreya Ghosh, Abhinav Dhall, Nicu Sebe, Tom Gedeon

We study the factors that influence the perception of group-level cohesion and propose methods for estimating the human-perceived cohesion on the group cohesiveness scale.

Attribute

Clustering and Learning from Imbalanced Data

no code implementations • 2 Nov 2018 • Naman D. Singh, Abhinav Dhall

A learning classifier must outperform a trivial solution; in the case of imbalanced data, this condition usually does not hold.

Clustering

DIF : Dataset of Perceived Intoxicated Faces for Drunk Person Identification

no code implementations • 25 May 2018 • Vineet Mehta, Devendra Pratap Yadav, Sai Srinadhu Katta, Abhinav Dhall

Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) are trained for computing the video and audio baselines, respectively.

Person Identification

Prediction and Localization of Student Engagement in the Wild

1 code implementation • 3 Apr 2018 • Amanjot Kaur, Aamir Mustafa, Love Mehta, Abhinav Dhall

Recognizing the lack of any publicly available dataset in the domain of user engagement, a new 'in the wild' dataset is created to study the subject engagement problem.

Multiple Instance Learning, Weakly-supervised Learning

Continuous Multimodal Emotion Recognition Approach for AVEC 2017

no code implementations • 18 Sep 2017 • Narotam Singh, Nittin Singh, Abhinav Dhall

This paper reports the analysis of audio and visual features in predicting the continuous emotion dimensions under the seventh Audio/Visual Emotion Challenge (AVEC 2017), which was done as part of a B.Tech.

Multimodal Emotion Recognition

Depression Scale Recognition from Audio, Visual and Text Analysis

2 code implementations • 18 Sep 2017 • Shubham Dham, Anirudh Sharma, Abhinav Dhall

The results obtained were able to cross the provided baseline on validation data set by 17% on audio features and 24.5% on video features.

Clustering

Occlusion-Aware Human Pose Estimation with Mixtures of Sub-Trees

no code implementations • 3 Dec 2015 • Ibrahim Radwan, Abhinav Dhall, Roland Goecke

The proposed method handles occlusions during the inference process by identifying overlapping regions between different sub-trees and introducing a penalty term for overlapping parts.

Pose Estimation
