Search Results for author: Mahyar Khayatkhoei

Found 12 papers, 8 papers with code

ManiFPT: Defining and Analyzing Fingerprints of Generative Models

no code implementations16 Feb 2024 Hae Jin Song, Mahyar Khayatkhoei, Wael AbdAlmageed

Recent works have shown that generative models leave traces of their underlying generative process on the generated samples, broadly referred to as fingerprints of a generative model, and have studied their utility in detecting synthetic images from real ones.

Exploring Perceptual Limitation of Multimodal Large Language Models

1 code implementation12 Feb 2024 Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sun

Multimodal Large Language Models (MLLMs) have recently shown remarkable perceptual capability in answering visual questions, however, little is known about the limits of their perception.

Object Question Answering

Unsupervised Multimodal Deepfake Detection Using Intra- and Cross-Modal Inconsistencies

no code implementations28 Nov 2023 Mulin Tian, Mahyar Khayatkhoei, Joe Mathai, Wael AbdAlmageed

Deepfake videos present an increasing threat to society with potentially negative impact on criminal justice, democracy, and personal safety and privacy.

DeepFake Detection Face Swapping

SABAF: Removing Strong Attribute Bias from Neural Networks with Adversarial Filtering

1 code implementation13 Nov 2023 Jiazhi Li, Mahyar Khayatkhoei, Jiageng Zhu, Hanchen Xie, Mohamed E. Hussein, Wael AbdAlmageed

To that end, in this work, we mathematically and empirically reveal the limitation of existing attribute bias removal methods in presence of strong bias and propose a new method that can mitigate this limitation.

Attribute

Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs

2 code implementations24 Oct 2023 Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

In particular, we show that their zero-shot accuracy in answering visual questions is very sensitive to the size of the visual subject of the question, declining up to 46% with size.

Question Answering Visual Question Answering

Information-Theoretic Bounds on The Removal of Attribute-Specific Bias From Neural Networks

1 code implementation8 Oct 2023 Jiazhi Li, Mahyar Khayatkhoei, Jiageng Zhu, Hanchen Xie, Mohamed E. Hussein, Wael AbdAlmageed

Ensuring a neural network is not relying on protected attributes (e. g., race, sex, age) for predictions is crucial in advancing fair and trustworthy AI.

Attribute

Emergent Asymmetry of Precision and Recall for Measuring Fidelity and Diversity of Generative Models in High Dimensions

1 code implementation16 Jun 2023 Mahyar Khayatkhoei, Wael AbdAlmageed

Precision and Recall are two prominent metrics of generative performance, which were proposed to separately measure the fidelity and diversity of generative models.

Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models

no code implementations31 May 2023 Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

As our initial analysis of BLIP-family models revealed difficulty with answering fine-detail questions, we investigate the following question: Can visual cropping be employed to improve the performance of state-of-the-art visual question answering models on fine-detail questions?

Question Answering Visual Question Answering

A Critical View of Vision-Based Long-Term Dynamics Prediction Under Environment Misalignment

1 code implementation12 May 2023 Hanchen Xie, Jiageng Zhu, Mahyar Khayatkhoei, Jiazhi Li, Mohamed E. Hussein, Wael AbdAlmageed

In this paper, we investigate two challenging conditions for environment misalignment: Cross-Domain and Cross-Context by proposing four datasets that are designed for these challenges: SimB-Border, SimB-Split, BlenB-Border, and BlenB-Split.

Region Proposal

Spatial Frequency Bias in Convolutional Generative Adversarial Networks

no code implementations4 Oct 2020 Mahyar Khayatkhoei, Ahmed Elgammal

As the success of Generative Adversarial Networks (GANs) on natural images quickly propels them into various real-life applications across different domains, it becomes more and more important to clearly understand their limitations.

Denoising Super-Resolution

Disconnected Manifold Learning for Generative Adversarial Networks

1 code implementation NeurIPS 2018 Mahyar Khayatkhoei, Ahmed Elgammal, Maneesh Singh

Natural images may lie on a union of disjoint manifolds rather than one globally connected manifold, and this can cause several difficulties for the training of common Generative Adversarial Networks (GANs).

Cannot find the paper you are looking for? You can Submit a new open access paper.