Search Results for author: Saman Motamed

Found 12 papers, 4 papers with code

Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models

no code implementations8 Apr 2024 Saman Motamed, Wouter Van Gansbeke, Luc van Gool

With recent advances in image and video diffusion models for content creation, a plethora of techniques have been proposed for customizing their generated content.

Video Editing

A Unified and Interpretable Emotion Representation and Expression Generation

no code implementations1 Apr 2024 Reni Paskaleva, Mykyta Holubakha, Andela Ilic, Saman Motamed, Luc van Gool, Danda Paudel

However, emotions are often compound, e. g. happily surprised, and can be mapped to the action units (AUs) used for expressing emotions, and trivially to the canonical ones.

D3GU: Multi-Target Active Domain Adaptation via Enhancing Domain Alignment

1 code implementation10 Jan 2024 Lin Zhang, Linghan Xu, Saman Motamed, Shayok Chakraborty, Fernando de la Torre

Unsupervised domain adaptation (UDA) for image classification has made remarkable progress in transferring classification knowledge from a labeled source domain to an unlabeled target domain, thanks to effective domain alignment techniques.

Classification Image Classification +1

Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention

no code implementations6 Dec 2023 Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Haene, Jean-Charles Bazin, Fernando de la Torre

Specifically, we insert parallel attention matrices to each cross-attention module in the denoising network, which attends to features extracted from reference images by an identity encoder.

Denoising Facial Inpainting

Lego: Learning to Disentangle and Invert Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

no code implementations23 Nov 2023 Saman Motamed, Danda Pani Paudel, Luc van Gool

To enable customized content creation based on a few example images of a concept, methods such as Textual Inversion and DreamBooth invert the desired concept and enable synthesizing it in new scenes.

Language Modelling Large Language Model +3

PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face Inpainting

2 code implementations ICCV 2023 Saman Motamed, Jianjin Xu, Chen Henry Wu, Fernando de la Torre

By using ~40 reference images, PATMAT creates anchor points in MAT's style module, and tunes the model using the fixed anchors to adapt the model to a new face identity.

Facial Inpainting

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

1 code implementation14 Sep 2022 Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando de la Torre

Our experiments demonstrate how PromptGen can efficiently sample from several unconditional generative models (e. g., StyleGAN2, StyleNeRF, diffusion autoencoder, NVAE) in a controlled or/and de-biased manner using various off-the-shelf models: (1) with the CLIP model as control, PromptGen can sample images guided by text, (2) with image classifiers as control, PromptGen can de-bias generative models across a set of attributes or attribute combinations, and (3) with inverse graphics models as control, PromptGen can sample images of the same identity in different poses.

Attribute

Vanishing Twin GAN: How training a weak Generative Adversarial Network can improve semi-supervised image classification

no code implementations3 Mar 2021 Saman Motamed, Farzad Khalvati

By training a weak GAN and using its generated output image parallel to the regular GAN, the Vanishing Twin training improves semi-supervised image classification where image similarity can hurt classification tasks.

Classification General Classification +2

Multi-class Generative Adversarial Nets for Semi-supervised Image Classification

no code implementations13 Feb 2021 Saman Motamed, Farzad Khalvati

We propose a modification to the traditional training of GANs that allows for improved multi-class classification in similar classes of images in a semi-supervised learning framework.

Classification Domain Adaptation +3

RANDGAN: Randomized Generative Adversarial Network for Detection of COVID-19 in Chest X-ray

1 code implementation6 Oct 2020 Saman Motamed, Patrik Rogalla, Farzad Khalvati

Gathering labeled data is a cumbersome task and requires time and resources which could further strain health care systems and radiologists at the early stages of a pandemic such as COVID-19.

Anomaly Detection COVID-19 Diagnosis +3

A Transfer Learning Approach for Automated Segmentation of Prostate Whole Gland and Transition Zone in Diffusion Weighted MRI

no code implementations20 Sep 2019 Saman Motamed, Isha Gujrathi, Dominik Deniffel, Anton Oentoro, Masoom A. Haider, Farzad Khalvati

Using a fine-tuning data of 115 patients from the target domain, dice score coefficient of 0. 85 and 0. 84 are achieved for segmentation of whole gland and transition zone, respectively, in the target domain.

Segmentation Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.