Search Results for author: Ron Mokady

Found 13 papers, 12 papers with code

Null-text Inversion for Editing Real Images using Guided Diffusion Models

4 code implementations CVPR 2023 Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

Our Null-text inversion, based on the publicly available Stable Diffusion model, is extensively evaluated on a variety of images and prompt editing, showing high-fidelity editing of real images.

Image Generation Text-based Image Editing

Text-Only Training for Image Captioning using Noise-Injected CLIP

2 code implementations1 Nov 2022 David Nukrai, Ron Mokady, Amir Globerson

We consider the task of image-captioning using only the CLIP model and additional text data at training time, and no additional captioned images.

Language Modelling Semi Supervised Learning for Image Captioning

Prompt-to-Prompt Image Editing with Cross Attention Control

7 code implementations2 Aug 2022 Amir Hertz, Ron Mokady, Jay Tenenbaum, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

Editing is challenging for these generative models, since an innate property of an editing technique is to preserve most of the original image, while in the text-based models, even a small modification of the text prompt often leads to a completely different outcome.

Image Generation Text-based Image Editing

State-of-the-Art in the Architecture, Methods and Applications of StyleGAN

no code implementations28 Feb 2022 Amit H. Bermano, Rinon Gal, Yuval Alaluf, Ron Mokady, Yotam Nitzan, Omer Tov, Or Patashnik, Daniel Cohen-Or

Of these, StyleGAN offers a fascinating case study, owing to its remarkable visual quality and an ability to support a large array of downstream tasks.

Image Generation

Self-Distilled StyleGAN: Towards Generation from Internet Photos

2 code implementations24 Feb 2022 Ron Mokady, Michal Yarom, Omer Tov, Oran Lang, Daniel Cohen-Or, Tali Dekel, Michal Irani, Inbar Mosseri

To meet these challenges, we proposed a StyleGAN-based self-distillation approach, which consists of two main components: (i) A generative-based self-filtering of the dataset to eliminate outlier images, in order to generate an adequate training set, and (ii) Perceptual clustering of the generated images to detect the inherent data modalities, which are then employed to improve StyleGAN's "truncation trick" in the image synthesis process.

Image Generation

Stitch it in Time: GAN-Based Facial Editing of Real Videos

1 code implementation20 Jan 2022 Rotem Tzaban, Ron Mokady, Rinon Gal, Amit H. Bermano, Daniel Cohen-Or

The ability of Generative Adversarial Networks to encode rich semantics within their latent space has been widely adopted for facial image editing.

Facial Editing

ClipCap: CLIP Prefix for Image Captioning

4 code implementations18 Nov 2021 Ron Mokady, Amir Hertz, Amit H. Bermano

Image captioning is a fundamental task in vision-language understanding, where the model predicts a textual informative caption to a given input image.

Image Captioning Language Modelling

JOKR: Joint Keypoint Representation for Unsupervised Cross-Domain Motion Retargeting

1 code implementation17 Jun 2021 Ron Mokady, Rotem Tzaban, Sagie Benaim, Amit H. Bermano, Daniel Cohen-Or

To alleviate this problem, we introduce JOKR - a JOint Keypoint Representation that captures the motion common to both the source and target videos, without requiring any object prior or data collection.

Disentanglement motion retargeting

Pivotal Tuning for Latent-based Editing of Real Images

3 code implementations10 Jun 2021 Daniel Roich, Ron Mokady, Amit H. Bermano, Daniel Cohen-Or

The key idea is pivotal tuning - a brief training process that preserves the editing quality of an in-domain latent region, while changing its portrayed identity and appearance.

Facial Editing Image Manipulation

Masked Based Unsupervised Content Transfer

1 code implementation ICLR 2020 Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

We consider the problem of translating, in an unsupervised manner, between two domains where one contains some additional information compared to the other.

Translation Weakly supervised Semantic Segmentation +1

Structural-analogy from a Single Image Pair

1 code implementation5 Apr 2020 Sagie Benaim, Ron Mokady, Amit Bermano, Daniel Cohen-Or, Lior Wolf

In this paper, we explore the capabilities of neural networks to understand image structure given only a single pair of images, A and B.

Translation Unsupervised Image-To-Image Translation

Mask Based Unsupervised Content Transfer

1 code implementation15 Jun 2019 Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

We consider the problem of translating, in an unsupervised manner, between two domains where one contains some additional information compared to the other.

Translation Weakly supervised Semantic Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.