Search Results for author: Ron Mokady

Found 13 papers, 12 papers with code

Null-text Inversion for Editing Real Images using Guided Diffusion Models

4 code implementations • CVPR 2023 • Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

Our Null-text inversion, based on the publicly available Stable Diffusion model, is extensively evaluated on a variety of images and prompt editing, showing high-fidelity editing of real images.

Ranked #4 on Text-based Image Editing on PIE-Bench

Image Generation Text-based Image Editing

2,830

Paper
Code

Text-Only Training for Image Captioning using Noise-Injected CLIP

2 code implementations • 1 Nov 2022 • David Nukrai, Ron Mokady, Amir Globerson

We consider the task of image-captioning using only the CLIP model and additional text data at training time, and no additional captioned images.

Ranked #1 on Image Captioning on MSCOCO

Language Modelling Semi Supervised Learning for Image Captioning

167

Paper
Code

Prompt-to-Prompt Image Editing with Cross Attention Control

7 code implementations • 2 Aug 2022 • Amir Hertz, Ron Mokady, Jay Tenenbaum, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

Editing is challenging for these generative models, since an innate property of an editing technique is to preserve most of the original image, while in the text-based models, even a small modification of the text prompt often leads to a completely different outcome.

Ranked #14 on Text-based Image Editing on PIE-Bench

Image Generation Text-based Image Editing

2,830

Paper
Code

State-of-the-Art in the Architecture, Methods and Applications of StyleGAN

no code implementations • 28 Feb 2022 • Amit H. Bermano, Rinon Gal, Yuval Alaluf, Ron Mokady, Yotam Nitzan, Omer Tov, Or Patashnik, Daniel Cohen-Or

Of these, StyleGAN offers a fascinating case study, owing to its remarkable visual quality and an ability to support a large array of downstream tasks.

Image Generation

Paper
Add Code

Self-Distilled StyleGAN: Towards Generation from Internet Photos

2 code implementations • 24 Feb 2022 • Ron Mokady, Michal Yarom, Omer Tov, Oran Lang, Daniel Cohen-Or, Tali Dekel, Michal Irani, Inbar Mosseri

To meet these challenges, we proposed a StyleGAN-based self-distillation approach, which consists of two main components: (i) A generative-based self-filtering of the dataset to eliminate outlier images, in order to generate an adequate training set, and (ii) Perceptual clustering of the generated images to detect the inherent data modalities, which are then employed to improve StyleGAN's "truncation trick" in the image synthesis process.

Image Generation

276

Paper
Code

Stitch it in Time: GAN-Based Facial Editing of Real Videos

1 code implementation • 20 Jan 2022 • Rotem Tzaban, Ron Mokady, Rinon Gal, Amit H. Bermano, Daniel Cohen-Or

The ability of Generative Adversarial Networks to encode rich semantics within their latent space has been widely adopted for facial image editing.

Facial Editing

1,195

Paper
Code

HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing

1 code implementation • CVPR 2022 • Yuval Alaluf, Omer Tov, Ron Mokady, Rinon Gal, Amit H. Bermano

In this work, we introduce this approach into the realm of encoder-based inversion.

992

Paper
Code

ClipCap: CLIP Prefix for Image Captioning

4 code implementations • 18 Nov 2021 • Ron Mokady, Amir Hertz, Amit H. Bermano

Image captioning is a fundamental task in vision-language understanding, where the model predicts a textual informative caption to a given input image.

Ranked #1 on Image Captioning on Conceptual Captions

Image Captioning Language Modelling

1,196

Paper
Code

JOKR: Joint Keypoint Representation for Unsupervised Cross-Domain Motion Retargeting

1 code implementation • 17 Jun 2021 • Ron Mokady, Rotem Tzaban, Sagie Benaim, Amit H. Bermano, Daniel Cohen-Or

To alleviate this problem, we introduce JOKR - a JOint Keypoint Representation that captures the motion common to both the source and target videos, without requiring any object prior or data collection.

Disentanglement motion retargeting

Paper
Code

Pivotal Tuning for Latent-based Editing of Real Images

3 code implementations • 10 Jun 2021 • Daniel Roich, Ron Mokady, Amit H. Bermano, Daniel Cohen-Or

The key idea is pivotal tuning - a brief training process that preserves the editing quality of an in-domain latent region, while changing its portrayed identity and appearance.

Facial Editing Image Manipulation

6,567

Paper
Code

Masked Based Unsupervised Content Transfer

1 code implementation • ICLR 2020 • Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

We consider the problem of translating, in an unsupervised manner, between two domains where one contains some additional information compared to the other.

Translation Weakly supervised Semantic Segmentation +1

Paper
Code

Structural-analogy from a Single Image Pair

1 code implementation • 5 Apr 2020 • Sagie Benaim, Ron Mokady, Amit Bermano, Daniel Cohen-Or, Lior Wolf

In this paper, we explore the capabilities of neural networks to understand image structure given only a single pair of images, A and B.

Translation Unsupervised Image-To-Image Translation

100

Paper
Code

Mask Based Unsupervised Content Transfer

1 code implementation • 15 Jun 2019 • Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

We consider the problem of translating, in an unsupervised manner, between two domains where one contains some additional information compared to the other.

Translation Weakly supervised Semantic Segmentation +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.