Search Results for author: Amit H. Bermano

Found 13 papers, 7 papers with code

Unaligned Supervision For Automatic Music Transcription in The Wild

no code implementations 28 Apr 2022 Ben Maman, Amit H. Bermano

In order to overcome data collection barriers, previous AMT approaches attempt to employ musical scores in the form of a digitized version of the same song or piece.

Information Retrieval, Music Information Retrieval, +1

MotionCLIP: Exposing Human Motion Generation to CLIP Space

1 code implementation 15 Mar 2022 Guy Tevet, Brian Gordon, Amir Hertz, Amit H. Bermano, Daniel Cohen-Or

MotionCLIP gains its unique power by aligning its latent space with that of the Contrastive Language-Image Pre-training (CLIP) model.

Disentanglement, Motion Interpolation

State-of-the-Art in the Architecture, Methods and Applications of StyleGAN

no code implementations 28 Feb 2022 Amit H. Bermano, Rinon Gal, Yuval Alaluf, Ron Mokady, Yotam Nitzan, Omer Tov, Or Patashnik, Daniel Cohen-Or

Of these, StyleGAN offers a fascinating case study, owing to its remarkable visual quality and an ability to support a large array of downstream tasks.

Image Generation

Self-Conditioned Generative Adversarial Networks for Image Editing

no code implementations 8 Feb 2022 Yunzhe Liu, Rinon Gal, Amit H. Bermano, Baoquan Chen, Daniel Cohen-Or

We compare our models to a wide range of latent editing methods, and show that, by alleviating the bias, they achieve finer semantic control and better identity preservation through a wider range of transformations.

Fairness

Stitch it in Time: GAN-Based Facial Editing of Real Videos

1 code implementation 20 Jan 2022 Rotem Tzaban, Ron Mokady, Rinon Gal, Amit H. Bermano, Daniel Cohen-Or

The ability of Generative Adversarial Networks to encode rich semantics within their latent space has been widely adopted for facial image editing.

Facial Editing

Leveraging in-domain supervision for unsupervised image-to-image translation tasks via multi-stream generators

no code implementations 30 Dec 2021 Dvir Yerushalmi, Dov Danon, Amit H. Bermano

In addition, we propose training a semantic segmentation network along with the translation task, and leveraging its output as a loss term that improves robustness.

Semantic Segmentation, Translation, +1

HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing

1 code implementation 30 Nov 2021 Yuval Alaluf, Omer Tov, Ron Mokady, Rinon Gal, Amit H. Bermano

In this work, we introduce this approach into the realm of encoder-based inversion.

ClipCap: CLIP Prefix for Image Captioning

3 code implementations 18 Nov 2021 Ron Mokady, Amir Hertz, Amit H. Bermano

Image captioning is a fundamental task in vision-language understanding, where the model predicts an informative textual caption for a given input image.

Image Captioning, Language Modelling

JOKR: Joint Keypoint Representation for Unsupervised Cross-Domain Motion Retargeting

1 code implementation 17 Jun 2021 Ron Mokady, Rotem Tzaban, Sagie Benaim, Amit H. Bermano, Daniel Cohen-Or

To alleviate this problem, we introduce JOKR - a JOint Keypoint Representation that captures the motion common to both the source and target videos, without requiring any object prior or data collection.

Disentanglement, Motion Retargeting

Pivotal Tuning for Latent-based Editing of Real Images

2 code implementations 10 Jun 2021 Daniel Roich, Ron Mokady, Amit H. Bermano, Daniel Cohen-Or

The key idea is pivotal tuning - a brief training process that preserves the editing quality of an in-domain latent region, while changing its portrayed identity and appearance.

Facial Editing, Image Manipulation
