no code implementations • ECCV 2020 • Oren Katzir, Dani Lischinski, Daniel Cohen-Or
We mitigate this by descending the deep layers of a pre-trained network, where the deep features contain more semantics, and applying the translation between these deep features.
no code implementations • 12 Sep 2024 • Omer Regev, Omri Avrahami, Dani Lischinski
Recent advancements in generative models have revolutionized image generation and editing, making these tasks accessible to non-experts.
no code implementations • 1 Sep 2024 • Or Greenberg, Eran Kishon, Dani Lischinski
In contrast to existing methods that modify images during the diffusion sampling process, we leverage the semantic information encoded within the space of inverted seeds of a pretrained DM, which we dub the seed-space.
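A heavily hedged sketch of what editing in such a seed-space could look like: invert an image to its seed, move along a semantic direction, and regenerate. `ddim_invert` and `ddim_sample` are placeholder callables, not the paper's code.

```python
# Sketch of seed-space editing (illustrative only). `ddim_invert` and
# `ddim_sample` stand in for deterministic DDIM inversion and sampling
# with the pretrained diffusion model `dm`.
def edit_in_seed_space(ddim_invert, ddim_sample, dm, image, direction, alpha=1.0):
    seed = ddim_invert(dm, image)                     # image -> inverted seed (noise)
    return ddim_sample(dm, seed + alpha * direction)  # regenerate the edited image
```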
no code implementations • 3 Jun 2024 • Omri Avrahami, Rinon Gal, Gal Chechik, Ohad Fried, Dani Lischinski, Arash Vahdat, Weili Nie
In this work, we propose a training-free method, dubbed DiffUHaul, that harnesses the spatial understanding of a localized text-to-image model, for the object dragging task.
no code implementations • 21 May 2024 • Jingyuan Yang, Jiawei Feng, Weibin Luo, Dani Lischinski, Daniel Cohen-Or, Hui Huang
We introduce EmoEdit, a novel two-stage framework comprising emotion attribution and image editing.
1 code implementation • CVPR 2024 • Yang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huang
This paper addresses the challenge of example-based non-stationary texture synthesis.
1 code implementation • 5 Dec 2023 • Brian Gordon, Yonatan Bitton, Yonatan Shafir, Roopal Garg, Xi Chen, Dani Lischinski, Daniel Cohen-Or, Idan Szpektor
While existing image-text alignment models achieve high-quality binary assessments, they fall short of pinpointing the exact source of misalignment.
no code implementations • 30 Nov 2023 • Or Greenberg, Eran Kishon, Dani Lischinski
Image-to-image translation (I2IT) refers to the process of transforming images from a source domain to a target domain while maintaining a fundamental connection in terms of image content.
1 code implementation • 16 Nov 2023 • Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter, Ohad Fried, Daniel Cohen-Or, Dani Lischinski
Recent advances in text-to-image generation models have unlocked vast potential for visual creativity.
no code implementations • 26 Oct 2023 • Oren Katzir, Or Patashnik, Daniel Cohen-Or, Dani Lischinski
Score Distillation Sampling (SDS) has emerged as the de facto approach for text-to-content generation in non-image domains.
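For context, a minimal PyTorch sketch of one SDS update is shown below; the `unet(x_t, t, text_emb)` signature and the weighting w(t) = 1 − ᾱ_t are common illustrative choices, not this paper's code.

```python
# A minimal sketch of one Score Distillation Sampling (SDS) step.
# `x` is assumed to be a differentiable rendering of the content
# being optimized; the U-Net itself receives no gradients.
import torch

def sds_step(unet, x, text_emb, alphas_cumprod):
    t = torch.randint(20, 980, (1,), device=x.device)   # random timestep
    noise = torch.randn_like(x)
    a_t = alphas_cumprod[t].view(-1, 1, 1, 1)
    x_t = a_t.sqrt() * x + (1 - a_t).sqrt() * noise     # forward-diffuse x
    with torch.no_grad():
        eps_pred = unet(x_t, t, text_emb)               # predicted noise, U-Net frozen
    grad = (1 - a_t) * (eps_pred - noise)               # SDS gradient, no U-Net backprop
    x.backward(gradient=grad)                           # flows into the renderer's parameters
```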
no code implementations • ICCV 2023 • Jingyuan Yang, Qirui Huang, Tingting Ding, Dani Lischinski, Daniel Cohen-Or, Hui Huang
Most of the existing work in this area focuses on feature design, while little attention has been paid to dataset construction.
no code implementations • 28 Jun 2023 • Naama Pearl, Yaron Brodsky, Dana Berman, Assaf Zomet, Alex Rav Acha, Daniel Cohen-Or, Dani Lischinski
Our formulation also accounts for the correlation that exists between the condition image and the samples along the modified diffusion process.
1 code implementation • 22 Jun 2023 • Ori Gordon, Omri Avrahami, Dani Lischinski
We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts, along with a 3D ROI box.
1 code implementation • NeurIPS 2023 • Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski
These questions form a dialog with the user in order to retrieve the desired image from a large corpus.
1 code implementation • 25 May 2023 • Omri Avrahami, Kfir Aberman, Ohad Fried, Daniel Cohen-Or, Dani Lischinski
Text-to-image model personalization aims to introduce a user-provided concept to the model, allowing its synthesis in diverse contexts.
1 code implementation • 16 Mar 2023 • Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski
To address these shortcomings, we introduce the Large Scale Composed Image Retrieval (LaSCo) dataset, a new CoIR dataset which is ten times larger than existing ones.
Ranked #1 on Image Retrieval on LaSCo
no code implementations • CVPR 2023 • Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin
Due to the lack of large-scale datasets with a detailed textual description for each region in the image, we leverage existing large-scale text-to-image datasets and base our approach on a novel CLIP-based spatio-textual representation, showing its effectiveness on two state-of-the-art diffusion models: pixel-based and latent-based.
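A simplified sketch of how such a spatio-textual map might be assembled, stamping each region with the CLIP embedding of its local description; the `encode_text` callable and the tensor shapes are assumptions, not the paper's implementation.

```python
# Sketch of a CLIP-based spatio-textual representation: each segment's
# pixels carry the CLIP embedding of its local text description.
import torch

def spatio_textual_map(masks, texts, encode_text, D, H, W):
    st_map = torch.zeros(D, H, W)
    for mask, text in zip(masks, texts):      # mask: (H, W) bool tensor
        emb = encode_text(text)               # (D,) CLIP text embedding
        st_map[:, mask] = emb.unsqueeze(-1)   # broadcast into the region
    return st_map                             # extra conditioning for the diffusion model
```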
1 code implementation • 13 Oct 2022 • Eric Ming Chen, Jin Sun, Apoorv Khandelwal, Dani Lischinski, Noah Snavely, Hadar Averbuch-Elor
How can one visually characterize people in a decade?
1 code implementation • 6 Jun 2022 • Omri Avrahami, Ohad Fried, Dani Lischinski
Our solution leverages a recent text-to-image Latent Diffusion Model (LDM), which speeds up diffusion by operating in a lower-dimensional latent space.
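For context, the per-step latent blending behind such local text-driven editing can be sketched as follows: the foreground comes from the text-guided latent, while the background is re-imposed from a correspondingly noised source latent. The `add_noise` call follows a diffusers-style scheduler interface and is an assumption, not the paper's code.

```python
# Sketch of one latent-blending step for local editing. `mask` is the
# edit mask downsampled to the latent resolution.
import torch

def blend_step(z_edit, z_src, mask, t, scheduler):
    z_src_t = scheduler.add_noise(z_src, torch.randn_like(z_src), t)
    return mask * z_edit + (1 - mask) * z_src_t   # keep background from source
```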
no code implementations • 3 Apr 2022 • Oren Katzir, Dani Lischinski, Daniel Cohen-Or
We introduce an unsupervised technique for encoding point clouds into a canonical shape representation, by disentangling shape and pose.
no code implementations • 11 Feb 2022 • Oren Katzir, Vicky Perepelook, Dani Lischinski, Daniel Cohen-Or
Truncation is widely used in generative models for improving the quality of the generated samples, at the expense of reducing their diversity.
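For reference, the standard truncation trick interpolates each latent toward the mean latent; a minimal sketch (names illustrative):

```python
# The classic truncation trick in a StyleGAN-like latent space:
# psi < 1 improves sample fidelity at the cost of diversity.
import torch

def truncate(w, w_mean, psi=0.7):
    return w_mean + psi * (w - w_mean)
```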
1 code implementation • 31 Jan 2022 • Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-Or
In particular, we demonstrate that while StyleGAN3 can be trained on unaligned data, one can still use aligned data for training, without hindering the ability to generate unaligned imagery.
1 code implementation • CVPR 2022 • Xingguang Yan, Liqiang Lin, Niloy J. Mitra, Dani Lischinski, Daniel Cohen-Or, Hui Huang
We present ShapeFormer, a transformer-based network that produces a distribution of object completions, conditioned on incomplete, and possibly noisy, point clouds.
1 code implementation • 29 Nov 2021 • Matan Levy, Rami Ben-Ari, Dani Lischinski
Our model is particularly well suited for realistic questions with out-of-vocabulary answers that require regression.
Ranked #2 on Visual Question Answering (VQA) on PlotQA-D1
1 code implementation • CVPR 2022 • Omri Avrahami, Dani Lischinski, Ohad Fried
Natural language offers a highly intuitive interface for image editing.
1 code implementation • ICLR 2022 • Zongze Wu, Yotam Nitzan, Eli Shechtman, Dani Lischinski
Several works already utilize some basic properties of aligned StyleGAN models to perform image-to-image translation.
1 code implementation • ICCV 2021 • Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li
The reason is that the learned weights balancing the importance of the shape and base components in ShapeConv become constants in the inference phase, and can thus be fused into the following convolution, resulting in a network identical to one with vanilla convolutional layers.
Ranked #3 on Semantic Segmentation on Stanford2D3D - RGBD
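As a rough illustration of the fusion argument above, constant per-input-channel balancing weights can be absorbed into the following convolution's kernel, so inference cost matches a vanilla convolution. This is a minimal sketch, not the authors' code.

```python
# Fold constant per-input-channel scales into the next convolution:
# conv(scale * x) == conv'(x) with the scale absorbed into the kernel.
import torch

def fuse_channel_scale(conv_weight, scale):
    # conv_weight: (C_out, C_in, k, k); scale: (C_in,) constant balancing weights
    return conv_weight * scale.view(1, -1, 1, 1)
```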
1 code implementation • 7 Jun 2021 • Omri Avrahami, Dani Lischinski, Ohad Fried
In the second stage, we merge the rooted models by averaging their weights and fine-tuning them for each specific domain, using only data generated by the original trained models.
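A minimal sketch of the weight-averaging step, assuming the two rooted models share one architecture and origin; the helper is illustrative, not the paper's code.

```python
# Average the state dicts of two rooted models with identical architectures.
def average_weights(state_a, state_b):
    return {k: 0.5 * (state_a[k] + state_b[k]) for k in state_a}

# merged = average_weights(model_a.state_dict(), model_b.state_dict())
# model.load_state_dict(merged)   # then fine-tune per domain on generated data
```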
5 code implementations • ICCV 2021 • Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski
Inspired by the ability of StyleGAN to generate highly realistic images in a variety of domains, much recent work has focused on understanding how to use the latent spaces of StyleGAN to manipulate generated and real images.
no code implementations • 26 Dec 2020 • Sarah Gingichashvili, Dani Lischinski
Edge-preserving filters play an essential role in some of the most basic tasks of computational photography, such as abstraction, tonemapping, detail enhancement, and texture removal.
6 code implementations • CVPR 2021 • Zongze Wu, Dani Lischinski, Eli Shechtman
Manipulation of visual attributes via these StyleSpace controls is shown to be better disentangled than via those proposed in previous works.
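As a rough illustration, a StyleSpace edit amounts to shifting a single (layer, channel) style coordinate that controls one visual attribute; the indices below are placeholders, not from the paper.

```python
# Nudge one style channel in StyleSpace for a localized, disentangled edit.
def edit_stylespace(style_codes, layer, channel, delta):
    style_codes[layer][:, channel] += delta   # style_codes: list of (B, C) tensors
    return style_codes
```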
no code implementations • 22 Jun 2020 • Mingyi Shi, Kfir Aberman, Andreas Aristidou, Taku Komura, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video. While previous methods rely on either rigging or inverse kinematics (IK) to associate a consistent skeleton with temporally coherent joint rotations, our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used, motion representation.
1 code implementation • 22 Jun 2020 • Jinming Cao, Yangyan Li, Mingchao Sun, Ying Chen, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen, Changhe Tu
Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, making the computation exactly equivalent to that of a convolutional layer without over-parameterization.
1 code implementation • 12 May 2020 • Kfir Aberman, Peizhuo Li, Dani Lischinski, Olga Sorkine-Hornung, Daniel Cohen-Or, Baoquan Chen
In other words, our operators form the building blocks of a new deep motion processing framework that embeds the motion into a common latent space, shared by a collection of homeomorphic skeletons.
1 code implementation • 12 May 2020 • Kfir Aberman, Yijia Weng, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
In this paper, we present a novel data-driven framework for motion style transfer, which learns from an unpaired collection of motions with style labels, and enables transferring motion styles not observed during training.
no code implementations • 10 Jan 2020 • Omry Sendik, Dani Lischinski, Daniel Cohen-Or
The emergence of deep generative models has recently enabled the automatic generation of massive amounts of graphical content, both in 2D and in 3D.
no code implementations • 13 Jun 2019 • Eytan Lifshitz, Dani Lischinski
In this paper, we present a new, physically-based, approach for estimating illuminant chromaticity from interreflections of light between diffuse surfaces.
no code implementations • 4 Jun 2019 • Oren Katzir, Dani Lischinski, Daniel Cohen-Or
Our translation is performed in a cascaded, deep-to-shallow, fashion, along the deep feature hierarchy: we first translate between the deepest layers that encode the higher-level semantic content of the image, proceeding to translate the shallower layers, conditioned on the deeper ones.
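A skeletal sketch of such a cascaded deep-to-shallow translation, where each shallower level is translated conditioned on the already-translated deeper one; the translator modules are hypothetical placeholders.

```python
# Cascaded deep-to-shallow feature translation (illustrative skeleton).
def translate_cascade(features, translators):
    # features: feature maps ordered deepest -> shallowest
    out = [translators[0](features[0])]       # translate the deepest level first
    for f, t in zip(features[1:], translators[1:]):
        out.append(t(f, out[-1]))             # condition on translated deeper level
    return out
```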
2 code implementations • 5 May 2019 • Kfir Aberman, Rundi Wu, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or
In order to achieve our goal, we learn to extract, directly from a video, a high-level latent motion representation, which is invariant to the skeleton geometry and the camera view.
no code implementations • 14 Jan 2019 • Omry Sendik, Dani Lischinski, Daniel Cohen-Or
Recent GAN-based architectures have been able to deliver impressive performance on the general task of image-to-image translation.
no code implementations • ECCV 2018 • Di Lin, Yuanfeng Ji, Dani Lischinski, Daniel Cohen-Or, Hui Huang
Accurate semantic image segmentation requires the joint consideration of local appearance, semantic information, and global scene context.
no code implementations • 21 Aug 2018 • Kfir Aberman, Mingyi Shi, Jing Liao, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or
After training a deep generative network using a reference video capturing the appearance and dynamics of a target actor, we are able to generate videos where this actor reenacts other performances.
1 code implementation • 12 Aug 2018 • Zhijie Wu, Xiang Wang, Di Lin, Dani Lischinski, Daniel Cohen-Or, Hui Huang
The key idea is that during the analysis, the two branches exchange information between them, thereby learning the dependencies between structure and geometry and encoding two augmented features, which are then fused into a single latent code.
no code implementations • 21 May 2018 • Jinming Cao, Oren Katzir, Peng Jiang, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li
The key idea is that by learning to separately extract both the common and the domain-specific features, one can synthesize more target domain data with supervision, thereby boosting the domain adaptation performance.
1 code implementation • 11 May 2018 • Yang Zhou, Zhen Zhu, Xiang Bai, Dani Lischinski, Daniel Cohen-Or, Hui Huang
We demonstrate that this conceptually simple approach is highly effective for capturing large-scale structures, as well as other non-stationary attributes of the input exemplar.
2 code implementations • 10 May 2018 • Kfir Aberman, Jing Liao, Mingyi Shi, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or
Correspondence between images is a fundamental problem in computer vision, with a variety of graphics applications.
no code implementations • 22 Nov 2017 • Zhenhua Wang, Fanglin Gu, Dani Lischinski, Daniel Cohen-Or, Changhe Tu, Baoquan Chen
Contextual information provides important cues for disambiguating visually similar pixels in scene segmentation.
no code implementations • ICCV 2017 • Lei Zhu, Chi-Wing Fu, Dani Lischinski, Pheng-Ann Heng
A third prior is defined on the rain-streak layer R, based on the similarity of its patches to the extracted rain patches.
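A hedged sketch of what such a patch-similarity prior could look like: penalize patches of R that lie far from their nearest neighbor among the extracted rain patches. The shapes and the nearest-neighbor energy are illustrative assumptions, not the paper's formulation.

```python
# Nearest-neighbor patch-similarity energy on the rain layer R.
import torch
import torch.nn.functional as F

def rain_prior(R, rain_patches, k=8):
    # R: (1, 1, H, W) rain-streak layer; rain_patches: (M, k*k) exemplars
    patches = F.unfold(R, kernel_size=k).transpose(1, 2)   # (1, N, k*k) patches of R
    d = torch.cdist(patches, rain_patches.unsqueeze(0))    # (1, N, M) pairwise distances
    return d.min(dim=-1).values.mean()                     # average nearest-neighbor distance
```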
no code implementations • 18 Aug 2016 • Huayong Xu, Yangyan Li, Wenzheng Chen, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
We show that the resulting P-maps may be used to evaluate how likely a rectangle proposal is to contain an instance of the class, and further process good proposals to produce an accurate object cutout mask.
no code implementations • 10 Apr 2016 • Wenzheng Chen, Huan Wang, Yangyan Li, Hao Su, Zhenhua Wang, Changhe Tu, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
Human 3D pose estimation from a single image is a challenging task with numerous applications.