Search Results for author: Eli Shechtman

Found 95 papers, 60 papers with code

Lazy Diffusion Transformer for Interactive Image Editing

no code implementations 18 Apr 2024 Yotam Nitzan, Zongze Wu, Richard Zhang, Eli Shechtman, Daniel Cohen-Or, Taesung Park, Michaël Gharbi

We demonstrate that our approach is competitive with state-of-the-art inpainting methods in terms of quality and fidelity while providing a 10x speedup for typical user interactions, where the editing mask represents 10% of the image.

Customizing Text-to-Image Diffusion with Camera Viewpoint Control

no code implementations 18 Apr 2024 Nupur Kumari, Grace Su, Richard Zhang, Taesung Park, Eli Shechtman, Jun-Yan Zhu

Model customization introduces new concepts to existing text-to-image models, enabling the generation of the new concept in novel contexts.

Object Prompt Engineering

VideoGigaGAN: Towards Detail-rich Video Super-Resolution

no code implementations 18 Apr 2024 Yiran Xu, Taesung Park, Richard Zhang, Yang Zhou, Eli Shechtman, Feng Liu, Jia-Bin Huang, Difan Liu

We introduce VideoGigaGAN, a new generative VSR model that can produce videos with high-frequency details and temporal consistency.

Video Super-Resolution

Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

no code implementations 19 Mar 2024 Hadi AlZayer, Zhihao Xia, Xuaner Zhang, Eli Shechtman, Jia-Bin Huang, Michael Gharbi

We show that by using simple segmentations and coarse 2D manipulations, we can synthesize a photorealistic edit faithful to the user's input while addressing second-order effects like harmonizing the lighting and physical interactions between edited objects.

Jump Cut Smoothing for Talking Heads

no code implementations 9 Jan 2024 Xiaojuan Wang, Taesung Park, Yang Zhou, Eli Shechtman, Richard Zhang

We leverage the appearance of the subject from the other source frames in the video, fusing it with a mid-level representation driven by DensePose keypoints and face landmarks.

One-step Diffusion with Distribution Matching Distillation

no code implementations 30 Nov 2023 Tianwei Yin, Michaël Gharbi, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman, Taesung Park

We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on image quality.

DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer

no code implementations 9 Jul 2023 Dan Ruta, Gemma Canet Tarrés, Andrew Gilbert, Eli Shechtman, Nicholas Kolkin, John Collomosse

Neural Style Transfer (NST) is the field of study that applies neural techniques to modify the artistic appearance of a content image to match the style of a reference style image.

Image Generation Style Transfer

NeAT: Neural Artistic Tracing for Beautiful Style Transfer

1 code implementation 11 Apr 2023 Dan Ruta, Andrew Gilbert, John Collomosse, Eli Shechtman, Nicholas Kolkin

As a component of curating this data, we present a novel model able to classify whether an image is stylistic.

Image Generation Style Transfer

Ablating Concepts in Text-to-Image Diffusion Models

1 code implementation ICCV 2023 Nupur Kumari, Bingliang Zhang, Sheng-Yu Wang, Eli Shechtman, Richard Zhang, Jun-Yan Zhu

To achieve this goal, we propose an efficient method of ablating concepts in the pretrained model, i.e., preventing the generation of a target concept.

Scaling up GANs for Text-to-Image Synthesis

1 code implementation CVPR 2023 Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park

From a technical standpoint, it also marked a drastic change in the favored architecture for designing generative image models.

Text-to-Image Generation

Semi-supervised Parametric Real-world Image Harmonization

no code implementations CVPR 2023 Ke Wang, Michaël Gharbi, He Zhang, Zhihao Xia, Eli Shechtman

Learning-based image harmonization techniques are usually trained to undo synthetic random global transformations applied to a masked foreground in a single ground truth photo.

Image Harmonization

Text-Free Learning of a Natural Language Interface for Pretrained Face Generators

1 code implementation 8 Sep 2022 Xiaodan Du, Raymond A. Yeh, Nicholas Kolkin, Eli Shechtman, Greg Shakhnarovich

We propose Fast text2StyleGAN, a natural language interface that adapts pre-trained GANs for text-guided human face synthesis.

Face Generation

Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation

no code implementations 6 Aug 2022 Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi

Recently, deep models have established SOTA performance for low-resolution image inpainting, but they lack fidelity at resolutions associated with modern cameras such as 4K or more, and for large holes.

4k Image Inpainting

Perceptual Artifacts Localization for Inpainting

1 code implementation 5 Aug 2022 Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi

Inspired by this workflow, we propose a new learning task of automatic segmentation of inpainting perceptual artifacts, and apply the model for inpainting model evaluation and iterative refinement.

Image Inpainting

Controllable Shadow Generation Using Pixel Height Maps

no code implementations 12 Jul 2022 Yichen Sheng, Yifan Liu, Jianming Zhang, Wei Yin, A. Cengiz Oztireli, He Zhang, Zhe Lin, Eli Shechtman, Bedrich Benes

It can be used to calculate hard shadows in a 2D image based on projective geometry, providing precise control over the shadows' direction and shape.
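As a toy illustration of the idea only (a simplified parallel-light assumption; this is not the paper's formulation or code), each object pixel with pixel height h can be thought of as casting its shadow at its ground-contact point, shifted by an amount proportional to h along a 2D light direction:

```python
# Toy hard-shadow rendering from a per-pixel "pixel height" map.
# Hypothetical sketch: light_dir and the per-pixel loop are illustrative
# choices, not the paper's projective-geometry derivation.
import numpy as np

def hard_shadow(object_mask, pixel_height, light_dir=(0.6, 0.3)):
    """object_mask: (H, W) bool; pixel_height: (H, W) heights in pixels."""
    H, W = object_mask.shape
    shadow = np.zeros((H, W), dtype=bool)
    dx, dy = light_dir                        # shadow offset per unit height
    ys, xs = np.nonzero(object_mask)
    for y, x in zip(ys, xs):
        h = pixel_height[y, x]
        foot_y = y + h                        # ground-contact point below the pixel
        sx = int(round(x + dx * h))           # shift grows with pixel height
        sy = int(round(foot_y + dy * h))
        if 0 <= sx < W and 0 <= sy < H:
            shadow[sy, sx] = True
    return shadow
```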

RigNeRF: Fully Controllable Neural 3D Portraits

no code implementations CVPR 2022 ShahRukh Athar, Zexiang Xu, Kalyan Sunkavalli, Eli Shechtman, Zhixin Shu

In this work, we propose RigNeRF, a system that goes beyond just novel view synthesis and enables full control of head pose and facial expressions learned from a single portrait video.

Face Model Neural Rendering +1

ARF: Artistic Radiance Fields

1 code implementation 13 Jun 2022 Kai Zhang, Nick Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely

We present a method for transferring the artistic features of an arbitrary style image to a 3D scene.

BlobGAN: Spatially Disentangled Scene Representations

no code implementations 5 May 2022 Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros

Blobs are differentiably placed onto a feature grid that is decoded into an image by a generative adversarial network.

Generative Adversarial Network
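To make the blob-placement sentence above concrete, here is a minimal, hypothetical sketch (not the released BlobGAN code) of differentiably splatting Gaussian blobs onto a feature grid, so gradients reach the blob centers, sizes, and feature vectors; in the paper such a grid would then be decoded into an image by a GAN generator:

```python
# Minimal sketch: splat K Gaussian "blobs" onto an H x W feature grid
# in a differentiable way. All names and shapes here are assumptions.
import torch

def splat_blobs(centers, scales, feats, H=16, W=16):
    """centers: (K, 2) in [0, 1]; scales: (K,) blob radii; feats: (K, C)."""
    K, C = feats.shape
    ys = torch.linspace(0.0, 1.0, H)
    xs = torch.linspace(0.0, 1.0, W)
    gy, gx = torch.meshgrid(ys, xs, indexing="ij")                  # (H, W) each
    grid = torch.stack([gx, gy], dim=-1)                            # (H, W, 2)
    # Squared distance of every grid cell to every blob center.
    d2 = ((grid[None] - centers[:, None, None, :]) ** 2).sum(-1)    # (K, H, W)
    alpha = torch.exp(-d2 / (2.0 * scales[:, None, None] ** 2))     # soft blob masks
    # Weighted sum of blob features at every cell.
    feature_grid = torch.einsum("khw,kc->chw", alpha, feats)
    return feature_grid, alpha

# Toy usage: 3 random blobs on a 16x16 grid with 8-dim features.
centers = torch.rand(3, 2, requires_grad=True)
scales = torch.full((3,), 0.1, requires_grad=True)
feats = torch.randn(3, 8, requires_grad=True)
fgrid, masks = splat_blobs(centers, scales, feats)
fgrid.sum().backward()   # gradients reach the blob parameters
```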

Any-resolution Training for High-resolution Image Synthesis

1 code implementation 14 Apr 2022 Lucy Chai, Michael Gharbi, Eli Shechtman, Phillip Isola, Richard Zhang

To take advantage of varied-size data, we introduce continuous-scale training, a process that samples patches at random scales to train a new generator with variable output resolutions.

2k Image Generation +1
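For the continuous-scale training described in the Any-resolution Training entry above, the patch-sampling step might look roughly as follows (an illustrative assumption, not the authors' implementation): each native-resolution image yields fixed-size patches cropped at a randomly drawn scale, with the scale recorded so a generator could be conditioned on it.

```python
# Hypothetical sketch of random-scale patch sampling for variable-size images.
import random
from PIL import Image

PATCH = 256  # fixed patch size seen during training (assumed value)

def sample_patch(img: Image.Image, min_scale=0.25):
    w, h = img.size
    # Random scale in (min_scale, 1.0]; s = 1.0 means a native-resolution crop.
    s = random.uniform(min_scale, 1.0)
    win = int(round(PATCH / s))              # crop window size in source pixels
    win = min(win, w, h)                     # stay inside the image
    x = random.randint(0, w - win)
    y = random.randint(0, h - win)
    crop = img.crop((x, y, x + win, y + win)).resize((PATCH, PATCH), Image.LANCZOS)
    return crop, win / PATCH                 # patch plus its downsampling factor

# Usage: patch, scale = sample_patch(Image.open("photo.jpg"))
```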

Neural Neighbor Style Transfer

1 code implementation 24 Mar 2022 Nicholas Kolkin, Michal Kucera, Sylvain Paris, Daniel Sykora, Eli Shechtman, Greg Shakhnarovich

We propose Neural Neighbor Style Transfer (NNST), a pipeline that offers state-of-the-art quality, generalization, and competitive efficiency for artistic style transfer.

Style Transfer

CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training

1 code implementation 22 Mar 2022 Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo

We propose cascaded modulation GAN (CM-GAN), a new network design consisting of an encoder with Fourier convolution blocks that extract multi-scale feature representations from the input image with holes and a dual-stream decoder with a novel cascaded global-spatial modulation block at each scale level.

Image Inpainting

InsetGAN for Full-Body Image Generation

2 code implementations CVPR 2022 Anna Frühstück, Krishna Kumar Singh, Eli Shechtman, Niloy J. Mitra, Peter Wonka, Jingwan Lu

Instead of modeling this complex domain with a single GAN, we propose a novel method to combine multiple pretrained GANs, where one GAN generates a global canvas (e.g., human body) and a set of specialized GANs, or insets, focus on different parts (e.g., faces, shoes) that can be seamlessly inserted onto the global canvas.

Image Generation

Third Time's the Charm? Image and Video Editing with StyleGAN3

1 code implementation 31 Jan 2022 Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-Or

In particular, we demonstrate that while StyleGAN3 can be trained on unaligned data, one can still use aligned data for training, without hindering the ability to generate unaligned imagery.

Disentanglement Image Generation +1

GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding

no code implementations 20 Jan 2022 Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless Fowlkes

Our approach achieves state-of-the-art performance on both the RealEstate10K and MannequinChallenge datasets with large baselines, complex geometry, and extreme camera motions.

Image Inpainting Monocular Depth Estimation

Ensembling Off-the-shelf Models for GAN Training

1 code implementation CVPR 2022 Nupur Kumari, Richard Zhang, Eli Shechtman, Jun-Yan Zhu

Can the collective "knowledge" from a large bank of pretrained vision models be leveraged to improve GAN training?

Image Generation

GAN-Supervised Dense Visual Alignment

1 code implementation CVPR 2022 William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei A. Efros, Eli Shechtman

We propose GAN-Supervised Learning, a framework for learning discriminative models and their GAN-generated training data jointly end-to-end.

Data Augmentation Dense Pixel Correspondence Estimation

STALP: Style Transfer with Auxiliary Limited Pairing

no code implementations 20 Oct 2021 David Futschik, Michal Kučera, Michal Lukáč, Zhaowen Wang, Eli Shechtman, Daniel Sýkora

We present an approach to example-based stylization of images that uses a single pair consisting of a source image and its stylized counterpart.

Style Transfer Translation

Real Image Inversion via Segments

1 code implementation 12 Oct 2021 David Futschik, Michal Lukáč, Eli Shechtman, Daniel Sýkora

In this short report, we present a simple yet effective approach to editing real images via generative adversarial networks (GANs).

KDSalBox: A toolbox of efficient knowledge-distilled saliency models

no code implementations NeurIPS Workshop SVRHM 2021 Ard Kastrati, Zoya Bylinskii, Eli Shechtman

Dozens of saliency models have been designed over the last few decades, targeted at diverse applications ranging from image compression and retargeting to robot navigation, surveillance, and distractor detection.

Image Compression Robot Navigation

Ensembling with Deep Generative Views

1 code implementation CVPR 2021 Lucy Chai, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, Richard Zhang

Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.

Image Classification

Modulated Periodic Activations for Generalizable Local Functional Representations

2 code implementations ICCV 2021 Ishit Mehta, Michaël Gharbi, Connelly Barnes, Eli Shechtman, Ravi Ramamoorthi, Manmohan Chandraker

Our approach produces generalizable functional representations of images, videos and shapes, and achieves higher reconstruction quality than prior works that are optimized for a single signal.
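A minimal sketch of the general idea behind such local functional representations, assuming a SIREN-style layer whose periodic activations are modulated by a per-signal latent code (an illustration under stated assumptions, not the paper's exact architecture):

```python
# Hypothetical modulated periodic-activation layer: a shared coordinate
# network whose hidden units are scaled by features computed from a latent,
# so one network can represent many signals.
import torch
import torch.nn as nn

class ModulatedSineLayer(nn.Module):
    def __init__(self, in_dim, hidden_dim, latent_dim, omega=30.0):
        super().__init__()
        self.omega = omega
        self.synth = nn.Linear(in_dim, hidden_dim)     # operates on coordinates
        self.mod = nn.Linear(latent_dim, hidden_dim)   # operates on the latent

    def forward(self, coords, z):
        # Periodic activation, scaled element-wise by a latent-driven modulation.
        modulation = torch.relu(self.mod(z))
        return modulation * torch.sin(self.omega * self.synth(coords))

# Toy usage: 2D coordinates, one 64-dim latent describing the signal.
layer = ModulatedSineLayer(in_dim=2, hidden_dim=128, latent_dim=64)
coords = torch.rand(1024, 2)                # query points for one signal
z = torch.randn(1, 64).expand(1024, -1)     # same latent for every query point
features = layer(coords, z)
```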

StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery

5 code implementations ICCV 2021 Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski

Inspired by the ability of StyleGAN to generate highly realistic images in a variety of domains, much recent work has focused on understanding how to use the latent spaces of StyleGAN to manipulate generated and real images.

Image Manipulation

CharacterGAN: Few-Shot Keypoint Character Animation and Reposing

1 code implementation 5 Feb 2021 Tobias Hinz, Matthew Fisher, Oliver Wang, Eli Shechtman, Stefan Wermter

Our model generates novel poses based on keypoint locations, which can be modified in real time while providing interactive feedback, allowing for intuitive reposing and animation.

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

6 code implementations CVPR 2021 Zongze Wu, Dani Lischinski, Eli Shechtman

Manipulation of visual attributes via these StyleSpace controls is shown to be better disentangled than via those proposed in previous works.

Attribute Image Generation

Look here! A parametric learning based approach to redirect visual attention

no code implementations ECCV 2020 Youssef Alami Mejjati, Celso F. Gomez, Kwang In Kim, Eli Shechtman, Zoya Bylinskii

Extensions of our model allow for multi-style edits and the ability to both increase and attenuate attention in an image region.

Marketing

Swapping Autoencoder for Deep Image Manipulation

4 code implementations NeurIPS 2020 Taesung Park, Jun-Yan Zhu, Oliver Wang, Jingwan Lu, Eli Shechtman, Alexei A. Efros, Richard Zhang

Deep generative models have become increasingly effective at producing realistic images from randomly sampled seeds, but using such models for controllable manipulation of existing images remains challenging.

Image Manipulation

Image Morphing with Perceptual Constraints and STN Alignment

1 code implementation 29 Apr 2020 Noa Fish, Richard Zhang, Lilach Perry, Daniel Cohen-Or, Eli Shechtman, Connelly Barnes

In image morphing, a sequence of plausible frames is synthesized and composited together to form a smooth transformation between given instances.

Image Morphing

MakeItTalk: Speaker-Aware Talking-Head Animation

3 code implementations 27 Apr 2020 Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, Dingzeyu Li

We present a method that generates expressive talking heads from a single facial image with audio as the only input.

Talking Face Generation Talking Head Generation

State of the Art on Neural Rendering

no code implementations 8 Apr 2020 Ayush Tewari, Ohad Fried, Justus Thies, Vincent Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, Tomas Simon, Jason Saragih, Matthias Nießner, Rohit Pandey, Sean Fanello, Gordon Wetzstein, Jun-Yan Zhu, Christian Theobalt, Maneesh Agrawala, Eli Shechtman, Dan B. Goldman, Michael Zollhöfer

Neural rendering is a new and rapidly emerging field that combines generative machine learning techniques with physical knowledge from computer graphics, e.g., by the integration of differentiable rendering into network training.

BIG-bench Machine Learning Image Generation +2

Lifespan Age Transformation Synthesis

2 code implementations ECCV 2020 Roy Or-El, Soumyadip Sengupta, Ohad Fried, Eli Shechtman, Ira Kemelmacher-Shlizerman

Most existing aging methods are limited to changing the texture, overlooking transformations in head shape that occur during the human aging and growth process.

Face Age Editing Generative Adversarial Network +5

Neural Puppet: Generative Layered Cartoon Characters

no code implementations 4 Oct 2019 Omid Poursaeed, Vladimir G. Kim, Eli Shechtman, Jun Saito, Serge Belongie

We capture these subtle changes by applying an image translation network to refine the mesh rendering, providing an end-to-end model to generate new animations of a character with high visual quality.

UprightNet: Geometry-Aware Camera Orientation Estimation from Single Images

no code implementations ICCV 2019 Wenqi Xian, Zhengqi Li, Matthew Fisher, Jonathan Eisenmann, Eli Shechtman, Noah Snavely

We introduce UprightNet, a learning-based approach for estimating 2DoF camera orientation from a single RGB image of an indoor scene.

Camera Calibration

Text-based Editing of Talking-head Video

1 code implementation 4 Jun 2019 Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B. Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala

To edit a video, the user only has to edit the transcript, and an optimization strategy then chooses segments of the input corpus as base material.

Face Model Sentence +3

Im2Pencil: Controllable Pencil Illustration from Photographs

1 code implementation CVPR 2019 Yijun Li, Chen Fang, Aaron Hertzmann, Eli Shechtman, Ming-Hsuan Yang

We propose a high-quality photo-to-pencil translation method with fine-grained control over the drawing style.

Translation

Localizing Moments in Video with Temporal Language

1 code implementation EMNLP 2018 Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell

To benchmark whether our model, and other recent video localization models, can effectively reason about temporal language, we collect the novel TEMPOral reasoning in video and language (TEMPO) dataset.

Natural Language Queries Retrieval +1

MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics

1 code implementation ECCV 2018 Xinchen Yan, Akash Rastogi, Ruben Villegas, Kalyan Sunkavalli, Eli Shechtman, Sunil Hadap, Ersin Yumer, Honglak Lee

Our model jointly learns a feature embedding for motion modes (from which the motion sequence can be reconstructed) and a feature transformation that represents the transition from one motion mode to the next.

Human Dynamics Human Pose Forecasting +1

Learning Blind Video Temporal Consistency

1 code implementation ECCV 2018 Wei-Sheng Lai, Jia-Bin Huang, Oliver Wang, Eli Shechtman, Ersin Yumer, Ming-Hsuan Yang

Our method takes the original unprocessed and per-frame processed videos as inputs to produce a temporally consistent video.

Colorization Image-to-Image Translation +4

Deep Painterly Harmonization

12 code implementations 9 Apr 2018 Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

Copying an element from a photo and pasting it into a painting is a challenging task.

Graphics

ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing

2 code implementations CVPR 2018 Chen-Hsuan Lin, Ersin Yumer, Oliver Wang, Eli Shechtman, Simon Lucey

We address the problem of finding realistic geometric corrections to a foreground object such that it appears natural when composited into a background image.

Generative Adversarial Network

Multi-Content GAN for Few-Shot Font Style Transfer

6 code implementations CVPR 2018 Samaneh Azadi, Matthew Fisher, Vladimir Kim, Zhaowen Wang, Eli Shechtman, Trevor Darrell

In this work, we focus on the challenge of taking partial observations of highly-stylized text and generalizing the observations to generate unobserved glyphs in the ornamented typeface.

Font Style Transfer

Photorealistic Style Transfer with Screened Poisson Equation

1 code implementation 28 Sep 2017 Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

Recent work has shown impressive success in transferring painterly style to images.

Style Transfer

Training Deep Networks to be Spatially Sensitive

no code implementations ICCV 2017 Nicholas Kolkin, Gregory Shakhnarovich, Eli Shechtman

In many computer vision tasks, for example, saliency prediction or semantic segmentation, the desired output is a foreground map that predicts pixels where some criterion is satisfied.

Saliency Prediction Semantic Segmentation

Localizing Moments in Video with Natural Language

2 code implementations ICCV 2017 Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell

A key obstacle to training our MCN model is that current video datasets do not include pairs of localized video segments and referring expressions, or text descriptions which uniquely identify a corresponding moment.

Natural Language Queries

Neural Face Editing with Intrinsic Image Disentangling

2 code implementations CVPR 2017 Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras

Traditional face editing methods often require a number of sophisticated and task-specific algorithms to be applied one after the other, a process that is tedious, fragile, and computationally intensive.

Facial Editing Generative Adversarial Network

Deep Photo Style Transfer

21 code implementations CVPR 2017 Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

This paper introduces a deep-learning approach to photographic style transfer that handles a large variety of image content while faithfully transferring the reference style.

Style Transfer

Removing Shadows from Images of Documents

2 code implementations ACCV 2017 Steve Bako, Soheil Darabi, Eli Shechtman, Jue Wang, Kalyan Sunkavalli, Pradeep Sen

In this work, we automatically detect and remove distracting shadows from photographs of documents and other text-based items.

Document Shadow Removal

Saliency Driven Image Manipulation

1 code implementation 7 Dec 2016 Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

Have you ever taken a picture only to find out that an unimportant background object ended up being overly salient?

Image Manipulation

High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis

1 code implementation CVPR 2017 Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

Recent advances in deep learning have shown exciting promise in filling large holes in natural images with semantically plausible and context aware details, impacting fundamental image manipulation tasks such as object removal.

Image Inpainting Image Manipulation +1

Generative Visual Manipulation on the Natural Image Manifold

1 code implementation 12 Sep 2016 Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros

Realistic image manipulation is challenging because it requires modifying the image appearance in a user-controlled way, while preserving the realism of the result.

Image Manipulation

Preserving Color in Neural Artistic Style Transfer

7 code implementations 19 Jun 2016 Leon A. Gatys, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

This note presents an extension to the neural artistic style transfer algorithm (Gatys et al.).

Style Transfer

Appearance Harmonization for Single Image Shadow Removal

no code implementations 21 Mar 2016 Liqian Ma, Jue Wang, Eli Shechtman, Kalyan Sunkavalli, Shi-Min Hu

In this work we propose a fully automatic shadow region harmonization approach that improves the appearance compatibility of the de-shadowed region as typically produced by previous methods.

Image Generation Image Shadow Removal +1

PatchMatch-Based Automatic Lattice Detection for Near-Regular Textures

no code implementations ICCV 2015 Siying Liu, Tian-Tsong Ng, Kalyan Sunkavalli, Minh N. Do, Eli Shechtman, Nathan Carr

In this work, we investigate the problem of automatically inferring the lattice structure of near-regular textures (NRT) in real-world images.

DeepFont: Identify Your Font from An Image

1 code implementation 12 Jul 2015 Zhangyang Wang, Jianchao Yang, Hailin Jin, Eli Shechtman, Aseem Agarwala, Jonathan Brandt, Thomas S. Huang

As font is one of the core design concepts, automatic font identification and similar font suggestion from an image or photo have been on the wish list of many designers.

Domain Adaptation Font Recognition +1

Large-Scale Visual Font Recognition

no code implementations CVPR 2014 Guang Chen, Jianchao Yang, Hailin Jin, Jonathan Brandt, Eli Shechtman, Aseem Agarwala, Tony X. Han

This paper addresses the large-scale visual font recognition (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content.

Font Recognition Image Categorization +1

Learning Video Saliency from Human Gaze Using Candidate Selection

no code implementations CVPR 2013 Dmitry Rudoy, Dan B. Goldman, Eli Shechtman, Lihi Zelnik-Manor

For example, the time each video frame is observed is a fraction of a second, while a still image can be viewed leisurely.

Saliency Prediction

Crowdsourcing Gaze Data Collection

1 code implementation 16 Apr 2012 Dmitry Rudoy, Dan B. Goldman, Eli Shechtman, Lihi Zelnik-Manor

In this work we propose a crowdsourced method for acquisition of gaze direction data from a virtually unlimited number of participants, using a robust self-reporting mechanism (see Figure 1).

Social and Information Networks Human-Computer Interaction
