Search Results for author: Jimei Yang

Found 62 papers, 29 papers with code

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

no code implementations 22 Feb 2024 Yixuan Ren, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava

With the emergence of text-to-video (T2V) diffusion models, motion customization, the temporal counterpart of image customization, has not yet been well investigated.

Video Generation

Template-Free Single-View 3D Human Digitalization with Diffusion-Guided LRM

no code implementations 22 Jan 2024 Zhenzhen Weng, Jingyuan Liu, Hao Tan, Zhan Xu, Yang Zhou, Serena Yeung-Levy, Jimei Yang

We present Human-LRM, a diffusion-guided feed-forward model that predicts the implicit field of a human from a single image.

ActAnywhere: Subject-Aware Video Background Generation

no code implementations 19 Jan 2024 Boxiao Pan, Zhan Xu, Chun-Hao Paul Huang, Krishna Kumar Singh, Yang Zhou, Leonidas J. Guibas, Jimei Yang

Generating video background that tailors to foreground subject motion is an important problem for the movie industry and visual effects community.

Putting People in Their Place: Affordance-Aware Human Insertion into Scenes

1 code implementation CVPR 2023 Sumith Kulal, Tim Brooks, Alex Aiken, Jiajun Wu, Jimei Yang, Jingwan Lu, Alexei A. Efros, Krishna Kumar Singh

Given a scene image with a marked region and an image of a person, we insert the person into the scene while respecting the scene affordances.

Normal-guided Garment UV Prediction for Human Re-texturing

no code implementations CVPR 2023 Yasamin Jafarian, Tuanfeng Y. Wang, Duygu Ceylan, Jimei Yang, Nathan Carr, Yi Zhou, Hyun Soo Park

To edit human videos in a physically plausible way, a texture map must take into account not only the garment transformation induced by the body movements and clothes fitting, but also its 3D fine-grained surface geometry.

3D Reconstruction

Learning Visibility for Robust Dense Human Body Estimation

1 code implementation 23 Aug 2022 Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang

An alternative approach is to estimate dense vertices of a predefined template body in the image space.

Skeleton-free Pose Transfer for Stylized 3D Characters

1 code implementation 28 Jul 2022 Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou

We present the first method that automatically transfers poses between stylized 3D characters without skeletal rigging.

Pose Transfer

RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects

no code implementations 14 May 2022 Yunseok Jang, Ruben Villegas, Jimei Yang, Duygu Ceylan, Xin Sun, Honglak Lee

We test the effectiveness of our representation on the human image harmonization task by predicting shading that is coherent with a given background image.

Image Harmonization

Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera

no code implementations CVPR 2022 Jae Shin Yoon, Duygu Ceylan, Tuanfeng Y. Wang, Jingwan Lu, Jimei Yang, Zhixin Shu, Hyun Soo Park

The appearance of dressed humans undergoes a complex geometric transformation induced not only by the static pose but also by its dynamics, i.e., many cloth geometric configurations exist for a given pose depending on the way it has moved.

Contact-Aware Retargeting of Skinned Motion

no code implementations ICCV 2021 Ruben Villegas, Duygu Ceylan, Aaron Hertzmann, Jimei Yang, Jun Saito

Self-contacts, such as when hands touch each other or the torso or the head, are important attributes of human body language and dynamics, yet existing methods do not model or preserve these contacts.

Motion Estimation motion retargeting

Single-image Full-body Human Relighting

no code implementations 15 Jul 2021 Manuel Lagunas, Xin Sun, Jimei Yang, Ruben Villegas, Jianming Zhang, Zhixin Shu, Belen Masia, Diego Gutierrez

We present a single-image data-driven method to automatically relight images with full-body humans in them.

Image Reconstruction

Task-Generic Hierarchical Human Motion Prior using VAEs

no code implementations 7 Jun 2021 Jiaman Li, Ruben Villegas, Duygu Ceylan, Jimei Yang, Zhengfei Kuang, Hao Li, Yajie Zhao

We demonstrate the effectiveness of our hierarchical motion variational autoencoder in a variety of tasks including video-based human pose estimation, motion completion from partial observations, and motion synthesis from sparse key-frames.

Motion Synthesis Pose Estimation

Attribute-conditioned Layout GAN for Automatic Graphic Design

no code implementations 11 Sep 2020 Jianan Li, Jimei Yang, Jianming Zhang, Chang Liu, Christina Wang, Tingfa Xu

In this paper, we introduce Attribute-conditioned Layout GAN to incorporate the attributes of design elements for graphic layout generation by forcing both the generator and the discriminator to meet attribute conditions.

Attribute

Contact and Human Dynamics from Monocular Video

1 code implementation ECCV 2020 Davis Rempe, Leonidas J. Guibas, Aaron Hertzmann, Bryan Russell, Ruben Villegas, Jimei Yang

Existing deep models predict 2D and 3D kinematic poses from video that are approximately accurate, but contain visible errors that violate physical constraints, such as feet penetrating the ground and bodies leaning at extreme angles.

Human Dynamics Pose Estimation

Generative Tweening: Long-term Inbetweening of 3D Human Motions

no code implementations 18 May 2020 Yi Zhou, Jingwan Lu, Connelly Barnes, Jimei Yang, Sitao Xiang, Hao Li

We introduce a biomechanically constrained generative adversarial network that performs long-term inbetweening of human motions, conditioned on keyframe constraints.

Generative Adversarial Network

3D Ken Burns Effect from a Single Image

4 code implementations 12 Sep 2019 Simon Niklaus, Long Mai, Jimei Yang, Feng Liu

According to this depth estimate, our framework then maps the input image to a point cloud and synthesizes the resulting video frames by rendering the point cloud from the corresponding camera positions.

Depth Estimation Depth Prediction
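The depth-to-point-cloud mapping described in the snippet above can be sketched as a standard pinhole unprojection. This is a minimal NumPy sketch, not the paper's implementation; the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical parameters, and the actual framework's camera model and rendering pipeline are more involved:

```python
import numpy as np

def unproject_to_point_cloud(depth, fx, fy, cx, cy):
    """Lift a depth map (H x W) to a 3D point cloud (H*W x 3)
    using a pinhole camera model with assumed intrinsics."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) / fx * depth  # back-project pixel columns
    y = (v - cy) / fy * depth  # back-project pixel rows
    return np.stack([x, y, depth], axis=-1).reshape(-1, 3)
```

Rendering novel frames then amounts to reprojecting this point cloud from shifted camera positions, which is the core of the virtual camera motion effect.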

Multimodal Style Transfer via Graph Cuts

2 code implementations ICCV 2019 Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang

An assumption widely used in recent neural style transfer methods is that image styles can be described by global statistics of deep features, such as Gram or covariance matrices.

Style Transfer

Foreground-aware Image Inpainting

no code implementations CVPR 2019 Wei Xiong, Jiahui Yu, Zhe Lin, Jimei Yang, Xin Lu, Connelly Barnes, Jiebo Luo

We show that by such disentanglement, the contour completion model predicts reasonable contours of objects, and further substantially improves the performance of image inpainting.

Disentanglement Image Inpainting

On the Continuity of Rotation Representations in Neural Networks

5 code implementations CVPR 2019 Yi Zhou, Connelly Barnes, Jingwan Lu, Jimei Yang, Hao Li

Thus, widely used representations such as quaternions and Euler angles are discontinuous and difficult for neural networks to learn.
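The continuous alternative this paper proposes is a 6D representation that maps two 3D vectors to a rotation matrix via Gram-Schmidt orthogonalization. A minimal NumPy sketch of that mapping (the paper's experiments embed it in a network; this is just the representation itself):

```python
import numpy as np

def rotation_from_6d(d6):
    """Map a 6D vector to a 3x3 rotation matrix via Gram-Schmidt,
    following the continuous representation proposed in the paper."""
    a1, a2 = d6[:3], d6[3:]
    b1 = a1 / np.linalg.norm(a1)            # normalize first column
    a2 = a2 - np.dot(b1, a2) * b1           # remove component along b1
    b2 = a2 / np.linalg.norm(a2)            # normalize second column
    b3 = np.cross(b1, b2)                   # third column completes the frame
    return np.stack([b1, b2, b3], axis=-1)  # columns form a rotation matrix
```

Because this map is continuous everywhere, a network regressing the 6D vector avoids the discontinuities of quaternion or Euler-angle targets.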

Flow-Grounded Spatial-Temporal Video Prediction from Still Images

1 code implementation ECCV 2018 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Existing video prediction methods mainly rely on observing multiple historical frames or focus on predicting only the next frame.

Video Prediction

PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image

1 code implementation CVPR 2018 Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, Yasutaka Furukawa

The proposed end-to-end DNN learns to directly infer a set of plane parameters and corresponding plane segmentation masks from a single RGB image.

Depth Estimation Depth Prediction +1

Neural Kinematic Networks for Unsupervised Motion Retargetting

1 code implementation CVPR 2018 Ruben Villegas, Jimei Yang, Duygu Ceylan, Honglak Lee

We propose a recurrent neural network architecture with a Forward Kinematics layer and cycle consistency based adversarial training objective for unsupervised motion retargetting.

Generative Image Inpainting with Contextual Attention

28 code implementations CVPR 2018 Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang

Motivated by these observations, we propose a new deep generative model-based approach which can not only synthesize novel image structures but also explicitly utilize surrounding image features as references during network training to make better predictions.

Image Inpainting

Predicting Scene Parsing and Motion Dynamics in the Future

no code implementations NeurIPS 2017 Xiaojie Jin, Huaxin Xiao, Xiaohui Shen, Jimei Yang, Zhe Lin, Yunpeng Chen, Zequn Jie, Jiashi Feng, Shuicheng Yan

The ability to predict the future is important for intelligent systems, e.g., autonomous vehicles and robots, to plan early and make decisions accordingly.

Autonomous Vehicles motion prediction +2

FoveaNet: Perspective-aware Urban Scene Parsing

no code implementations ICCV 2017 Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng

Thus, they suffer from heterogeneous object scales caused by perspective projection of cameras on actual scenes and inevitably encounter parsing failures on distant objects as well as other boundary and recognition errors.

Scene Parsing

3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks

2 code implementations ICCV 2017 Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem

The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data.

Retrieval

Material Editing Using a Physically Based Rendering Network

no code implementations ICCV 2017 Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, Jyh-Ming Lien

We propose an end-to-end network architecture that replicates the forward image formation process to accomplish this task.

Image Generation

Deep GrabCut for Object Selection

no code implementations 2 Jul 2017 Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang

In this paper, we propose a novel segmentation approach that uses a rectangle as a soft constraint by transforming it into a Euclidean distance map.

Instance Segmentation Interactive Segmentation +3
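The rectangle-to-distance-map transformation mentioned in the snippet can be sketched with SciPy's Euclidean distance transform. This is one plausible construction (distance of each pixel to the rectangle's boundary); the paper's exact encoding may differ:

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def rect_to_distance_map(h, w, y0, x0, y1, x1):
    """Encode a user-drawn rectangle on an h x w image as a Euclidean
    distance map: each pixel stores its distance to the rectangle boundary.
    (Hypothetical encoding for illustration.)"""
    boundary = np.zeros((h, w), dtype=bool)
    boundary[y0, x0:x1 + 1] = boundary[y1, x0:x1 + 1] = True
    boundary[y0:y1 + 1, x0] = boundary[y0:y1 + 1, x1] = True
    # distance_transform_edt measures distance to the nearest zero pixel,
    # so invert the boundary mask
    return distance_transform_edt(~boundary)
```

Feeding such a map as an extra input channel lets the network treat the rectangle as a soft spatial prior rather than a hard crop.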

Decomposing Motion and Content for Natural Video Sequence Prediction

1 code implementation 25 Jun 2017 Ruben Villegas, Jimei Yang, Seunghoon Hong, Xunyu Lin, Honglak Lee

To the best of our knowledge, this is the first end-to-end trainable network architecture with motion and content separation to model the spatiotemporal dynamics for pixel-level future prediction in natural videos.

Ranked #1 on Video Prediction on KTH (Cond metric)

Future prediction Video Prediction

Universal Style Transfer via Feature Transforms

15 code implementations NeurIPS 2017 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

The whitening and coloring transforms reflect a direct matching of the feature covariance of the content image to that of a given style image, which shares a similar spirit with the optimization of the Gram-matrix-based cost in neural style transfer.

Image Reconstruction Style Transfer
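The covariance-matching idea behind the whitening and coloring transforms can be sketched directly on raw feature matrices. This minimal NumPy version matches feature covariance in closed form and omits the paper's encoder-decoder pipeline and multi-level stylization:

```python
import numpy as np

def wct(fc, fs, eps=1e-8):
    """Whitening and coloring transform: re-colors content features fc
    (C x N) so their covariance matches that of style features fs (C x M)."""
    mc, ms = fc.mean(1, keepdims=True), fs.mean(1, keepdims=True)
    fc, fs = fc - mc, fs - ms
    # whiten: eigendecompose the content covariance and invert its scale
    Dc, Ec = np.linalg.eigh(fc @ fc.T / (fc.shape[1] - 1))
    whitened = Ec @ np.diag((Dc + eps) ** -0.5) @ Ec.T @ fc
    # color: apply the style covariance's scale, then restore the style mean
    Ds, Es = np.linalg.eigh(fs @ fs.T / (fs.shape[1] - 1))
    return Es @ np.diag((Ds + eps) ** 0.5) @ Es.T @ whitened + ms
```

After the transform, the output features have (up to `eps`) the same covariance as the style features, which is exactly the "direct matching" the snippet describes.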

Generative Face Completion

2 code implementations CVPR 2017 Yijun Li, Sifei Liu, Jimei Yang, Ming-Hsuan Yang

In this paper, we propose an effective face completion algorithm using a deep generative model.

Facial Inpainting Semantic Parsing

Learning to Generate Long-term Future via Hierarchical Prediction

2 code implementations ICML 2017 Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee

To avoid the inherent compounding errors in recursive pixel-level prediction, we propose to first estimate high-level structure in the input frames, then predict how that structure evolves in the future, and finally construct the future frames from a single observed past frame and the predicted high-level structure, without having to observe any of the pixel-level predictions.

Video Prediction

Recurrent Multimodal Interaction for Referring Image Segmentation

1 code implementation ICCV 2017 Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille

In this paper we are interested in the problem of image segmentation given natural language descriptions, i.e., referring expressions.

Image Segmentation Segmentation +1

Transformation-Grounded Image Generation Network for Novel 3D View Synthesis

2 code implementations CVPR 2017 Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg

Instead of taking a 'blank slate' approach, we first explicitly infer the parts of the geometry visible both in the input and novel views and then re-cast the remaining synthesis problem as image completion.

Image Generation Novel View Synthesis

Diversified Texture Synthesis with Feed-forward Networks

no code implementations CVPR 2017 Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Recent progress on deep discriminative and generative modeling has shown promising results in texture synthesis.

Texture Synthesis

Video Scene Parsing with Predictive Feature Learning

no code implementations ICCV 2017 Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

In this way, the network can effectively learn to capture video dynamics and temporal context, which are critical clues for video scene parsing, without requiring extra manual annotations.

Representation Learning Scene Parsing

Perspective Transformer Nets: Learning Single-View 3D Object Reconstruction without 3D Supervision

2 code implementations NeurIPS 2016 Xinchen Yan, Jimei Yang, Ersin Yumer, Yijie Guo, Honglak Lee

We demonstrate the ability of the model in generating 3D volume from a single 2D image with three sets of experiments: (1) learning from single-class objects; (2) learning from multi-class objects and (3) testing on novel object classes.

3D Object Reconstruction Object

Object Tracking via Dual Linear Structured SVM and Explicit Feature Map

no code implementations CVPR 2016 Jifeng Ning, Jimei Yang, Shaojie Jiang, Lei Zhang, Ming-Hsuan Yang

Structured support vector machine (SSVM) based methods have demonstrated encouraging performance in recent object tracking benchmarks.

Object Tracking

Weakly-supervised Disentangling with Recurrent Transformations for 3D View Synthesis

no code implementations NeurIPS 2015 Jimei Yang, Scott Reed, Ming-Hsuan Yang, Honglak Lee

An important problem for both graphics and vision is to synthesize novel views of a 3D object from a single image.

Object

Multi-Objective Convolutional Learning for Face Labeling

no code implementations CVPR 2015 Sifei Liu, Jimei Yang, Chang Huang, Ming-Hsuan Yang

This paper formulates face labeling as a conditional random field with unary and pairwise classifiers.

PatchCut: Data-Driven Object Segmentation via Local Shape Transfer

no code implementations CVPR 2015 Jimei Yang, Brian Price, Scott Cohen, Zhe Lin, Ming-Hsuan Yang

The transferred local shape masks constitute a patch-level segmentation solution space and we thus develop a novel cascade algorithm, PatchCut, for coarse-to-fine object segmentation.

Object Object Discovery +2
