Search Results for author: Peihao Zhu

Found 12 papers, 7 papers with code

BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation

1 code implementation 23 May 2023 Liyan Kang, Luyang Huang, Ningxin Peng, Peihao Zhu, Zewei Sun, Shanbo Cheng, Mingxuan Wang, Degen Huang, Jinsong Su

We also introduce two deliberately designed test sets to verify the necessity of visual information: Ambiguous with the presence of ambiguous words, and Unambiguous in which the text context is self-contained for translation.

Contrastive Learning, Multimodal Machine Translation

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars

no code implementations CVPR 2023 Rameen Abdal, Hsin-Ying Lee, Peihao Zhu, Menglei Chai, Aliaksandr Siarohin, Peter Wonka, Sergey Tulyakov

Finally, we propose a novel inversion method for 3D-GANs linking the latent spaces of the source and the target domains.

Video2StyleGAN: Disentangling Local and Global Variations in a Video

no code implementations 27 May 2022 Rameen Abdal, Peihao Zhu, Niloy J. Mitra, Peter Wonka

Image editing using a pretrained StyleGAN generator has emerged as a powerful paradigm for facial editing, providing disentangled controls over age, expression, illumination, etc.

Facial Editing

CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions

no code implementations 9 Dec 2021 Rameen Abdal, Peihao Zhu, John Femiani, Niloy J. Mitra, Peter Wonka

The success of StyleGAN has enabled unprecedented semantic editing capabilities, on both synthesized and real images.

Zero-Shot Learning

Barbershop: GAN-based Image Compositing using Segmentation Masks

1 code implementation 2 Jun 2021 Peihao Zhu, Rameen Abdal, John Femiani, Peter Wonka

Seamlessly blending features from multiple images is extremely challenging because of complex relationships in lighting, geometry, and partial occlusion which cause coupling between different parts of the image.

Improved StyleGAN Embedding: Where are the Good Latents?

3 code implementations 13 Dec 2020 Peihao Zhu, Rameen Abdal, Yipeng Qin, John Femiani, Peter Wonka

First, we introduce a new normalized space to analyze the diversity and the quality of the reconstructed latent codes.

Channel-Directed Gradients for Optimization of Convolutional Neural Networks

no code implementations 25 Aug 2020 Dong Lao, Peihao Zhu, Peter Wonka, Ganesh Sundaramoorthi

We introduce optimization methods for convolutional neural networks that can be used to improve existing gradient-based optimization in terms of generalization error.

StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows

3 code implementations 6 Aug 2020 Rameen Abdal, Peihao Zhu, Niloy Mitra, Peter Wonka

We evaluate our method using the face and the car latent space of StyleGAN, and demonstrate fine-grained disentangled edits along various attributes on both real photographs and StyleGAN generated images.

Attribute

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization

1 code implementation CVPR 2020 Peihao Zhu, Rameen Abdal, Yipeng Qin, Peter Wonka

Using SEAN normalization, we can build a network architecture that can control the style of each semantic region individually, e.g., we can specify one style reference image per region.

Image Generation, Segmentation
