Search Results for author: Nanxuan Zhao

Found 21 papers, 7 papers with code

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

no code implementations 8 Apr 2024 Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang

Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.

Image Generation, Object

Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

no code implementations 1 Mar 2024 Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau

Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis.

Image Generation

AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising

no code implementations 5 Feb 2024 Maham Tanveer, Yizhi Wang, Ruiqi Wang, Nanxuan Zhao, Ali Mahdavi-Amiri, Hao Zhang

We present AnaMoDiff, a novel diffusion-based method for 2D motion analogies that is applied to raw, unannotated videos of articulated characters.

Denoising, Optical Flow Estimation

Localizing and Editing Knowledge in Text-to-Image Generative Models

no code implementations 20 Oct 2023 Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha

We adapt Causal Mediation Analysis for text-to-image models and trace knowledge about distinct visual attributes to various (causal) components in the (i) UNet and (ii) text-encoder of the diffusion model.

Attribute, Image Generation +1

Text-Guided Vector Graphics Customization

no code implementations 21 Sep 2023 Peiying Zhang, Nanxuan Zhao, Jing Liao

In this paper, we propose a novel pipeline that generates high-quality customized vector graphics based on textual prompts while preserving the properties and layer-wise information of a given exemplar SVG.

Vector Graphics

NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos

1 code implementation 23 Aug 2023 Ziyu Yang, Sucheng Ren, Zongwei Wu, Nanxuan Zhao, Junle Wang, Jing Qin, Shengfeng He

Non-photorealistic videos are in demand with the wave of the metaverse, but lack sufficient research studies.

Saliency Detection

Language-based Photo Color Adjustment for Graphic Designs

no code implementations 6 Aug 2023 Zhenwei Wang, Nanxuan Zhao, Gerhard Hancke, Rynson W. H. Lau

We also introduce an approach for generating a synthetic graphic design dataset with instructions to enable model training.

FashionTex: Controllable Virtual Try-on with Text and Texture

1 code implementation 8 May 2023 Anran Lin, Nanxuan Zhao, Shuliang Ning, Yuda Qiu, Baoyuan Wang, Xiaoguang Han

Virtual try-on attracts increasing research attention as a promising way to enhance the user experience of online clothes shopping.

Virtual Try-on

AssetField: Assets Mining and Reconfiguration in Ground Feature Plane Representation

no code implementations ICCV 2023 Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Bo Dai, Dahua Lin

Traditional modeling pipelines keep an asset library storing unique object templates, which is both versatile and memory efficient in practice.

Novel View Synthesis, Object

Grid-guided Neural Radiance Fields for Large Urban Scenes

no code implementations CVPR 2023 Linning Xu, Yuanbo Xiangli, Sida Peng, Xingang Pan, Nanxuan Zhao, Christian Theobalt, Bo Dai, Dahua Lin

An alternative solution is to use a feature grid representation, which is computationally efficient and can naturally scale to a large scene with increased grid resolutions.

Neural Preset for Color Style Transfer

1 code implementation CVPR 2023 Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W. H. Lau

In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed.

4k, Color Normalization +4

Bring Clipart to Life

1 code implementation ICCV 2023 Nanxuan Zhao, Shengqi Dang, Hexun Lin, Yang Shi, Nan Cao

The development of face editing has been boosted since the birth of StyleGAN.

UniColor: A Unified Framework for Multi-Modal Colorization with Transformer

no code implementations 22 Sep 2022 Zhitong Huang, Nanxuan Zhao, Jing Liao

In the first stage, multi-modal conditions are converted into a common representation of hint points.

Colorization

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

no code implementations 10 Dec 2021 Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin

The wide span of viewing positions within these scenes yields multi-scale renderings with very different levels of detail, which poses great challenges to neural radiance field and biases it towards compromised results.

Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation

no code implementations ICCV 2021 Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu

While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory.

3D Human Pose Estimation, 3D Pose Estimation +3

Unifying Global-Local Representations in Salient Object Detection with Transformer

1 code implementation 5 Aug 2021 Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, Shengfeng He

In this paper, we introduce a new attention-based encoder, vision transformer, into salient object detection to ensure the globalization of the representations from shallow to deep layers.

Object Detection +1
