Search Results for author: Xian Liu

Found 24 papers, 10 papers with code

TextCraftor: Your Text Encoder Can be Image Quality Controller

no code implementations • 27 Mar 2024 • Yanyu Li, Xian Liu, Anil Kag, Ju Hu, Yerlan Idelbayev, Dhritiman Sagar, Yanzhi Wang, Sergey Tulyakov, Jian Ren

Our findings reveal that, instead of replacing the CLIP text encoder used in Stable Diffusion with other large language models, we can enhance it through our proposed fine-tuning approach, TextCraftor, leading to substantial improvements in quantitative benchmarks and human assessments.

Image Generation

TC4D: Trajectory-Conditioned Text-to-4D Generation

no code implementations • 26 Mar 2024 • Sherwin Bahmani, Xian Liu, Yifan Wang, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B. Lindell

We learn local deformations that conform to the global trajectory using supervision from a text-to-video model.

Scene Generation Video Generation

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

2 code implementations • 11 Mar 2024 • Xuan Ju, Xian Liu, Xintao Wang, Yuxuan Bian, Ying Shan, Qiang Xu

Image inpainting, the process of restoring corrupted images, has seen significant advancements with the advent of diffusion models (DMs).

Image Inpainting

825

E$^{2}$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation

no code implementations • 11 Jan 2024 • Yifan Gong, Zheng Zhan, Qing Jin, Yanyu Li, Yerlan Idelbayev, Xian Liu, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren

One highly promising direction for enabling flexible real-time on-device image editing is utilizing data distillation by leveraging large-scale text-to-image diffusion models, such as Stable Diffusion, to generate paired datasets used for training generative adversarial networks (GANs).

Image-to-Image Translation

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting

no code implementations • 28 Nov 2023 • Xian Liu, Xiaohang Zhan, Jiaxiang Tang, Ying Shan, Gang Zeng, Dahua Lin, Xihui Liu, Ziwei Liu

In this paper, we propose an efficient yet effective framework, HumanGaussian, that generates high-quality 3D humans with fine-grained geometry and realistic appearance.

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion

no code implementations • 12 Oct 2023 • Xian Liu, Jian Ren, Aliaksandr Siarohin, Ivan Skorokhodov, Yanyu Li, Dahua Lin, Xihui Liu, Ziwei Liu, Sergey Tulyakov

Our model enforces the joint learning of image appearance, spatial relationship, and geometry in a unified network, where each branch in the model complements the others with both structural awareness and textural richness.

Image Generation

Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos

1 code implementation • ICCV 2023 • Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin

In the second stage, for each semantics, we randomly sample slots from the corresponding Gaussian distribution and perform masked feature aggregation within the semantic area to exploit temporal correspondence patterns for instance identification.

Object Object Discovery +1

Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis

no code implementations • 19 Jul 2023 • Lingting Zhu, Zeyue Xue, Zhenchao Jin, Xian Liu, Jingzhen He, Ziwei Liu, Lequan Yu

This paradigm extends the 2D image diffusion model to a volumetric version with a slightly increasing number of parameters and computation, offering a principled solution for generic cross-modality 3D medical image synthesis.

Computational Efficiency Image Generation

MonoHuman: Animatable Human Neural Field from Monocular Video

1 code implementation • CVPR 2023 • Zhengming Yu, Wei Cheng, Xian Liu, Wayne Wu, Kwan-Yee Lin

Recent works propose to graft a deformation network into the NeRF to further model the dynamics of the human neural field for animating vivid human motions.

121

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

1 code implementation • CVPR 2023 • Lingting Zhu, Xian Liu, Xuanyu Liu, Rui Qian, Ziwei Liu, Lequan Yu

In this work, we propose a novel diffusion-based framework, named Diffusion Co-Speech Gesture (DiffGesture), to effectively capture the cross-modal audio-to-gesture associations and preserve temporal coherence for high-fidelity audio-driven co-speech gesture generation.

Gesture Generation

210

Explicit and Implicit Knowledge Distillation via Unlabeled Data

no code implementations • 17 Feb 2023 • Yuzheng Wang, Zuhao Ge, Zhaoyu Chen, Xian Liu, Chuangjia Ma, Yunquan Sun, Lizhe Qi

Data-free knowledge distillation is a challenging model lightweight task for scenarios in which the original dataset is not available.

Data-free Knowledge Distillation

Audio-Driven Co-Speech Gesture Video Generation

no code implementations • 5 Dec 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yuanqi Du, Wayne Wu, Dahua Lin, Ziwei Liu

Our key insight is that the co-speech gestures can be decomposed into common motion patterns and subtle rhythmic dynamics.

Video Generation

Static and Dynamic Concepts for Self-supervised Video Representation Learning

1 code implementation • 26 Jul 2022 • Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin

In this paper, we propose a novel learning scheme for self-supervised video representation learning.

Representation Learning Video Understanding

Object-Compositional Neural Implicit Surfaces

1 code implementation • 20 Jul 2022 • Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng

This paper proposes a novel framework, ObjectSDF, to build an object-compositional neural implicit representation with high fidelity in 3D reconstruction and object representation.

3D Reconstruction Novel View Synthesis +1

181

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

1 code implementation • CVPR 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou

To enhance the quality of synthesized gestures, we develop a contrastive learning strategy based on audio-text alignment for better audio representations.

Ranked #3 on Gesture Generation on TED Gesture Dataset

Contrastive Learning Gesture Generation

116

Resource allocation for reconfigurable intelligent surface aided broadcast channels

no code implementations • 14 Feb 2022 • Cong Sun, Xian Liu, Bile Peng, Eduard Jorswieck

A two-user downlink network aided by a reconfigurable intelligent surface is considered.

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing

1 code implementation • 13 Feb 2022 • Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou

Specifically, we observe that the previous practice of learning only a single audio representation is insufficient due to the additive nature of audio signals.

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

no code implementations • 19 Jan 2022 • Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou

Moreover, to enable portrait rendering in one unified neural radiance field, a Torso Deformation module is designed to stabilize the large-scale non-rigid torso motions.

A Two-Stage Stochastic Programming Model for Blood Supply Chain Management, Considering Facility Disruption and Service Level

no code implementations • 2 Nov 2021 • Mohammad Arani, Mohsen Momenitabar, Zhila Dehdari Ebrahimi, Xian Liu

In this paper, a blood supply chain network, where the occurrence of disruption might interrupt the flow of Red Blood Cells, is dealt with.

Management

Interpreting Molecule Generative Models for Interactive Molecule Discovery

no code implementations • 29 Sep 2021 • Yuanqi Du, Xian Liu, Shengchao Liu, Bolei Zhou

In this work, we develop a simple yet effective method to interpret the latent space of the learned generative models with various molecular properties for more interactive molecule generation and discovery.

Drug Discovery

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

1 code implementation • ICCV 2021 • Rui Qian, Yuxi Li, Huabin Liu, John See, Shuangrui Ding, Xian Liu, Dian Li, Weiyao Lin

The crux of self-supervised video representation learning is to build general features from unlabeled videos.

Contrastive Learning Representation Learning +1

A Simulation-Optimization Technique for Service Level Analysis in Conjunction with Reorder Point Estimation and Lead-Time consideration: A Case Study in Sea Port

no code implementations • 28 May 2021 • Mohammad Arani, Saeed Abdolmaleki, Maryam Maleki, Mohsen Momenitabar, Xian Liu

This study offers a step-by-step practical procedure from the analysis of the current status of the spare parts inventory system to advanced service-level analysis by virtue of simulation-optimization technique for a real-world case study associated with a seaport.

Motion Capture from Internet Videos

2 code implementations • ECCV 2020 • Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao

Therefore, we propose to capture human motion by jointly analyzing these Internet videos instead of using single videos separately.

Pose Estimation

3,289

Optimizing the Total Production and Maintenance Cost of an Integrated Multi-Product Process and Maintenance Planning (IPPMP) Model

no code implementations • 2 Mar 2020 • Mohammad Arani, Mousaalreza Dastmard, Zhila Dehdari Ebrahimi, Mohsen Momenitabar, Xian Liu

Furthermore, a rational presumption is reflected in the problem statement in which the time and cost of PM are pertinent to the interval between the prior perfect repair and current PM.
