no code implementations • 27 May 2024 • Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao
We propose EM Distillation (EMD), a maximum likelihood-based approach that distills a diffusion model to a one-step generator model with minimal loss of perceptual quality.
no code implementations • 5 Dec 2023 • Shaoan Xie, Yang Zhao, Zhisheng Xiao, Kelvin C. K. Chan, Yandong Li, Yanwu Xu, Kun Zhang, Tingbo Hou
Our extensive experiments demonstrate the superior performance of our method in terms of visual quality, identity preservation, and text control, showcasing its effectiveness in the context of text-guided subject-driven image inpainting.
no code implementations • 30 Nov 2023 • Zhonghao Wang, Wei Wei, Yang Zhao, Zhisheng Xiao, Mark Hasegawa-Johnson, Humphrey Shi, Tingbo Hou
We further extend our method to a novel image editing task: substituting the subject in an image through textual manipulations.
no code implementations • 28 Nov 2023 • Yang Zhao, Yanwu Xu, Zhisheng Xiao, Tingbo Hou
The deployment of large-scale text-to-image diffusion models on mobile devices is impeded by their substantial model size and slow inference speed.
no code implementations • 14 Nov 2023 • Yanwu Xu, Yang Zhao, Zhisheng Xiao, Tingbo Hou
Text-to-image diffusion models have demonstrated remarkable capabilities in transforming textual prompts into coherent images, yet the computational cost of their inference remains a persistent challenge.
no code implementations • 19 Sep 2022 • Zhisheng Xiao, Tian Han
Instead, we propose to use noise contrastive estimation (NCE) to discriminatively learn the EBM through density ratio estimation between the latent prior density and latent posterior density.
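The NCE idea in this entry can be illustrated with a toy sketch: learn a log density ratio by logistic regression between two sets of samples. The 1-D Gaussians below are invented stand-ins for latent posterior and latent prior samples, and the linear ratio model is an assumption (exact only for this Gaussian toy case), not the paper's EBM:

```python
import numpy as np

# Toy 1-D sketch of noise contrastive estimation (NCE): learn the log density
# ratio log p_post(z) - log p_prior(z) by logistic regression between samples
# labeled "real" (posterior) and "noise" (prior). Both Gaussians are invented
# stand-ins, not a trained model's latents.
rng = np.random.default_rng(0)
z_post = rng.normal(loc=1.0, scale=1.0, size=5000)   # stand-in posterior samples
z_prior = rng.normal(loc=0.0, scale=1.0, size=5000)  # stand-in prior (noise) samples

# Model the log ratio as r(z) = a*z + b, which is exact for two unit-variance
# Gaussians; the analytic answer here is r(z) = z - 0.5.
a, b = 0.0, 0.0
lr = 0.1
for _ in range(2000):
    def grads(z, y):
        p = 1.0 / (1.0 + np.exp(-(a * z + b)))  # P(sample labeled "real")
        return ((p - y) * z).mean(), (p - y).mean()
    ga1, gb1 = grads(z_post, 1.0)
    ga0, gb0 = grads(z_prior, 0.0)
    a -= lr * (ga1 + ga0) / 2
    b -= lr * (gb1 + gb0) / 2
```

The classifier's logit converges to the log ratio, which is the quantity an EBM prior needs, without ever sampling from the EBM itself.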
5 code implementations • ICLR 2022 • Zhisheng Xiao, Karsten Kreis, Arash Vahdat
To the best of our knowledge, denoising diffusion GAN is the first model that reduces sampling cost in diffusion models to an extent that allows them to be applied to real-world applications inexpensively.
Ranked #9 on Image Generation on CelebA-HQ 256x256
no code implementations • 19 May 2021 • Zhisheng Xiao, Qing Yan, Yali Amit
Unsupervised outlier detection, which predicts if a test sample is an outlier or not using only the information from unlabelled inlier data, is an important but challenging task.
no code implementations • ICLR Workshop EBM 2021 • Zhisheng Xiao, Qing Yan, Yali Amit
Doing so allows us to study the density induced by the dynamics (provided the dynamics are invertible) and to connect with GANs: the dynamics act as a generator model, the initial values as latent variables, and the loss optimizes a critic defined by the very same energy that determines the generator through its gradient.
4 code implementations • 31 Oct 2020 • Huajie Shao, Zhisheng Xiao, Shuochao Yao, Aston Zhang, Shengzhong Liu, Tarek Abdelzaher
ControlVAE is a new variational autoencoder (VAE) framework that combines automatic control theory with the basic VAE to stabilize the KL divergence of VAE models at a specified value.
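The control-loop idea can be sketched as a toy PI controller: treat the observed KL as the process variable and the KL weight as the control signal. The plant response `kl = 10 / (1 + beta)` and the gains below are invented stand-ins for a real VAE's behavior, not the paper's controller:

```python
# Toy PI-control sketch of the ControlVAE idea. The "plant" kl = 10/(1+beta)
# is an invented stand-in for how a real VAE's KL term responds to its weight
# beta; the gains kp, ki are likewise made up for illustration.
def simulate(kl_target=3.0, steps=500, kp=0.05, ki=0.01):
    beta, integral = 1.0, 0.0
    kl = 10.0 / (1.0 + beta)
    for _ in range(steps):
        error = kl - kl_target          # KL too high -> positive error -> raise beta
        integral += error
        beta = max(0.0, beta + kp * error + ki * integral)
        kl = 10.0 / (1.0 + beta)        # plant responds to the new weight
    return kl, beta

kl, beta = simulate()
```

The loop drives the KL toward the setpoint of 3.0; in the real framework the same feedback replaces hand-tuned KL-annealing schedules.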
1 code implementation • ICLR 2021 • Zhisheng Xiao, Karsten Kreis, Jan Kautz, Arash Vahdat
VAEBM captures the overall mode structure of the data distribution with a state-of-the-art VAE, and relies on its EBM component to explicitly exclude non-data-like regions from the model and refine the image samples.
Ranked #1 on Image Generation on Stacked MNIST
no code implementations • 15 Jun 2020 • Zhisheng Xiao, Qing Yan, Yali Amit
In this paper, we present a general method that can improve the sample quality of pre-trained likelihood-based generative models.
2 code implementations • NeurIPS 2020 • Zhisheng Xiao, Qing Yan, Yali Amit
An important application of generative modeling should be the ability to detect out-of-distribution (OOD) samples by setting a threshold on the likelihood.
Out-of-Distribution (OOD) Detection
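The naive likelihood-threshold baseline this abstract alludes to can be sketched as follows; the Gaussian density model and the sample distributions are invented for illustration and are not the paper's method:

```python
import numpy as np

# Sketch of likelihood-thresholded OOD detection: fit a Gaussian density to
# in-distribution data, then flag any test sample whose log-likelihood falls
# below a threshold chosen on the training data. All data here is synthetic.
rng = np.random.default_rng(1)
inliers = rng.normal(0.0, 1.0, size=(10000, 2))

mu = inliers.mean(axis=0)
cov = np.cov(inliers, rowvar=False)
cov_inv = np.linalg.inv(cov)
logdet = np.linalg.slogdet(cov)[1]

def loglik(x):
    # Gaussian log-density, batched over the leading axes of x
    d = x - mu
    maha = np.einsum('...i,ij,...j->...', d, cov_inv, d)
    return -0.5 * (maha + logdet + 2 * np.log(2 * np.pi))

tau = np.percentile(loglik(inliers), 5)     # accept ~95% of in-distribution data

ood = rng.normal(6.0, 1.0, size=(1000, 2))  # clearly out-of-distribution samples
flag_rate = (loglik(ood) < tau).mean()
```

The baseline works on this toy problem, but the entry's point is that deep generative models can assign *higher* likelihood to OOD inputs than to inliers, which is what motivates likelihood-regret-style corrections.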
no code implementations • 5 Nov 2019 • Zhisheng Xiao, Qing Yan, Yali Amit
In particular, we use our proposed method to analyze inverse problems with invertible neural networks by maximizing the posterior likelihood.
no code implementations • 10 Oct 2019 • Peijun Xiao, Zhisheng Xiao, Ruoyu Sun
Recently, Coordinate Descent (CD) with a cyclic order was shown to be $O(n^2)$ times slower than its randomized counterpart in the worst case.
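Plain coordinate descent with cyclic versus randomized orders can be sketched on a quadratic. The random instance below is invented for illustration; the worst-case $O(n^2)$ gap only appears on specially constructed problems, which this is not:

```python
import numpy as np

# Minimal coordinate descent (CD) sketch on a strongly convex quadratic
# f(x) = 0.5 x^T A x - b^T x, with exact minimization along each coordinate.
# Both orders converge quickly here; the worst-case O(n^2) separation between
# cyclic and randomized CD requires adversarially constructed matrices.
rng = np.random.default_rng(0)
n = 20
M = rng.normal(size=(n, n))
A = M @ M.T + n * np.eye(n)      # symmetric positive definite
b = rng.normal(size=n)
x_star = np.linalg.solve(A, b)   # exact minimizer, for reference

def cd(order_fn, sweeps=500):
    x = np.zeros(n)
    for t in range(sweeps):
        for i in order_fn(t):
            # exact line minimization along coordinate i:
            # x[i] = (b[i] - sum_{j != i} A[i, j] x[j]) / A[i, i]
            x[i] = (b[i] - A[i] @ x + A[i, i] * x[i]) / A[i, i]
    return x

x_cyclic = cd(lambda t: range(n))                  # fixed cyclic order
x_random = cd(lambda t: rng.integers(0, n, size=n))  # sampled with replacement
```

Swapping `order_fn` is the only difference between the two variants, which is what makes the worst-case ordering question well posed.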
1 code implementation • 24 May 2019 • Zhisheng Xiao, Qing Yan, Yali Amit
In this work, we propose the Generative Latent Flow (GLF), an algorithm for generative modeling of the data distribution.
Ranked #1 on Image Generation on Fashion-MNIST