Search Results for author: Yizhe Zhu

Found 28 papers, 9 papers with code

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

no code implementations • 8 Apr 2024 • Kunpeng Song, Yizhe Zhu, Bingchen Liu, Qing Yan, Ahmed Elgammal, Xiao Yang

This approach effectively synergizes reference image and text prompt information to produce valuable image features, facilitating an image diffusion model.

Image-to-Image Translation Language Modelling +1

Paper
Add Code

Online Differentially Private Synthetic Data Generation

no code implementations • 12 Feb 2024 • Yiyun He, Roman Vershynin, Yizhe Zhu

We present a polynomial-time algorithm for online differentially private synthetic data generation.

Synthetic Data Generation

Paper
Add Code

MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

2 code implementations • 18 Nov 2023 • Di Chang, Yichun Shi, Quankai Gao, Jessica Fu, Hongyi Xu, Guoxian Song, Qing Yan, Yizhe Zhu, Xiao Yang, Mohammad Soleymani

In this work, we propose MagicPose, a diffusion-based model for 2D human pose and facial expression retargeting.

Video Generation

901

Paper
Code

Spectral gap-based deterministic tensor completion

no code implementations • 9 Jun 2023 • Kameron Decker Harris, Oscar López, Angus Read, Yizhe Zhu

However, numerical experiments illustrate the dependence of the reconstruction error on the spectral gap for the practical max-quasinorm, ridge penalty, and Poisson loss minimization algorithms.

Recommendation Systems

Paper
Add Code

Differentially Private Low-dimensional Synthetic Data from High-dimensional Datasets

no code implementations • 26 May 2023 • Yiyun He, Thomas Strohmer, Roman Vershynin, Yizhe Zhu

Differentially private synthetic data provide a powerful mechanism to enable data analysis while protecting sensitive information about individuals.

Paper
Add Code

Shifted Diffusion for Text-to-image Generation

1 code implementation • CVPR 2023 • Yufan Zhou, Bingchen Liu, Yizhe Zhu, Xiao Yang, Changyou Chen, Jinhui Xu

Unlike the baseline diffusion model used in DALL-E 2, our method seamlessly encodes prior knowledge of the pre-trained CLIP model in its diffusion process by designing a new initialization distribution and a new transition step of the diffusion.

Ranked #3 on Text-to-Image Generation on Multi-Modal-CelebA-HQ

Zero-Shot Text-to-Image Generation

154

Paper
Code

MagicVideo: Efficient Video Generation With Latent Diffusion Models

no code implementations • 20 Nov 2022 • Daquan Zhou, Weimin WANG, Hanshu Yan, Weiwei Lv, Yizhe Zhu, Jiashi Feng

In specific, unlike existing works that directly train video models in the RGB space, we use a pre-trained VAE to map video clips into a low-dimensional latent space and learn the distribution of videos' latent codes via a diffusion model.

Ranked #10 on Text-to-Video Generation on MSR-VTT

Text-to-Video Generation Video Generation

Paper
Add Code

Overparameterized random feature regression with nearly orthogonal data

no code implementations • 11 Nov 2022 • Zhichao Wang, Yizhe Zhu

Our analysis shows high-probability non-asymptotic concentration results for the training errors, cross-validations, and generalization errors of RFRR centered around their respective values for a kernel ridge regression (KRR).

regression

Paper
Add Code

Sparse random hypergraphs: Non-backtracking spectra and community detection

1 code implementation • 14 Mar 2022 • Ludovic Stephan, Yizhe Zhu

We consider the community detection problem in a sparse $q$-uniform hypergraph $G$, assuming that $G$ is generated according to the Hypergraph Stochastic Block Model (HSBM).

Community Detection Dimensionality Reduction +1

Paper
Code

Partial recovery and weak consistency in the non-uniform hypergraph Stochastic Block Model

no code implementations • 22 Dec 2021 • Ioana Dumitriu, Haixiao Wang, Yizhe Zhu

When the random hypergraph has bounded expected degrees, we provide a spectral algorithm that outputs a partition with at least a $\gamma$ fraction of the vertices classified correctly, where $\gamma\in (0. 5, 1)$ depends on the signal-to-noise ratio (SNR) of the model.

Community Detection Stochastic Block Model

Paper
Add Code

PIVQGAN: Posture and Identity Disentangled Image-to-Image Translation via Vector Quantization

no code implementations • 29 Sep 2021 • Bingchen Liu, Yizhe Zhu, Xiao Yang, Ahmed Elgammal

The VQSN module facilitates a more delicate separation of posture and identity, while the training scheme ensures the VQSN module learns the pose-related representations.

Disentanglement Image-to-Image Translation +2

Paper
Add Code

Deformed semicircle law and concentration of nonlinear random matrices for ultra-wide neural networks

no code implementations • 20 Sep 2021 • Zhichao Wang, Yizhe Zhu

As an application, we show that random feature regression induced by the empirical kernel achieves the same asymptotic performance as its limiting kernel regression under the ultra-wide regime.

regression

Paper
Add Code

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

7 code implementations • ICLR 2021 • Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

Training Generative Adversarial Networks (GAN) on high-fidelity images usually requires large-scale GPU-clusters and a vast number of training images.

Ranked #2 on Image Generation on ADE-Indoor

Image Generation

1,604

Paper
Code

Self-Supervised Sketch-to-Image Synthesis

1 code implementation • 16 Dec 2020 • Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

Moreover, with the proposed sketch generator, the model shows a promising performance on style mixing and style transfer, which require synthesized images to be both style-consistent and semantically meaningful.

Image Generation Self-Supervised Learning +1

Paper
Code

Global eigenvalue fluctuations of random biregular bipartite graphs

no code implementations • 26 Aug 2020 • Ioana Dumitriu, Yizhe Zhu

We compute the eigenvalue fluctuations of uniformly distributed random biregular bipartite graphs with fixed and growing degrees for a large class of analytic functions.

Probability Combinatorics

Paper
Add Code

TIME: Text and Image Mutual-Translation Adversarial Networks

no code implementations • 27 May 2020 • Bingchen Liu, Kunpeng Song, Yizhe Zhu, Gerard de Melo, Ahmed Elgammal

Focusing on text-to-image (T2I) generation, we propose Text and Image Mutual-Translation Adversarial Networks (TIME), a lightweight but effective model that jointly learns a T2I generator G and an image captioning discriminator D under the Generative Adversarial Network framework.

Generative Adversarial Network Image Captioning +3

Paper
Add Code

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

no code implementations • CVPR 2020 • Yizhe Zhu, Martin Renqiang Min, Asim Kadav, Hans Peter Graf

We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e. g., videos and audios) under self-supervision.

Disentanglement

Paper
Add Code

Federated Adversarial Domain Adaptation

no code implementations • ICLR 2020 • Xingchao Peng, Zijun Huang, Yizhe Zhu, Kate Saenko

In this work, we present a principled approach to the problem of federated domain adaptation, which aims to align the representations learned among the different nodes with the data distribution of the target node.

Disentanglement Domain Adaptation +4

Paper
Add Code

Deterministic tensor completion with hypergraph expanders

2 code implementations • 23 Oct 2019 • Kameron Decker Harris, Yizhe Zhu

We provide a novel analysis of low-rank tensor completion based on hypergraph expanders.

LEMMA

Paper
Code

OOGAN: Disentangling GAN with One-Hot Sampling and Orthogonal Regularization

1 code implementation • 26 May 2019 • Bingchen Liu, Yizhe Zhu, Zuohui Fu, Gerard de Melo, Ahmed Elgammal

Exploring the potential of GANs for unsupervised disentanglement learning, this paper proposes a novel GAN-based disentanglement framework with One-Hot Sampling and Orthogonal Regularization (OOGAN).

Disentanglement

Paper
Code

Learning Feature-to-Feature Translator by Alternating Back-Propagation for Generative Zero-Shot Learning

1 code implementation • ICCV 2019 • Yizhe Zhu, Jianwen Xie, Bingchen Liu, Ahmed Elgammal

We investigate learning feature-to-feature translator networks by alternating back-propagation as a general-purpose solution to zero-shot learning (ZSL) problems.

Zero-Shot Learning

Paper
Code

Community detection in the sparse hypergraph stochastic block model

no code implementations • 11 Apr 2019 • Soumik Pal, Yizhe Zhu

We consider the community detection problem in sparse random hypergraphs.

Community Detection Stochastic Block Model

Paper
Add Code

Semantic-Guided Multi-Attention Localization for Zero-Shot Learning

no code implementations • NeurIPS 2019 • Yizhe Zhu, Jianwen Xie, Zhiqiang Tang, Xi Peng, Ahmed Elgammal

Zero-shot learning extends the conventional object classification to the unseen class recognition by introducing semantic representations of classes.

Zero-Shot Learning

Paper
Add Code

Exact Recovery in the Hypergraph Stochastic Block Model: a Spectral Algorithm

no code implementations • 16 Nov 2018 • Sam Cole, Yizhe Zhu

We consider the exact recovery problem in the hypergraph stochastic block model (HSBM) with $k$ blocks of equal size.

Stochastic Block Model

Paper
Add Code

CU-Net: Coupled U-Nets

1 code implementation • 20 Aug 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Yizhe Zhu, Dimitris N. Metaxas

We design a new connectivity pattern for the U-Net architecture.

Ranked #30 on Pose Estimation on MPII Human Pose

Pose Estimation

225

Paper
Code

A Generative Adversarial Approach for Zero-Shot Learning from Noisy Texts

no code implementations • CVPR 2018 • Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal

Most existing zero-shot learning methods consider the problem as a visual semantic embedding one.

Zero-Shot Learning

Paper
Add Code

Link the head to the "beak": Zero Shot Learning from Noisy Text Description at Part Precision

no code implementations • CVPR 2017 • Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal

We propose a learning framework that is able to connect text terms to its relevant parts and suppress connections to non-visual text terms without any part-text annotations.

Zero-Shot Learning

Paper
Add Code

A Multilayer-Based Framework for Online Background Subtraction with Freely Moving Cameras

no code implementations • ICCV 2017 • Yizhe Zhu, Ahmed Elgammal

The exponentially increasing use of moving platforms for video capture introduces the urgent need to develop the general background subtraction algorithms with the capability to deal with the moving background.

Segmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.