Search Results for author: Yizhe Zhu

Found 28 papers, 9 papers with code

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

no code implementations8 Apr 2024 Kunpeng Song, Yizhe Zhu, Bingchen Liu, Qing Yan, Ahmed Elgammal, Xiao Yang

This approach effectively synergizes reference image and text prompt information to produce valuable image features, facilitating an image diffusion model.

Image-to-Image Translation Language Modelling +1

Online Differentially Private Synthetic Data Generation

no code implementations12 Feb 2024 Yiyun He, Roman Vershynin, Yizhe Zhu

We present a polynomial-time algorithm for online differentially private synthetic data generation.

Synthetic Data Generation

Spectral gap-based deterministic tensor completion

no code implementations9 Jun 2023 Kameron Decker Harris, Oscar López, Angus Read, Yizhe Zhu

However, numerical experiments illustrate the dependence of the reconstruction error on the spectral gap for the practical max-quasinorm, ridge penalty, and Poisson loss minimization algorithms.

Recommendation Systems

Differentially Private Low-dimensional Synthetic Data from High-dimensional Datasets

no code implementations26 May 2023 Yiyun He, Thomas Strohmer, Roman Vershynin, Yizhe Zhu

Differentially private synthetic data provide a powerful mechanism to enable data analysis while protecting sensitive information about individuals.

Shifted Diffusion for Text-to-image Generation

1 code implementation CVPR 2023 Yufan Zhou, Bingchen Liu, Yizhe Zhu, Xiao Yang, Changyou Chen, Jinhui Xu

Unlike the baseline diffusion model used in DALL-E 2, our method seamlessly encodes prior knowledge of the pre-trained CLIP model in its diffusion process by designing a new initialization distribution and a new transition step of the diffusion.

Zero-Shot Text-to-Image Generation

MagicVideo: Efficient Video Generation With Latent Diffusion Models

no code implementations20 Nov 2022 Daquan Zhou, Weimin WANG, Hanshu Yan, Weiwei Lv, Yizhe Zhu, Jiashi Feng

In specific, unlike existing works that directly train video models in the RGB space, we use a pre-trained VAE to map video clips into a low-dimensional latent space and learn the distribution of videos' latent codes via a diffusion model.

Text-to-Video Generation Video Generation

Overparameterized random feature regression with nearly orthogonal data

no code implementations11 Nov 2022 Zhichao Wang, Yizhe Zhu

Our analysis shows high-probability non-asymptotic concentration results for the training errors, cross-validations, and generalization errors of RFRR centered around their respective values for a kernel ridge regression (KRR).

regression

Sparse random hypergraphs: Non-backtracking spectra and community detection

1 code implementation14 Mar 2022 Ludovic Stephan, Yizhe Zhu

We consider the community detection problem in a sparse $q$-uniform hypergraph $G$, assuming that $G$ is generated according to the Hypergraph Stochastic Block Model (HSBM).

Community Detection Dimensionality Reduction +1

Partial recovery and weak consistency in the non-uniform hypergraph Stochastic Block Model

no code implementations22 Dec 2021 Ioana Dumitriu, Haixiao Wang, Yizhe Zhu

When the random hypergraph has bounded expected degrees, we provide a spectral algorithm that outputs a partition with at least a $\gamma$ fraction of the vertices classified correctly, where $\gamma\in (0. 5, 1)$ depends on the signal-to-noise ratio (SNR) of the model.

Community Detection Stochastic Block Model

PIVQGAN: Posture and Identity Disentangled Image-to-Image Translation via Vector Quantization

no code implementations29 Sep 2021 Bingchen Liu, Yizhe Zhu, Xiao Yang, Ahmed Elgammal

The VQSN module facilitates a more delicate separation of posture and identity, while the training scheme ensures the VQSN module learns the pose-related representations.

Disentanglement Image-to-Image Translation +2

Deformed semicircle law and concentration of nonlinear random matrices for ultra-wide neural networks

no code implementations20 Sep 2021 Zhichao Wang, Yizhe Zhu

As an application, we show that random feature regression induced by the empirical kernel achieves the same asymptotic performance as its limiting kernel regression under the ultra-wide regime.

regression

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

7 code implementations ICLR 2021 Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

Training Generative Adversarial Networks (GAN) on high-fidelity images usually requires large-scale GPU-clusters and a vast number of training images.

Image Generation

Self-Supervised Sketch-to-Image Synthesis

1 code implementation16 Dec 2020 Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

Moreover, with the proposed sketch generator, the model shows a promising performance on style mixing and style transfer, which require synthesized images to be both style-consistent and semantically meaningful.

Image Generation Self-Supervised Learning +1

Global eigenvalue fluctuations of random biregular bipartite graphs

no code implementations26 Aug 2020 Ioana Dumitriu, Yizhe Zhu

We compute the eigenvalue fluctuations of uniformly distributed random biregular bipartite graphs with fixed and growing degrees for a large class of analytic functions.

Probability Combinatorics

TIME: Text and Image Mutual-Translation Adversarial Networks

no code implementations27 May 2020 Bingchen Liu, Kunpeng Song, Yizhe Zhu, Gerard de Melo, Ahmed Elgammal

Focusing on text-to-image (T2I) generation, we propose Text and Image Mutual-Translation Adversarial Networks (TIME), a lightweight but effective model that jointly learns a T2I generator G and an image captioning discriminator D under the Generative Adversarial Network framework.

Generative Adversarial Network Image Captioning +3

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

no code implementations CVPR 2020 Yizhe Zhu, Martin Renqiang Min, Asim Kadav, Hans Peter Graf

We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e. g., videos and audios) under self-supervision.

Disentanglement

Federated Adversarial Domain Adaptation

no code implementations ICLR 2020 Xingchao Peng, Zijun Huang, Yizhe Zhu, Kate Saenko

In this work, we present a principled approach to the problem of federated domain adaptation, which aims to align the representations learned among the different nodes with the data distribution of the target node.

Disentanglement Domain Adaptation +4

Deterministic tensor completion with hypergraph expanders

2 code implementations23 Oct 2019 Kameron Decker Harris, Yizhe Zhu

We provide a novel analysis of low-rank tensor completion based on hypergraph expanders.

LEMMA

OOGAN: Disentangling GAN with One-Hot Sampling and Orthogonal Regularization

1 code implementation26 May 2019 Bingchen Liu, Yizhe Zhu, Zuohui Fu, Gerard de Melo, Ahmed Elgammal

Exploring the potential of GANs for unsupervised disentanglement learning, this paper proposes a novel GAN-based disentanglement framework with One-Hot Sampling and Orthogonal Regularization (OOGAN).

Disentanglement

Learning Feature-to-Feature Translator by Alternating Back-Propagation for Generative Zero-Shot Learning

1 code implementation ICCV 2019 Yizhe Zhu, Jianwen Xie, Bingchen Liu, Ahmed Elgammal

We investigate learning feature-to-feature translator networks by alternating back-propagation as a general-purpose solution to zero-shot learning (ZSL) problems.

Zero-Shot Learning

Semantic-Guided Multi-Attention Localization for Zero-Shot Learning

no code implementations NeurIPS 2019 Yizhe Zhu, Jianwen Xie, Zhiqiang Tang, Xi Peng, Ahmed Elgammal

Zero-shot learning extends the conventional object classification to the unseen class recognition by introducing semantic representations of classes.

Zero-Shot Learning

Exact Recovery in the Hypergraph Stochastic Block Model: a Spectral Algorithm

no code implementations16 Nov 2018 Sam Cole, Yizhe Zhu

We consider the exact recovery problem in the hypergraph stochastic block model (HSBM) with $k$ blocks of equal size.

Stochastic Block Model

Link the head to the "beak": Zero Shot Learning from Noisy Text Description at Part Precision

no code implementations CVPR 2017 Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal

We propose a learning framework that is able to connect text terms to its relevant parts and suppress connections to non-visual text terms without any part-text annotations.

Zero-Shot Learning

A Multilayer-Based Framework for Online Background Subtraction with Freely Moving Cameras

no code implementations ICCV 2017 Yizhe Zhu, Ahmed Elgammal

The exponentially increasing use of moving platforms for video capture introduces the urgent need to develop the general background subtraction algorithms with the capability to deal with the moving background.

Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.