Search Results for author: Wendi Zheng

Found 4 papers, 4 papers with code

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

1 code implementation29 May 2022 Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang

Large-scale pretrained transformers have created milestones in text (GPT-3) and text-to-image (DALL-E and CogView) generation.

Text-to-Video Generation Video Generation

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

1 code implementation28 Apr 2022 Ming Ding, Wendi Zheng, Wenyi Hong, Jie Tang

The development of the transformer-based text-to-image models are impeded by its slow generation and complexity for high-resolution images.

Language Modelling Super-Resolution +2

CogView: Mastering Text-to-Image Generation via Transformers

3 code implementations NeurIPS 2021 Ming Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang

Text-to-Image generation in the general domain has long been an open problem, which requires both a powerful generative model and cross-modal understanding.

Ranked #31 on Text-to-Image Generation on COCO (using extra training data)

Super-Resolution Text to image generation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.