StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

autonomousvision/stylegan-t 23 Jan 2023

Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models.

Pretrained Language Models Text-to-Image Generation

377
0.74 stars / hour

K-Planes: Explicit Radiance Fields in Space, Time, and Appearance

sarafridov/k-planes 24 Jan 2023

We introduce k-planes, a white-box model for radiance fields in arbitrary dimensions.

Novel View Synthesis

121
0.71 stars / hour

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

showlab/Tune-A-Video 22 Dec 2022

To reproduce the success of text-to-image (T2I) generation, recent works in text-to-video (T2V) generation employ large-scale text-video dataset for fine-tuning.

Style Transfer Text-to-Video Generation +1

284
0.63 stars / hour

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

sczhou/codeformer 22 Jun 2022

In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration mapping by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces.

Blind Face Restoration

4,150
0.58 stars / hour

DAMO-YOLO : A Report on Real-Time Object Detection Design

tinyvision/damo-yolo 23 Nov 2022

In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.

Neural Architecture Search object-detection +1

1,125
0.56 stars / hour

Scaling Language-Image Pre-training via Masking

ofa-sys/chinese-clip 1 Dec 2022

We present Fast Language-Image Pre-training (FLIP), a simple and more efficient method for training CLIP.

793
0.55 stars / hour

ProGen2: Exploring the Boundaries of Protein Language Models

salesforce/progen 27 Jun 2022

Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design.

189
0.53 stars / hour

MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning

MTLab/MorphMLP 24 Nov 2021

With such multi-dimension and multi-scale factorization, our MorphMLP block can achieve a great accuracy-computation balance.

Ranked #18 on Action Recognition on Something-Something V2 (using extra training data)

Action Recognition Image Classification +2

107
0.47 stars / hour

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

stanfordnlp/dsp 28 Dec 2022

Retrieval-augmented in-context learning has emerged as a powerful approach for addressing knowledge-intensive tasks using frozen language models (LM) and retrieval models (RM).

Language Modelling Question Answering +1

201
0.41 stars / hour