Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

lucidrains/parti-pytorch 22 Jun 2022

We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.

Machine Translation, Text to image generation, +1 more

208
1.27 stars / hour

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

tjiiv-cprg/epro-pnp CVPR 2022

The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distributions.

3D Object Detection, 6D Pose Estimation using RGB, +1 more

497
1.13 stars / hour

Evaluating Large Language Models Trained on Code

codedotal/gpt-code-clippy 7 Jul 2021

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities.

Code Generation, Language Modelling

1,325
1.10 stars / hour

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

facebookresearch/nocturne 20 Jun 2022

We introduce Nocturne, a new 2D driving simulator for investigating multi-agent coordination under partial observability.

Imitation Learning

105
0.80 stars / hour

Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

clementchadebec/benchmark_VAE 16 Jun 2022

In recent years, deep generative models have attracted increasing interest due to their capacity to model complex distributions.

Density Estimation, Image Reconstruction, +1 more

700
0.73 stars / hour

The ArtBench Dataset: Benchmarking Generative Models with Artworks

liaopeiyuan/artbench 22 Jun 2022

We introduce ArtBench-10, the first class-balanced, high-quality, cleanly annotated, and standardized dataset for benchmarking artwork generation.

Conditional Image Generation, Unconditional Image Generation

72
0.73 stars / hour

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Vegetebird/StridedTransformer-Pose3D 26 Mar 2021

The modified Vanilla Transformer Encoder (VTE) is termed the Strided Transformer Encoder (STE), which is built upon the outputs of the VTE.

Monocular 3D Human Pose Estimation

153
0.65 stars / hour

Free-Form Image Inpainting with Gated Convolution

zuruoke/watermark-removal ICCV 2019

We present a generative image inpainting system to complete images with free-form masks and guidance.

Feature Selection, Image Inpainting

266
0.57 stars / hour

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

MineDojo/MineDojo 17 Jun 2022

Autonomous agents have made great strides in specialist domains like Atari games and Go.

Atari Games

187
0.52 stars / hour

Latent Image Animator: Learning to animate image via latent space navigation

wyhsirius/LIA ICLR 2022

Deviating from such models, we here introduce Latent Image Animator (LIA), a self-supervised auto-encoder that evades the need for structure representation.

Image Animation, Video Generation

129
0.49 stars / hour