Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

lucidrains/parti-pytorch 22 Jun 2022

We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.

Machine Translation, Text to image generation, +1

HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction

zhenpeiyang/hm3d-abo 24 Jun 2022

Reconstructing 3D objects is an important computer vision task that has wide application in AR/VR.

3D Reconstruction, Novel View Synthesis, +1

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

tjiiv-cprg/epro-pnp CVPR 2022

The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distributions.

3D Object Detection, 6D Pose Estimation using RGB, +1

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Vegetebird/StridedTransformer-Pose3D 26 Mar 2021

The modified Vanilla Transformer Encoder (VTE) is termed the Strided Transformer Encoder (STE) and is built upon the outputs of the VTE.

Monocular 3D Human Pose Estimation

Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

clementchadebec/benchmark_VAE 16 Jun 2022

In recent years, deep generative models have attracted increasing interest due to their capacity to model complex distributions.

Density Estimation, Image Reconstruction, +1

Evaluating Large Language Models Trained on Code

codedotal/gpt-code-clippy 7 Jul 2021

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities.

Code Generation, Language Modelling

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

facebookresearch/nocturne 20 Jun 2022

We introduce Nocturne, a new 2D driving simulator for investigating multi-agent coordination under partial observability.

Imitation Learning

The ArtBench Dataset: Benchmarking Generative Models with Artworks

liaopeiyuan/artbench 22 Jun 2022

We introduce ArtBench-10, the first class-balanced, high-quality, cleanly annotated, and standardized dataset for benchmarking artwork generation.

Conditional Image Generation, Unconditional Image Generation

OmniXAI: A Library for Explainable AI

salesforce/omnixai 1 Jun 2022

We introduce OmniXAI (short for Omni eXplainable AI), an open-source Python library of eXplainable AI (XAI), which offers omni-way explainable AI capabilities and various interpretable machine learning techniques to address the pain points of understanding and interpreting the decisions made by machine learning (ML) in practice.

Counterfactual Explanation, Decision Making, +3

I M Avatar: Implicit Morphable Head Avatars from Videos

zhengyuf/imavatar CVPR 2022

Traditional 3D morphable face models (3DMMs) provide fine-grained control over expression but cannot easily capture geometric and appearance details.

