Conffusion: Confidence Intervals for Diffusion Models

eliahuhorwitz/conffusion 17 Nov 2022

Diffusion models have become the go-to method for many generative tasks, particularly for image-to-image generation tasks such as super-resolution and inpainting.

Conformal Prediction, Facial Inpainting (+2 more)

Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

google-research/nerf-from-image 21 Nov 2022

Neural Radiance Fields (NeRF) coupled with GANs represent a promising direction in the area of 3D reconstruction from a single view, owing to their ability to efficiently model arbitrary topologies.

3D Reconstruction, Pose Estimation

Galactica: A Large Language Model for Science

paperswithcode/galai 16 Nov 2022

We believe these results demonstrate the potential for language models as a new interface for science.

Abstract Algebra, Astronomy (+31 more)

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

peract/peract 12 Sep 2022

With this formulation, we train a single multi-task Transformer for 18 RLBench tasks (with 249 variations) and 7 real-world tasks (with 18 variations) from just a few demonstrations per task.

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

baaivision/eva 14 Nov 2022

We launch EVA, a vision-centric foundation model to explore the limits of visual representation at scale using only publicly accessible data.

Ranked #1 on Object Detection on LVIS v1.0 val (using extra training data)

Action Classification, Action Recognition (+6 more)

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

Flag-Open/FlagAI 12 Nov 2022

In this work, we present a conceptually simple and effective method to train a strong bilingual/multilingual multimodal representation model.

Contrastive Learning, Cross-Modal Retrieval (+11 more)

Generate rather than Retrieve: Large Language Models are Strong Context Generators

wyu97/GenRead 21 Sep 2022

A common approach for knowledge-intensive tasks is to employ a retrieve-then-read pipeline that first retrieves a handful of relevant contextual documents from an external corpus such as Wikipedia and then predicts an answer conditioned on the retrieved documents.

Fact Checking, Language Modelling (+3 more)

Closed-form Continuous-time Neural Models

raminmh/CfC 25 Jun 2021

To this end, we compute a tightly-bounded approximation of the solution of an integral appearing in LTCs' dynamics, which has had no known closed-form solution so far.

Sentiment Analysis, Time Series Prediction

Inversion-Based Creativity Transfer with Diffusion Models

zyxelsa/creativity-transfer 23 Nov 2022

In this paper, we introduce the task of "Creativity Transfer".

Denoising, Style Transfer (+1 more)

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

MineDojo/MineDojo 17 Jun 2022

Autonomous agents have made great strides in specialist domains like Atari games and Go.

Atari Games

