A central goal of machine learning is the development of systems that can solve many problems in as many data domains as possible. Current architectures, however, cannot be applied beyond a small set of stereotyped settings, as they bake in domain & task assumptions or scale poorly to large inputs or outputs. In this work, we propose Perceiver IO, a general-purpose architecture that handles data from arbitrary settings while scaling linearly with the size of inputs and outputs. Our model augments the Perceiver with a flexible querying mechanism that enables outputs of various sizes and semantics, doing away with the need for task-specific architecture engineering. The same architecture achieves strong results on tasks spanning natural language and visual understanding, multi-task and multi-modal reasoning, and StarCraft II. As highlights, Perceiver IO outperforms a Transformer-based BERT baseline on the GLUE language benchmark despite removing input tokenization and achieves state-of-the-art performance on Sintel optical flow estimation with no explicit mechanisms for multiscale correspondence.
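To make the abstract's scaling claim concrete, the sketch below shows the encode-process-decode attention pattern it describes: a fixed-size learned latent array cross-attends to the inputs, self-attention runs entirely in the latent space, and task-specific output queries cross-attend to the latents to produce outputs of arbitrary size. This is a minimal illustrative sketch in PyTorch, not the authors' implementation; all dimensions, module names, and the residual wiring are assumptions.

```python
# Minimal sketch of the Perceiver IO encode / process / decode pattern.
# Hypothetical dimensions and wiring; not the paper's actual code.
import torch
import torch.nn as nn

class PerceiverIOSketch(nn.Module):
    def __init__(self, input_dim=64, latent_dim=128, num_latents=256,
                 query_dim=128, num_heads=4, num_process_layers=6):
        super().__init__()
        # Learned latent array: its size is fixed and independent of input length.
        self.latents = nn.Parameter(torch.randn(num_latents, latent_dim))
        self.input_proj = nn.Linear(input_dim, latent_dim)
        # Encode: cross-attention from latents (queries) to inputs (keys/values).
        self.encode = nn.MultiheadAttention(latent_dim, num_heads, batch_first=True)
        # Process: self-attention in the latent space; cost is independent of I/O size.
        self.process = nn.ModuleList(
            nn.MultiheadAttention(latent_dim, num_heads, batch_first=True)
            for _ in range(num_process_layers))
        # Decode: cross-attention from output queries to the latents.
        self.query_proj = nn.Linear(query_dim, latent_dim)
        self.decode = nn.MultiheadAttention(latent_dim, num_heads, batch_first=True)

    def forward(self, inputs, output_queries):
        # inputs: (batch, M, input_dim); output_queries: (batch, O, query_dim).
        b = inputs.shape[0]
        z = self.latents.expand(b, -1, -1)
        x = self.input_proj(inputs)
        z, _ = self.encode(z, x, x)       # O(M * num_latents): linear in input size
        for layer in self.process:
            z = z + layer(z, z, z)[0]     # latent self-attention with residual
        q = self.query_proj(output_queries)
        out, _ = self.decode(q, z, z)     # O(O * num_latents): linear in output size
        return out                        # (batch, O, latent_dim)

# Usage: 1000 input elements, 50 output queries of a different semantics.
model = PerceiverIOSketch()
y = model(torch.randn(2, 1000, 64), torch.randn(2, 50, 128))  # -> (2, 50, 128)
```

Because the only places the input length M and output size O appear are the two cross-attention steps, compute grows linearly in each, while the self-attention stack operates on the fixed number of latents.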


Results from the Paper


 Ranked #1 on Optical Flow Estimation on KITTI 2015 (Average End-Point Error metric)

Task                     Dataset       Model         Metric                   Value   Global Rank
Optical Flow Estimation  KITTI 2015    Perceiver IO  Average End-Point Error  4.98    #1
Optical Flow Estimation  Sintel-clean  Perceiver IO  Average End-Point Error  1.81    #10
Optical Flow Estimation  Sintel-final  Perceiver IO  Average End-Point Error  2.42    #3
