Bayesian Flow Networks

14 Aug 2023  ·  Alex Graves, Rupesh Kumar Srivastava, Timothy Atkinson, Faustino Gomez

This paper introduces Bayesian Flow Networks (BFNs), a new class of generative model in which the parameters of a set of independent distributions are modified with Bayesian inference in the light of noisy data samples, then passed as input to a neural network that outputs a second, interdependent distribution. Starting from a simple prior and iteratively updating the two distributions yields a generative procedure similar to the reverse process of diffusion models; however, it is conceptually simpler in that no forward process is required. Discrete- and continuous-time loss functions are derived for continuous, discretised and discrete data, along with sample generation procedures. Notably, the network inputs for discrete data lie on the probability simplex, and are therefore natively differentiable, paving the way for gradient-based sample guidance and few-step generation in discrete domains such as language modelling. The loss function directly optimises data compression and places no restrictions on the network architecture. In our experiments BFNs achieve competitive log-likelihoods for image modelling on dynamically binarized MNIST and CIFAR-10, and outperform all known discrete diffusion models on the text8 character-level language modelling task.
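To make the iterative two-distribution procedure concrete, below is a minimal sketch of the sampling loop for discrete data, under stated assumptions: `net(theta, t)` is a hypothetical interface for the network, mapping simplex-valued input parameters and a time to an output categorical distribution, and the accuracy schedule follows the quadratic form β(t) = β(1)t² the paper uses for discrete data. This is an illustration of the mechanism, not the authors' implementation.

```python
import torch

def bfn_sample_discrete(net, n_steps, n_vars, K, beta1=3.0):
    """Sketch of BFN-style generation for n_vars categorical variables with K classes.

    `net(theta, t)` is an assumed interface: it takes the (n_vars, K) simplex-valued
    input parameters plus a scalar time and returns an output categorical
    distribution of the same shape.
    """
    # Simple prior: uniform categorical parameters on the probability simplex
    theta = torch.full((n_vars, K), 1.0 / K)
    for i in range(1, n_steps + 1):
        t = (i - 1) / n_steps
        # Network output: a second, interdependent distribution over classes
        p_out = net(theta, torch.tensor(t))                  # (n_vars, K)
        k = torch.multinomial(p_out, 1).squeeze(-1)          # sample one class per variable
        e_k = torch.nn.functional.one_hot(k, K).float()      # (n_vars, K)
        # Per-step accuracy under the quadratic schedule beta(t) = beta1 * t^2
        alpha = beta1 * (2 * i - 1) / n_steps ** 2
        # Simulate a noisy sample: y ~ N(alpha * (K * e_k - 1), alpha * K * I)
        y = alpha * (K * e_k - 1) + (alpha * K) ** 0.5 * torch.randn(n_vars, K)
        # Bayesian update of the independent input parameters: theta' ∝ exp(y) * theta
        theta = torch.softmax(y + theta.log(), dim=-1)
    # Final sample from the output distribution at t = 1
    p_final = net(theta, torch.tensor(1.0))
    return torch.multinomial(p_final, 1).squeeze(-1)
```

Each step combines the network's interdependent output with a closed-form Bayesian update of the independent input parameters, mirroring the interplay described above; because `theta` always lies on the probability simplex, it remains a differentiable network input throughout.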

Task                 Dataset           Model   Metric                    Value   Global Rank
Image Generation     Binarized MNIST   BFN     nats                      77.87   #3
Image Generation     CIFAR-10          BFN     bits/dimension            2.66    #9
Language Modelling   Text8             BFN     Bit per Character (BPC)   1.41    #22
