TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Code Generation	APPS	AlphaCode 1B	Introductory Pass@1000	17.67%	# 6
Code Generation	APPS	AlphaCode 1B	Interview Pass@1000	5.24%	# 5
Code Generation	APPS	AlphaCode 1B	Competition Pass@1000	7.06%	# 5
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Competition Pass@5	7.75%	# 1
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Interview Pass@5	9.66%	# 1
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Introductory Pass@5	20.36%	# 1
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Competition Pass@any	7.75%	# 6
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Interview Pass@any	9.66%	# 5
Code Generation	APPS	AlphaCode 1B Filtered from 50000	Introductory Pass@any	20.36%	# 7
Code Generation	CodeContests	AlphaCode 41B + clustering	Test Set 10@100k	29.6%	# 1
Code Generation	HumanEval	Pretrained Decoder-only 1.1B	Pass@1	17.1%	# 105
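
The Pass@k columns above are not defined on this page. As a rough guide, pass@k is commonly reported as the probability that at least one of k sampled programs solves a problem, averaged over problems. The sketch below uses the standard unbiased estimator popularized for HumanEval-style evaluation; whether every row in the table was computed this way is an assumption, and the function names `pass_at_k` and `benchmark_pass_at_k` are illustrative only.

```python
from math import comb
from typing import Sequence, Tuple


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which are correct."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the chance that a random size-k subset
    # of the n samples contains no correct program.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: Sequence[Tuple[int, int]], k: int) -> float:
    """Average per-problem pass@k over (n, c) pairs, one pair per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical counts for three problems, 10 samples each.
toy_results = [(10, 0), (10, 1), (10, 10)]
toy_score = benchmark_pass_at_k(toy_results, k=1)
```

With these made-up counts the per-problem estimates for k=1 are 0.0, 0.1 and 1.0, so the benchmark score is their mean. Real evaluations use far more samples per problem, which is what makes large k values such as Pass@1000 meaningful.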

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/competition-level-code-generation-with-1/code-generation-on-codecontests)](https://paperswithcode.com/sota/code-generation-on-codecontests?p=competition-level-code-generation-with-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/competition-level-code-generation-with-1/code-generation-on-apps)](https://paperswithcode.com/sota/code-generation-on-apps?p=competition-level-code-generation-with-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/competition-level-code-generation-with-1/code-generation-on-humaneval)](https://paperswithcode.com/sota/code-generation-on-humaneval?p=competition-level-code-generation-with-1)`

Competition-Level Code Generation with AlphaCode

DeepMind 2022 · Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien de Masson d'Autume, Igor Babuschkin, Xinyun Chen, Po-Sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando de Freitas, Koray Kavukcuoglu, Oriol Vinyals

Programming is a powerful and ubiquitous problem-solving tool. Developing systems that can assist programmers or even generate programs independently could make programming more productive and accessible, yet so far incorporating innovations in AI has proven challenging. Recent large-scale language models have demonstrated an impressive ability to generate code, and are now able to complete simple programming tasks. However, these models still perform poorly when evaluated on more complex, unseen problems that require problem-solving skills beyond simply translating instructions into code. For example, competitive programming problems which require an understanding of algorithms and complex natural language remain extremely challenging. To address this gap, we introduce AlphaCode, a system for code generation that can create novel solutions to these problems that require deeper reasoning. In simulated evaluations on recent programming competitions on the Codeforces platform, AlphaCode achieved on average a ranking of top 54.3% in competitions with more than 5,000 participants. We found that three key components were critical to achieve good and reliable performance: (1) an extensive and clean competitive programming dataset for training and evaluation, (2) large and efficient-to-sample transformer-based architectures, and (3) large-scale model sampling to explore the search space, followed by filtering based on program behavior to a small set of submissions.
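
Component (3) of the abstract, sampling many programs and then filtering on program behavior down to a small submission set, can be illustrated with a toy sketch. Everything here is an assumption made for illustration: candidate programs are modeled as Python callables from input text to output text, `probe_inputs` stands in for whatever extra inputs a real system would run, and the default budget of 10 is only suggested by the "10@100k" metric name in the results table. It is not the authors' implementation.

```python
from collections import defaultdict
from typing import Callable, List, Optional, Sequence, Tuple

Program = Callable[[str], str]  # hypothetical stand-in for a sampled program


def run(program: Program, stdin: str) -> Optional[str]:
    """Run one candidate; a crash is treated as producing no output."""
    try:
        return program(stdin)
    except Exception:
        return None


def filter_by_examples(candidates: Sequence[Program],
                       examples: Sequence[Tuple[str, str]]) -> List[Program]:
    """Keep only candidates that reproduce every example input/output pair."""
    return [p for p in candidates
            if all(run(p, x) == y for x, y in examples)]


def cluster_by_behavior(programs: Sequence[Program],
                        probe_inputs: Sequence[str]) -> List[List[Program]]:
    """Group programs whose outputs agree on all probe inputs, largest first."""
    clusters = defaultdict(list)
    for p in programs:
        signature = tuple(run(p, x) for x in probe_inputs)
        clusters[signature].append(p)
    return sorted(clusters.values(), key=len, reverse=True)


def select_submissions(candidates: Sequence[Program],
                       examples: Sequence[Tuple[str, str]],
                       probe_inputs: Sequence[str],
                       budget: int = 10) -> List[Program]:
    """Filter, cluster, then take one program from each of the largest clusters."""
    survivors = filter_by_examples(candidates, examples)
    clusters = cluster_by_behavior(survivors, probe_inputs)
    return [cluster[0] for cluster in clusters[:budget]]


# Toy problem: print twice the input integer.
def double(s: str) -> str: return str(2 * int(s))
def add_self(s: str) -> str: return str(int(s) + int(s))   # same behavior as double
def square(s: str) -> str: return str(int(s) ** 2)         # passes the example only
def plus_one(s: str) -> str: return str(int(s) + 1)        # fails the example
def crash(s: str) -> str: return str(1 // 0)               # always raises

candidates = [double, add_self, square, plus_one, crash]
picks = select_submissions(candidates, examples=[("2", "4")],
                           probe_inputs=["3", "5"], budget=1)
```

In this toy run the single example test removes `plus_one` and `crash`, while `square` survives because 2 squared also equals 4. Clustering on the probe inputs then separates `square` from the two doubling programs, so a budget of one submission goes to the larger, correct cluster.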

Code

deepmind/code_contests (official), 2,013 GitHub stars

Tasks

Code Generation

Datasets

HumanEval, APPS, CodeContests, MassiveText

Results from the Paper

Ranked #1 on Code Generation on CodeContests

