TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Code Generation	HumanEval	GPT-4 (0-shot)	Pass@1	76.5	# 13
Code Generation	HumanEval	GPT-3.5 Turbo (zero-shot)	Pass@1	64.9	# 30
Code Generation	HumanEval	DeepSeek-Coder-Instruct 33B (0-shot)	Pass@1	69.2	# 23
Code Generation	HumanEval	DeepSeek-Coder-Base 1.3B (zero-shot)	Pass@1	28.3	# 85
Code Generation	HumanEval	DeepSeek-Coder-Base 6.7B (zero-shot)	Pass@1	44.7	# 53
Code Generation	HumanEval	DeepSeek-Coder-Instruct 6.7B (0-shot)	Pass@1	66.1	# 27
Code Generation	HumanEval	DeepSeek-Coder-Instruct 1.3B (zero-shot)	Pass@1	48.4	# 46
Code Generation	HumanEval	DeepSeek-Coder-Base 33B (zero-shot)	Pass@1	50.3	# 43
Code Generation	MBPP	DeepSeek-Coder-Base 6.7B (few-shot)	Accuracy	60.6	# 36
Code Generation	MBPP	DeepSeek-Coder-Instruct 6.7B (few-shot)	Accuracy	65.4	# 28
Code Generation	MBPP	DeepSeek-Coder-Base 33B (few-shot)	Accuracy	66	# 26
Code Generation	MBPP	GPT-3.5 Turbo (few-shot)	Accuracy	70.8	# 16
Code Generation	MBPP	GPT-4 (few-shot)	Accuracy	80	# 11
Code Generation	MBPP	DeepSeek-Coder-Instruct 1.3B (few-shot)	Accuracy	49.4	# 48
Code Generation	MBPP	DeepSeek-Coder-Base 1.3B (few-shot)	Accuracy	46.2	# 60
Code Generation	MBPP	DeepSeek-Coder-Instruct 33B (few-shot)	Accuracy	70	# 18

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deepseek-coder-when-the-large-language-model/code-generation-on-mbpp)](https://paperswithcode.com/sota/code-generation-on-mbpp?p=deepseek-coder-when-the-large-language-model)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deepseek-coder-when-the-large-language-model/code-generation-on-humaneval)](https://paperswithcode.com/sota/code-generation-on-humaneval?p=deepseek-coder-when-the-large-language-model)`

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

25 Jan 2024 · Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li, Fuli Luo, Yingfei Xiong, Wenfeng Liang ·

The rapid development of large language models has revolutionized code intelligence in software development. However, the predominance of closed-source models has restricted extensive research and development. To address this, we introduce the DeepSeek-Coder series, a range of open-source code models with sizes from 1.3B to 33B, trained from scratch on 2 trillion tokens. These models are pre-trained on a high-quality project-level code corpus and employ a fill-in-the-blank task with a 16K window to enhance code generation and infilling. Our extensive evaluations demonstrate that DeepSeek-Coder not only achieves state-of-the-art performance among open-source code models across multiple benchmarks but also surpasses existing closed-source models like Codex and GPT-3.5. Furthermore, DeepSeek-Coder models are under a permissive license that allows for both research and unrestricted commercial use.

PDF Abstract

Code

Add Remove Mark official

deepseek-ai/DeepSeek-Coder official

↳ Quickstart in

Spaces

5,331

Tasks

Add Remove

16k

Code Generation

Language Modelling

Large Language Model

Datasets

MMLU

GSM8K

HumanEval

HellaSwag

MATH MBPP

SVAMP BBH ASDiv

DS-1000

Results from the Paper

Add Remove

Ranked #11 on Code Generation on MBPP

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Code Generation	HumanEval	GPT-4 (0-shot)	Pass@1	76.5	# 13	Compare
Code Generation	HumanEval	GPT-3.5 Turbo (zero-shot)	Pass@1	64.9	# 30	Compare
Code Generation	HumanEval	DeepSeek-Coder-Instruct 33B (0-shot)	Pass@1	69.2	# 23	Compare
Code Generation	HumanEval	DeepSeek-Coder-Base 1.3B (zero-shot)	Pass@1	28.3	# 85	Compare
Code Generation	HumanEval	DeepSeek-Coder-Base 6.7B (zero-shot)	Pass@1	44.7	# 53	Compare
Code Generation	HumanEval	DeepSeek-Coder-Instruct 6.7B (0-shot)	Pass@1	66.1	# 27	Compare
Code Generation	HumanEval	DeepSeek-Coder-Instruct 1.3B (zero-shot)	Pass@1	48.4	# 46	Compare
Code Generation	HumanEval	DeepSeek-Coder-Base 33B (zero-shot)	Pass@1	50.3	# 43	Compare
Code Generation	MBPP	DeepSeek-Coder-Base 6.7B (few-shot)	Accuracy	60.6	# 36	Compare
Code Generation	MBPP	DeepSeek-Coder-Instruct 6.7B (few-shot)	Accuracy	65.4	# 28	Compare
Code Generation	MBPP	DeepSeek-Coder-Base 33B (few-shot)	Accuracy	66	# 26	Compare
Code Generation	MBPP	GPT-3.5 Turbo (few-shot)	Accuracy	70.8	# 16	Compare
Code Generation	MBPP	GPT-4 (few-shot)	Accuracy	80	# 11	Compare
Code Generation	MBPP	DeepSeek-Coder-Instruct 1.3B (few-shot)	Accuracy	49.4	# 48	Compare
Code Generation	MBPP	DeepSeek-Coder-Base 1.3B (few-shot)	Accuracy	46.2	# 60	Compare
Code Generation	MBPP	DeepSeek-Coder-Instruct 33B (few-shot)	Accuracy	70	# 18	Compare

Methods

Add Remove

Adam • Attention Dropout • BPE • Cosine Annealing • Dense Connections • Dropout • Fixed Factorized Attention • GELU • GPT-3 • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Strided Attention • Weight Decay

Edit Social Preview

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove