Code Generation

331 papers with code • 15 benchmarks • 43 datasets

Code Generation is an important field to predict explicit code or program structure from multimodal data sources such as incomplete code, programs in another programming language, natural language descriptions or execution examples. Code Generation tools can assist the development of automatic programming tools to improve programming productivity.

Source: Deep Learning for Source Code Modeling and Generation

Image source: Measuring Coding Challenge Competence With APPS

Benchmarks

Add a Result

These leaderboards are used to track progress in Code Generation

Dataset	Best Model	Compare
HumanEval	AgentCoder (GPT-4)	See all
MBPP	GPT-4 + AgentCoder	See all
CoNaLa	PanGu-Coder-FT-I	See all
APPS	CodeRL+CodeT5	See all
Django	MarianCG	See all
WikiSQL	NL2SQL-RULE	See all
CoNaLa-Ext	BART Base	See all
Turbulence	GPT-4	See all
CodeContests	AlphaCode 41B + clustering	See all
Shellcode_IA32	CodeBERT	See all
TACO-Code	GPT-4	See all
CodeXGLUE - CodeSearchNet	Redcoder-ext	See all
CONCODE	Redcoder-ext	See all
Verified Smart Contract Code Comments	GPT-J 6B Smart Contract	See all
Android Repos	Entity Type Model	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Code Generation models and implementations

epfllm/megatron-llm

4 papers

459

eth-sri/sven

4 papers

uiuc-focal-lab/syncode

3 papers

DeepLearnXMU/CG-RL

3 papers

See all 15 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

PaLM: Scaling Language Modeling with Pathways

lucidrains/CoCa-pytorch • • Google Research 2022

To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model PaLM.

Paper
Code

SantaCoder: don't reach for the stars!

bigcode-project/starcoder • • 9 Jan 2023

The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code.

Paper
Code

Mistral 7B

mistralai/mistral-src • • 10 Oct 2023

We introduce Mistral 7B v0. 1, a 7-billion-parameter language model engineered for superior performance and efficiency.

Paper
Code

TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation

pcyin/tranX • • EMNLP 2018

We present TRANX, a transition-based neural semantic parser that maps natural language (NL) utterances into formal meaning representations (MRs).

Paper
Code

Content Enhanced BERT-based Text-to-SQL Generation

guotong1988/NL2SQL-RULE • • 16 Oct 2019

We present a simple methods to leverage the table content for the BERT-based model to solve the text-to-SQL problem.

Paper
Code

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

microsoft/CodeXGLUE • • 9 Feb 2021

Benchmark datasets have a significant impact on accelerating research in programming language tasks.

Paper
Code

StarCoder: may the source be with you!

bigcode-project/starcoder • • 9 May 2023

The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15. 5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention.

Paper
Code

How is ChatGPT's behavior changing over time?

lchen001/llmdrift • 18 Jul 2023

We find that the performance and behavior of both GPT-3. 5 and GPT-4 can vary greatly over time.

Paper
Code

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

gair-nlp/factool • 25 Jul 2023

With the above challenges in mind, in this paper, we propose FacTool, a task and domain agnostic framework for detecting factual errors of texts generated by large language models (e. g., ChatGPT).

Paper
Code

Measuring Coding Challenge Competence With APPS

hendrycks/apps • • 20 May 2021

Recent models such as GPT-Neo can pass approximately 20% of the test cases of introductory problems, so we find that machine learning models are now beginning to learn how to code.

Paper
Code

Code Generation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result