Code Generation

322 papers with code • 17 benchmarks • 41 datasets

Code Generation is an important field to predict explicit code or program structure from multimodal data sources such as incomplete code, programs in another programming language, natural language descriptions or execution examples. Code Generation tools can assist the development of automatic programming tools to improve programming productivity.

Source: Deep Learning for Source Code Modeling and Generation

Image source: Measuring Coding Challenge Competence With APPS

Benchmarks

Add a Result

These leaderboards are used to track progress in Code Generation

Dataset	Best Model	Compare
HumanEval	AgentCoder (GPT-4)	See all
MBPP	GPT-4 + AgentCoder	See all
CoNaLa	PanGu-Coder-FT-I	See all
APPS	CodeRL+CodeT5	See all
Django	MarianCG	See all
WikiSQL	NL2SQL-RULE	See all
CoNaLa-Ext	BART Base	See all
Turbulence	GPT-4	See all
CodeContests	AlphaCode 41B + clustering	See all
Shellcode_IA32	CodeBERT	See all
TACO-Code	GPT-4	See all
CodeXGLUE - CodeSearchNet	Redcoder-ext	See all
CONCODE	Redcoder-ext	See all
Verified Smart Contract Code Comments	GPT-J 6B Smart Contract	See all
Android Repos	Entity Type Model	See all

Show all 17 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Code Generation models and implementations

epfllm/megatron-llm

4 papers

451

eth-sri/sven

4 papers

uiuc-focal-lab/syncode

3 papers

DeepLearnXMU/CG-RL

3 papers

See all 15 libraries.

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

DevBench: A Comprehensive Benchmark for Software Development

open-compass/devbench • 13 Mar 2024

Recent advancements in large language models (LLMs) have significantly enhanced their coding capabilities.

13 Mar 2024

Paper
Code

CleanAgent: Automating Data Standardization with LLM-based Agents

sfu-db/CleanAgent • 13 Mar 2024

Data standardization is a crucial part in data science life cycle.

13 Mar 2024

Paper
Code

Bugs in Large Language Models Generated Code: An Empirical Study

flowss/bugsinllms • 13 Mar 2024

The bug patterns are presented in the form of a taxonomy.

13 Mar 2024

Paper
Code

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Leeroo-AI/mergoo • • 12 Mar 2024

We investigate efficient methods for training Large Language Models (LLMs) to possess capabilities in multiple specialized domains, such as coding, math reasoning and world knowledge.

155

12 Mar 2024

Paper
Code

Automatic Generation of Python Programs Using Context-Free Grammars

marwanair/tinypy-generator • 11 Mar 2024

In recent years, data has emerged as the new gold, serving as a powerful tool for creating intelligent systems.

11 Mar 2024

Paper
Code

Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation

mulns/text2qr • 11 Mar 2024

This approach harnesses the potent generation capabilities of stable-diffusion models, navigating the trade-off between image aesthetics and QR code scannability.

11 Mar 2024

Paper
Code

UniSparse: An Intermediate Language for General Sparse Format Customization

cornell-zhang/unisparse • 9 Mar 2024

The ongoing trend of hardware specialization has led to a growing use of custom data formats when processing sparse workloads, which are typically memory-bound.

09 Mar 2024

Paper
Code

Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models

yale-nlp/code-llm-contamination • 6 Mar 2024

While large language models have achieved remarkable performance on various code generation benchmarks, there have been growing concerns regarding potential contamination of these benchmarks as they may be leaked into pretraining and finetuning data.

06 Mar 2024

Paper
Code

IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators

UKPLab/arxiv2024-IRCoder • 6 Mar 2024

In particular, most mainstream Code-LMs have been pre-trained on source code files alone.

06 Mar 2024

Paper
Code

DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation

shirley-wu/daco • 4 Mar 2024

We construct the DACO dataset, containing (1) 440 databases (of tabular data) collected from real-world scenarios, (2) ~2k query-answer pairs that can serve as weak supervision for model training, and (3) a concentrated but high-quality test set with human refined annotations that serves as our main evaluation benchmark.

04 Mar 2024

Paper
Code

Code Generation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result