Code summarization is the task of comprehending code and automatically generating natural-language descriptions directly from source code.
Source: Improving Automatic Source Code Summarization via Deep Reinforcement Learning
The ability to generate natural language sequences from source code snippets has a variety of applications such as code summarization, documentation, and retrieval.
Generating a readable summary that describes the functionality of a program is known as source code summarization.
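As a concrete (hypothetical) illustration of what a training example for this task looks like, a model is given a tokenized code snippet as input and a short natural-language summary as the target; the snippet, summary, and tokenizer below are illustrative assumptions, not drawn from any specific dataset.

```python
import re

# A hypothetical (code, summary) training pair for code summarization:
# the model learns to map the source-token sequence to the summary sequence.
source_code = "def add(a, b):\n    return a + b"
summary = "add two numbers and return the result"

def tokenize(text):
    # Naive tokenization: runs of word characters, or single punctuation marks.
    return re.findall(r"\w+|[^\w\s]", text)

code_tokens = tokenize(source_code)
summary_tokens = tokenize(summary)
```

Real systems use learned subword vocabularies rather than this regex split, but the input/output shape of the task is the same.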
We propose Contrastive Code Representation Learning (ContraCode), a self-supervised algorithm for learning task-agnostic semantic representations of programs via contrastive learning.
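Contrastive objectives of this kind are typically instances of the InfoNCE / NT-Xent loss: embeddings of a program and of a semantically equivalent transformed variant are pulled together, while other programs in the batch serve as negatives. The sketch below is a generic NumPy version of that loss, not ContraCode's actual implementation; the batch layout (positives share the anchor's row index) is an assumption.

```python
import numpy as np

def info_nce_loss(anchors, positives, temperature=0.07):
    """InfoNCE over a batch: anchors[i] and positives[i] are embeddings of
    two views of the same program; every other row acts as a negative."""
    # L2-normalize so dot products are cosine similarities.
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature                # (batch, batch) similarities
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Cross-entropy with the diagonal (the matching pair) as the correct class.
    return -np.mean(np.diag(log_probs))
```

When each anchor is most similar to its own positive the loss is near zero; mismatched pairs drive it up, which is what pushes representations of equivalent programs together.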
Ranked #1 on method name prediction on CodeSearchNet.
Summarization of long sequences into a concise statement is a core problem in natural language processing, requiring non-trivial understanding of the input.
The first approaches to use structural information flattened the abstract syntax tree (AST) into a sequence.
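Such flattening is usually a pre-order traversal that emits node-type tokens in visit order, producing a sequence a standard encoder can consume. A minimal sketch for Python source, using the standard-library `ast` module:

```python
import ast

def flatten_ast(code):
    """Flatten a Python AST into a token sequence of node-type names via
    pre-order traversal -- the simple linearization early structure-aware
    summarization models fed to sequence encoders."""
    tokens = []
    def visit(node):
        tokens.append(type(node).__name__)
        for child in ast.iter_child_nodes(node):
            visit(child)
    visit(ast.parse(code))
    return tokens
```

Note that this linearization discards identifier names and exact parent-child boundaries; later work added bracket tokens or structure-based traversals to make the tree recoverable from the sequence.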
Neural machine translation models can be used to automatically generate documentation from source code, since the problem can be framed as a translation task from a programming language to natural language.
Code summarization (CS) and code generation (CG) are two crucial tasks in the field of automatic software development.
To the best of our knowledge, most state-of-the-art approaches follow an encoder-decoder framework that encodes the code into a hidden space and then decodes it into natural language. This framework suffers from two major drawbacks: (a) the encoders consider only the sequential content of the code, ignoring its tree structure, which is also critical for code summarization; and (b) the decoders are typically trained to predict the next word by maximizing the likelihood of the next ground-truth word given the previous ground-truth words.
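Drawback (b) is the standard teacher-forcing objective. A minimal sketch of that training loss, assuming the decoder's per-step logits are given: at each step the loss is the negative log-probability of the ground-truth word, conditioned on the ground-truth prefix. At test time the decoder must instead condition on its own (possibly wrong) predictions, which is the train/test mismatch (exposure bias) the passage alludes to.

```python
import numpy as np

def teacher_forced_nll(step_logits, target_ids):
    """Maximum-likelihood ("teacher forcing") decoder loss: step_logits[t]
    are the decoder's vocabulary scores at step t, computed while conditioning
    on the ground-truth words y_1..y_{t-1}; target_ids[t] is the index of the
    ground-truth word y_t.  Returns the mean negative log-likelihood."""
    nll = 0.0
    for logits, target in zip(step_logits, target_ids):
        logits = logits - logits.max()                      # stability
        log_probs = logits - np.log(np.exp(logits).sum())   # log-softmax
        nll -= log_probs[target]                            # -log p(y_t | y_<t)
    return nll / len(target_ids)
```

Reinforcement-learning approaches to summarization replace this per-word likelihood with a sequence-level reward, so the model is trained on its own sampled outputs rather than only on ground-truth prefixes.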