Source Code Summarization

37 papers with code • 9 benchmarks • 7 datasets

Code Summarization is a task that tries to comprehend code and automatically generate descriptions directly from the source code.

Source: Improving Automatic Source Code Summarization via Deep Reinforcement Learning

Benchmarks

Add a Result

These leaderboards are used to track progress in Source Code Summarization

Dataset	Best Model	Compare
DeepCom-Java	AdaMo-noise	See all
ParallelCorpus-Python	AdaMo-basic	See all
CodeSearchNet	ContraCode	See all
Summarizing Source Code using a Neural Attention Model - C#	CodeTrans-MT-Large	See all
Summarizing Source Code using a Neural Attention Model - Python	CodeTrans-MT-Base	See all
Summarizing Source Code using a Neural Attention Model - SQL	CodeTrans-MT-TF-Large	See all
CoDesc	Transformer	See all
Java scripts	AdaMo-basic	See all
CodeSearchNet - Python	AdaMo-basic	See all

Libraries

Use these libraries to find Source Code Summarization models and implementations

transms/m2ts

2 papers

Datasets

Subtasks

Method name prediction

Most implemented papers

Most implemented Social Latest No code

Retrieval-Augmented Generation for Code Summarization via Hybrid GNN

shangqing-liu/CCSD-benchmark-for-code-summarization • ICLR 2021

However, automatic code summarization is challenging due to the complexity of the source code and the language gap between the source code and natural language summaries.

Paper
Code

Contrastive Code Representation Learning

parasj/contracode • • EMNLP 2021

Recent work learns contextual representations of source code by reconstructing tokens from their context.

Paper
Code

GraphCodeBERT: Pre-training Code Representations with Data Flow

microsoft/CodeBERT • • ICLR 2021

Instead of taking syntactic-level structure of code like abstract syntax tree (AST), we use data flow in the pre-training stage, which is a semantic-level structure of code that encodes the relation of "where-the-value-comes-from" between variables.

Paper
Code

Code Summarization with Structure-induced Transformer

gingasan/sit3 • • Findings (ACL) 2021

Code summarization (CS) is becoming a promising area in recent language understanding, which aims to generate sensible human language automatically for programming language in the format of source code, serving in the most convenience of programmer developing.

Paper
Code

Neural Code Summarization

shrivastava-piyush/nlp-code-summarization • • 26 Feb 2021

Code summarization is the task of generating readable summaries that are semantically meaningful and can accurately describe the presumed task of a software.

Paper
Code

Unified Pre-training for Program Understanding and Generation

wasiahmad/PLBART • • NAACL 2021

Experiments on code summarization in the English language, code generation, and code translation in seven programming languages show that PLBART outperforms or rivals state-of-the-art models.

Paper
Code

Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting

XMUDM/BASTS • • 14 Mar 2021

In this paper, we present the Block-wise Abstract Syntax Tree Splitting method (BASTS for short), which fully utilizes the rich tree-form syntax structure in ASTs, for improving code summarization.

Paper
Code

Language-Agnostic Representation Learning of Source Code from Structure and Context

danielzuegner/code-transformer • • ICLR 2021

Source code (Context) and its parsed abstract syntax tree (AST; Structure) are two complementary representations of the same computer program.

Paper
Code

Project-Level Encoding for Neural Source Code Summarization of Subroutines

aakashba/projcon • • 22 Mar 2021

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine.

Paper
Code

CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

agemagician/CodeTrans • • 6 Apr 2021

Simultaneously, the transformer model, especially its combination with transfer learning, has been proven to be a powerful technique for natural language processing tasks.

Paper
Code

Source Code Summarization

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result