Code Summarization

68 papers with code • 1 benchmarks • 7 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Code Summarization models and implementations
2 papers
21

Most implemented papers

Retrieval-Augmented Generation for Code Summarization via Hybrid GNN

shangqing-liu/CCSD-benchmark-for-code-summarization ICLR 2021

However, automatic code summarization is challenging due to the complexity of the source code and the language gap between the source code and natural language summaries.

GraphCodeBERT: Pre-training Code Representations with Data Flow

microsoft/CodeBERT ICLR 2021

Instead of taking syntactic-level structure of code like abstract syntax tree (AST), we use data flow in the pre-training stage, which is a semantic-level structure of code that encodes the relation of "where-the-value-comes-from" between variables.

Code Summarization with Structure-induced Transformer

gingasan/sit3 Findings (ACL) 2021

Code summarization (CS) is becoming a promising area in recent language understanding, which aims to generate sensible human language automatically for programming language in the format of source code, serving in the most convenience of programmer developing.

Neural Code Summarization

shrivastava-piyush/nlp-code-summarization 26 Feb 2021

Code summarization is the task of generating readable summaries that are semantically meaningful and can accurately describe the presumed task of a software.

Unified Pre-training for Program Understanding and Generation

wasiahmad/PLBART NAACL 2021

Experiments on code summarization in the English language, code generation, and code translation in seven programming languages show that PLBART outperforms or rivals state-of-the-art models.

Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting

XMUDM/BASTS 14 Mar 2021

In this paper, we present the Block-wise Abstract Syntax Tree Splitting method (BASTS for short), which fully utilizes the rich tree-form syntax structure in ASTs, for improving code summarization.

Language-Agnostic Representation Learning of Source Code from Structure and Context

danielzuegner/code-transformer ICLR 2021

Source code (Context) and its parsed abstract syntax tree (AST; Structure) are two complementary representations of the same computer program.

Project-Level Encoding for Neural Source Code Summarization of Subroutines

aakashba/projcon 22 Mar 2021

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine.

CoTexT: Multi-task Learning with Code-Text Transformer

justinphan3110/CoTexT ACL (NLP4Prog) 2021

We train CoTexT on different combinations of available PL corpus including both "bimodal" and "unimodal" data.

TreeBERT: A Tree-Based Pre-Trained Model for Programming Language

17385/TreeBERT 26 May 2021

In this paper, we present TreeBERT, a tree-based pre-trained model for improving programming language-oriented generation tasks.