Code Summarization

68 papers with code • 1 benchmarks • 7 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Code Summarization models and implementations
2 papers
22

Most implemented papers

CoDesc: A Large Code-Description Parallel Dataset

csebuetnlp/CoDesc 29 May 2021

In this study, we present CoDesc -- a large parallel dataset composed of 4. 2 million Java methods and natural language descriptions.

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

SageSELab/CodeSumStudy ACL (NLP4Prog) 2021

Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions.

On the Evaluation of Neural Code Summarization

DeepSoftwareAnalytics/CodeSumEvaluation 15 Jul 2021

To achieve a profound understanding of how far we are from solving this problem and provide suggestions to future research, in this paper, we conduct a systematic and in-depth analysis of 5 state-of-the-art neural code summarization models on 6 widely used BLEU variants, 4 pre-processing operations and their combinations, and 3 widely used datasets.

Autoencoders as Tools for Program Synthesis

sander102907/autoencoder_program_synthesis 16 Aug 2021

Recently there have been many advances in research on language modeling of source code.

Retrieval Augmented Code Generation and Summarization

rizwan09/redcoder Findings (EMNLP) 2021

To mimic developers' code or summary generation behavior, we propose a retrieval augmented framework, REDCODER, that retrieves relevant code or summaries from a retrieval database and provides them as a supplement to code generation or summarization models.

GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search

shangqing-liu/graphsearchnet 4 Nov 2021

Specifically, we propose to construct graphs for the source code and queries with bidirectional GGNN (BiGGNN) to capture the local structural information of the source code and queries.

Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization

gjcexp/codescribe ACL ARR November 2021

In this paper, we propose CODESCRIBE to model the hierarchical syntax structure of code by introducing a novel triplet position for code summarization.

GN-Transformer: Fusing Sequence and Graph Representation for Improved Code Summarization

chengjunyan1/gn-transformer-ast 17 Nov 2021

SCG is the result of the early fusion between a source code snippet and the AST representation.

Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack Overflow

scam2021-so/scam2021 27 Nov 2021

Automated source code summarization is a task that generates summarized information about the purpose, usage, and--or implementation of methods and classes to support understanding of these code entities.