Search Results for author: Collin McMillan

Found 18 papers, 14 papers with code

EyeTrans: Merging Human and Machine Attention for Neural Code Summarization

1 code implementation · 21 Feb 2024 · Yifan Zhang, Jiliang Li, Zachary Karas, Aakash Bansal, Toby Jia-Jun Li, Collin McMillan, Kevin Leach, Yu Huang

Neural code summarization leverages deep learning models to automatically generate brief natural language summaries of code snippets.

Code Summarization

Revisiting File Context for Source Code Summarization

1 code implementation · 5 Sep 2023 · Aakash Bansal, Chia-Yi Su, Collin McMillan

Source code summarization is the task of writing natural language descriptions of source code.

Code Summarization, Source Code Summarization

Semantic Similarity Loss for Neural Source Code Summarization

1 code implementation · 14 Aug 2023 · Chia-Yi Su, Collin McMillan

We also propose to combine our loss with traditional CCE for each word, which streamlines the training process compared to baselines.

Code Summarization, Semantic Similarity, +3 more

Towards Modeling Human Attention from Eye Movements for Neural Source Code Summarization

no code implementations · 16 May 2023 · Aakash Bansal, Bonita Sharif, Collin McMillan

The attention mechanism learns to connect features in source code to specific words to use when generating natural language descriptions.

Code Summarization, Source Code Summarization

A Language Model of Java Methods with Train/Test Deduplication

1 code implementation · 15 May 2023 · Chia-Yi Su, Aakash Bansal, Vijayanta Jain, Sepideh Ghanavati, Collin McMillan

In contrast to many existing language models, we prioritize features for researchers, including an open and easily searchable training set, a held-out test set with different levels of deduplication from the training set, infrastructure for deduplicating new examples, and an implementation platform suitable for execution on equipment accessible to a relatively modest budget.

Descriptive Language Modelling

Generating Clarifying Questions for Query Refinement in Source Code Search

1 code implementation · 24 Jan 2022 · Zachary Eberhart, Collin McMillan

In source code search, a common information-seeking strategy involves providing a short initial query with a broad meaning, and then iteratively refining the query using terms gleaned from the results of subsequent searches.

Code Search

Project-Level Encoding for Neural Source Code Summarization of Subroutines

1 code implementation · 22 Mar 2021 · Aakash Bansal, Sakib Haque, Collin McMillan

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine.

Code Summarization, Source Code Summarization

A Neural Question Answering System for Basic Questions about Subroutines

no code implementations · 11 Jan 2021 · Aakash Bansal, Zachary Eberhart, Lingfei Wu, Collin McMillan

In this paper, we take initial steps toward bringing state-of-the-art neural QA technologies to Software Engineering applications by designing a context-based QA system for basic questions about subroutines.

Question Answering

Improved Automatic Summarization of Subroutines via Attention to File Context

1 code implementation · 10 Apr 2020 · Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan

In this paper, we present an approach that models the file context of subroutines (i.e., other subroutines in the same file) and uses an attention mechanism to find words and concepts to use in summaries.

Software Engineering

Recommendations for Datasets for Source Code Summarization

7 code implementations · NAACL 2019 · Alexander LeClair, Collin McMillan

The main use for these descriptions is in software documentation, e.g., the one-sentence Java method descriptions in JavaDocs.

Code Summarization, Sentence, +1 more

A Neural Model for Generating Natural Language Summaries of Program Subroutines

2 code implementations · 5 Feb 2019 · Alexander LeClair, Siyuan Jiang, Collin McMillan

In this paper, we present a neural model that combines words from code with code structure from an AST.

Software Engineering

Adapting Neural Text Classification for Improved Software Categorization

1 code implementation · 5 Jun 2018 · Alexander LeClair, Zachary Eberhart, Collin McMillan

Software Categorization is the task of organizing software into groups that broadly describe the behavior of the software, such as "editors" or "science."

General Classification, text-classification, +1 more

Automatically Generating Commit Messages from Diffs using Neural Machine Translation

1 code implementation · 30 Aug 2017 · Siyuan Jiang, Ameer Armaly, Collin McMillan

We trained an NMT algorithm using a corpus of diffs and human-written commit messages from the top 1k GitHub projects.

Machine Translation, NMT, +1 more
