Search Results for author: Collin McMillan

In contrast to many existing language models, we prioritize features for researchers including an open and easily-searchable training set, a held out test set with different levels of deduplication from the training set, infrastructure for deduplicating new examples, and an implementation platform suitable for execution on equipment accessible to a relatively modest budget.

Descriptive Language Modelling

Paper
Code

Generating Clarifying Questions for Query Refinement in Source Code Search

1 code implementation • 24 Jan 2022 • Zachary Eberhart, Collin McMillan

In source code search, a common information-seeking strategy involves providing a short initial query with a broad meaning, and then iteratively refining the query using terms gleaned from the results of subsequent searches.

Code Search

Paper
Code

Project-Level Encoding for Neural Source Code Summarization of Subroutines

1 code implementation • 22 Mar 2021 • Aakash Bansal, Sakib Haque, Collin McMillan

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine.

Code Summarization Source Code Summarization

Paper
Code

A Neural Question Answering System for Basic Questions about Subroutines

no code implementations • 11 Jan 2021 • Aakash Bansal, Zachary Eberhart, Lingfei Wu, Collin McMillan

In this paper, we take initial steps to bringing state-of-the-art neural QA technologies to Software Engineering applications by designing a context-based QA system for basic questions about subroutines.

Question Answering

Paper
Add Code

Improved Automatic Summarization of Subroutines via Attention to File Context

1 code implementation • 10 Apr 2020 • Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan

In this paper, we present an approach that models the file context of subroutines (i. e. other subroutines in the same file) and uses an attention mechanism to find words and concepts to use in summaries.

Software Engineering

Paper
Code

Improved Code Summarization via a Graph Neural Network

2 code implementations • 6 Apr 2020 • Alexander LeClair, Sakib Haque, Lingfei Wu, Collin McMillan

The first approaches to use structural information flattened the AST into a sequence.

Code Summarization Source Code Summarization

Paper
Code

Recommendations for Datasets for Source Code Summarization

7 code implementations • NAACL 2019 • Alexander LeClair, Collin McMillan

The main use for these descriptions is in software documentation e. g. the one-sentence Java method descriptions in JavaDocs.

Code Summarization Sentence +1

Paper
Code

A Neural Model for Generating Natural Language Summaries of Program Subroutines

2 code implementations • 5 Feb 2019 • Alexander LeClair, Siyuan Jiang, Collin McMillan

In this paper, we present a neural model that combines words from code with code structure from an AST.

Software Engineering

Paper
Code

Detecting Speech Act Types in Developer Question/Answer Conversations During Bug Repair

no code implementations • 13 Jun 2018 • Andrew Wood, Paige Rodeghero, Ameer Armaly, Collin McMillan

This paper targets the problem of speech act detection in conversations about bug repair.

Paper
Add Code

Adapting Neural Text Classification for Improved Software Categorization

1 code implementation • 5 Jun 2018 • Alexander LeClair, Zachary Eberhart, Collin McMillan

Software Categorization is the task of organizing software into groups that broadly describe the behavior of the software, such as "editors" or "science."

General Classification text-classification +1

Paper
Code

Automatically Generating Commit Messages from Diffs using Neural Machine Translation

1 code implementation • 30 Aug 2017 • Siyuan Jiang, Ameer Armaly, Collin McMillan

We trained an NMT algorithm using a corpus of diffs and human-written commit messages from the top 1k Github projects.

Machine Translation NMT +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.