Search Results for author: Denys Poshyvanyk

Found 27 papers, 6 papers with code

On Explaining (Large) Language Models For Code Using Global Code-Based Explanations

no code implementations21 Mar 2025 David N. Palacio, Dipin Khati, Daniel Rodriguez-Cardenas, Alejandro Velasco, Denys Poshyvanyk

Moreover, participants of this study highlighted Code$Q$'s ability to show a causal relationship between the input and output of the model with readable and informative explanations on code completion and test generation tasks.

Code Completion Code Generation

Mapping the Trust Terrain: LLMs in Software Engineering -- Insights and Perspectives

no code implementations18 Mar 2025 Dipin Khati, Yijin Liu, David N. Palacio, Yixuan Zhang, Denys Poshyvanyk

To bring clarity to the current research status and identify opportunities for future work, we conducted a comprehensive review of $88$ papers: a systematic literature review of $18$ papers focused on LLMs in SE, complemented by an analysis of 70 papers from broader trust literature.

Systematic Literature Review

SnipGen: A Mining Repository Framework for Evaluating LLMs for Code

no code implementations10 Feb 2025 Daniel Rodriguez-Cardenas, Alejandro Velasco, Denys Poshyvanyk

SnipGen aims to mitigate data contamination by generating robust testbeds and crafting tailored data points to assist researchers and practitioners in evaluating LLMs for code-related tasks.

Code Generation Prompt Engineering

The ML Supply Chain in the Era of Software 2.0: Lessons Learned from Hugging Face

no code implementations6 Feb 2025 Trevor Stalnaker, Nathan Wintersgill, Oscar Chaparro, Laura A. Heymann, Massimiliano Di Penta, Daniel M German, Denys Poshyvanyk

First, we evaluate the current state of documentation in the Hugging Face supply chain, report real-world examples of shortcomings, and offer actionable suggestions for improvement.

Toward Neurosymbolic Program Comprehension

no code implementations3 Feb 2025 Alejandro Velasco, Aya Garryyeva, David N. Palacio, Antonio Mastropaolo, Denys Poshyvanyk

Recent advancements in Large Language Models (LLMs) have paved the way for Large Code Models (LCMs), enabling automation in complex software engineering tasks, such as code generation, software testing, and program comprehension, among others.

Code Generation software testing

How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study

no code implementations25 Dec 2024 Alejandro Velasco, Daniel Rodriguez-Cardenas, Luftar Rahman Alif, David N. Palacio, Denys Poshyvanyk

These findings highlight the effectiveness of our benchmark in evaluating LLMs, providing valuable insights into their reliability and their propensity to introduce code smells in code generation tasks.

Benchmarking Code Generation

On Interpreting the Effectiveness of Unsupervised Software Traceability with Information Theory

no code implementations6 Dec 2024 David N. Palacio, Daniel Rodriguez-Cardenas, Denys Poshyvanyk, Kevin Moran

In this paper, we propose an approach, TraceXplainer, for using information theory metrics to evaluate and better understand the performance (limits) of unsupervised traceability techniques.

Information Retrieval Informativeness

Perspective of Software Engineering Researchers on Machine Learning Practices Regarding Research, Review, and Education

1 code implementation28 Nov 2024 Anamaria Mojica-Hanke, David Nader Palacio, Denys Poshyvanyk, Mario Linares-Vásquez, Steffen Herbold

Objective: This study aims to contribute to the knowledge, about the synergy between ML and SE from the perspective of SE researchers, by providing insights into the practices followed when researching, teaching, and reviewing SE studies that apply ML.

Articles

Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development

no code implementations16 Nov 2024 Trevor Stalnaker, Nathan Wintersgill, Oscar Chaparro, Laura A. Heymann, Massimiliano Di Penta, Daniel M German, Denys Poshyvanyk

Despite the utility that Generative AI (GenAI) tools provide for tasks such as writing code, the use of these tools raises important legal questions and potential risks, particularly those associated with copyright law.

Misconceptions Survey

Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports

no code implementations11 Jul 2024 Yanfu Yan, Nathan Cooper, Oscar Chaparro, Kevin Moran, Denys Poshyvanyk

Video-based bug reports are increasingly being used to document bugs for programs centered around a graphical user interface (GUI).

Video Alignment

Benchmarking Causal Study to Interpret Large Language Models for Source Code

no code implementations23 Aug 2023 Daniel Rodriguez-Cardenas, David N. Palacio, Dipin Khati, Henry Burke, Denys Poshyvanyk

We illustrate the insights of our benchmarking strategy by conducting a case study on the performance of ChatGPT under distinct prompt engineering methods.

Benchmarking Causal Inference +4

Evaluating and Explaining Large Language Models for Code Using Syntactic Structures

no code implementations7 Aug 2023 David N Palacio, Alejandro Velasco, Daniel Rodriguez-Cardenas, Kevin Moran, Denys Poshyvanyk

To this end, this paper introduces ASTxplainer, an explainability method specific to LLMs for code that enables both new methods for LLM evaluation and visualizations of LLM predictions that aid end-users in understanding model predictions.

Toward a Theory of Causation for Interpreting Neural Code Models

no code implementations7 Feb 2023 David N. Palacio, Alejandro Velasco, Nathan Cooper, Alvaro Rodriguez, Kevin Moran, Denys Poshyvanyk

To demonstrate the practical benefit of $do_{code}$, we illustrate the insights that our framework can provide by performing a case study on two popular deep learning architectures and ten NCMs.

Causal Inference

An Empirical Investigation into the Use of Image Captioning for Automated Software Documentation

no code implementations3 Jan 2023 Kevin Moran, Ali Yachnes, George Purnell, Junayed Mahmud, Michele Tufano, Carlos Bernal-Cárdenas, Denys Poshyvanyk, Zach H'Doubler

This paper offers one of the first comprehensive empirical investigations into the connection between GUIs and functional, natural language descriptions of software.

Image Captioning Machine Translation

It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports

1 code implementation22 Jan 2021 Nathan Cooper, Carlos Bernal-Cárdenas, Oscar Chaparro, Kevin Moran, Denys Poshyvanyk

Given the importance of visual information to the process of identifying and understanding such bugs, users are increasingly making use of screenshots and screen-recordings as a means to report issues to developers.

Optical Character Recognition Optical Character Recognition (OCR) +1

A Systematic Literature Review on the Use of Deep Learning in Software Engineering Research

no code implementations14 Sep 2020 Cody Watson, Nathan Cooper, David Nader Palacio, Kevin Moran, Denys Poshyvanyk

An increasingly popular set of techniques adopted by software engineering (SE) researchers to automate development tasks are those rooted in the concept of Deep Learning (DL).

Automated Feature Engineering Feature Engineering +1

Translating Video Recordings of Mobile App Usages into Replayable Scenarios

no code implementations18 May 2020 Carlos Bernal-Cárdenas, Nathan Cooper, Kevin Moran, Oscar Chaparro, Andrian Marcus, Denys Poshyvanyk

In light of unique mobile development constraints, including swift release cycles and rapidly evolving platforms, automated techniques for analyzing all types of rich software artifacts provide benefit to mobile developers.

image-classification Image Classification +2

Improving the Effectiveness of Traceability Link Recovery using Hierarchical Bayesian Networks

no code implementations18 May 2020 Kevin Moran, David N. Palacio, Carlos Bernal-Cárdenas, Daniel McCrystal, Denys Poshyvanyk, Chris Shenefiel, Jeff Johnson

To this end, we design and implement a HierarchiCal PrObabilistic Model for SoftwarE Traceability (Comet) that is able to infer candidate trace links.

DeepMutation: A Neural Mutation Tool

no code implementations12 Feb 2020 Michele Tufano, Jason Kimko, Shiya Wang, Cody Watson, Gabriele Bavota, Massimiliano Di Penta, Denys Poshyvanyk

To this aim, two characteristics of mutation testing frameworks are of paramount importance: (i) they should generate mutants that are representative of real faults; and (ii) they should provide a complete tool chain able to automatically generate, inject, and test the mutants.

Decoder Fault Detection

On Learning Meaningful Code Changes via Neural Machine Translation

no code implementations25 Jan 2019 Michele Tufano, Jevgenija Pantiuchina, Cody Watson, Gabriele Bavota, Denys Poshyvanyk

We show that, when applied in a narrow enough context (i. e., small/medium-sized pairs of methods before/after the pull request changes), NMT can automatically replicate the changes implemented by developers during pull requests in up to 36% of the cases.

Bug fixing Machine Translation +2

Learning How to Mutate Source Code from Bug-Fixes

no code implementations27 Dec 2018 Michele Tufano, Cody Watson, Gabriele Bavota, Massimiliano Di Penta, Martin White, Denys Poshyvanyk

Starting from code fixed by developers in the context of a bug-fix, our empirical evaluation showed that our models are able to predict mutants that resemble original fixed bugs in between 9% and 45% of the cases (depending on the model).

Software Engineering

Sorting and Transforming Program Repair Ingredients via Deep Learning Code Similarities

1 code implementation15 Jul 2017 Martin White, Michele Tufano, Matias Martinez, Martin Monperrus, Denys Poshyvanyk

We aim to reason about the repair ingredients by using code similarities to prioritize and transform statements in a codebase for patch generation.

Software Engineering

Cannot find the paper you are looking for? You can Submit a new open access paper.