Search Results for author: Steffen Herbold

Found 13 papers, 6 papers with code

Semantic similarity prediction is better than other semantic similarity measures

1 code implementation • 22 Sep 2023 • Steffen Herbold

Semantic similarity between natural language texts is typically measured either by looking at the overlap between subsequences (e. g., BLEU) or by using embeddings (e. g., BERTScore, S-BERT).

Paper
Code

AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays

no code implementations • 24 Apr 2023 • Steffen Herbold, Annette Hautli-Janisz, Ute Heuer, Zlata Kikteva, Alexander Trautsch

The writing style of the AI models exhibits linguistic characteristics that are different from those of the human-written essays, e. g., it is characterized by fewer discourse and epistemic markers, but more nominalizations and greater lexical diversity.

Math

Paper
Add Code

What are the Machine Learning best practices reported by practitioners on Stack Exchange?

no code implementations • 25 Jan 2023 • Anamaria Mojica-Hanke, Andrea Bayona, Mario Linares-Vásquez, Steffen Herbold, Fabio A. González

In particular, Software Engineering (SE) is one of those disciplines in which ML has been used for multiple tasks, like software categorization, bugs prediction, and testing.

Paper
Add Code

Studying the explanations for the automated prediction of bug and non-bug issues using LIME and SHAP

no code implementations • 15 Sep 2022 • Benjamin Ledel, Steffen Herbold

Objective: We want to understand if machine learning models provide explanations for the classification that are reasonable to us as humans and align with our assumptions of what the models should learn.

Type prediction

Paper
Add Code

Differential testing for machine learning: an analysis for classification algorithms beyond deep learning

1 code implementation • 25 Jul 2022 • Steffen Herbold, Steffen Tunkel

The execution of the feasible tests revealed that there is a large amount of deviations for the scores and classes.

Paper
Code

Predicting Issue Types with seBERT

1 code implementation • 3 May 2022 • Alexander Trautsch, Steffen Herbold

Pre-trained transformer models are the current state-of-the-art for natural language models processing.

Type prediction

Paper
Code

On the validity of pre-trained transformers for natural language processing in the software engineering domain

no code implementations • 10 Sep 2021 • Julian von der Mosel, Alexander Trautsch, Steffen Herbold

Transformers are the current state-of-the-art of natural language processing in many domains and are using traction within software engineering research as well.

Paper
Add Code

Expert Decision Support System for aeroacoustic source type identification using clustering

no code implementations • 27 Feb 2021 • Armin Goudarzi, Carsten Spehr, Steffen Herbold

Second, clustering is performed based on these properties.

Clustering General Classification +1

Paper
Add Code

Automatic source localization and spectra generation from sparse beamforming maps

no code implementations • 16 Dec 2020 • Armin Goudarzi, Carsten Spehr, Steffen Herbold

This paper presents two methods that enable the automated identification of aeroacoustic sources in sparse beamforming maps and the extraction of their corresponding spectra to overcome the manual definition of Regions Of Interest.

Clustering

Paper
Add Code

A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits

1 code implementation • 12 Nov 2020 • Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu, Diego Marcilio, Omar Alam, Abdullah Aldaeej, Idan Amit, Burak Turhan, Simon Eismann, Anna-Katharina Wickert, Ivano Malavolta, Matus Sulir, Fatemeh Fard, Austin Z. Henley, Stratos Kourtzanidis, Eray Tuzun, Christoph Treude, Simin Maleki Shamasbi, Ivan Pashchenko, Marvin Wyrich, James Davis, Alexander Serebrenik, Ella Albrecht, Ethem Utku Aktas, Daniel Strüber, Johannes Erbel

Methods: We use a crowd sourcing approach for manual labeling to validate which changes contribute to bug fixes for each line in bug fixing commits.

Software Engineering

Paper
Code

Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects

1 code implementation • 3 Sep 2020 • Steffen Herbold, Tobias Haar

Moreover, we found that these concepts can also be applied to hyperparameters, to further improve the quality of the smoke tests.

BIG-bench Machine Learning valid

Paper
Code

A Multi-Objective Anytime Rule Mining System to Ease Iterative Feedback from Domain Experts

1 code implementation • 23 Dec 2018 • Tobias Baum, Steffen Herbold, Kurt Schneider

In our research in this area, with data from open source projects as well as an industrial partner, we noticed several shortcomings of conventional data mining approaches for classification problems: (1) Domain experts' acceptance is of critical importance, and domain experts can provide valuable input, but it is hard to use this feedback.

Paper
Code

An Industrial Case Study on Shrinking Code Review Changesets through Remark Prediction

no code implementations • 22 Dec 2018 • Tobias Baum, Steffen Herbold, Kurt Schneider

To determine the importance of change parts, we extract data from software repositories and build prediction models for review remarks based on this data.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.