Search Results for author: Steffen Herbold

Found 13 papers, 6 papers with code

Semantic similarity prediction is better than other semantic similarity measures

1 code implementation22 Sep 2023 Steffen Herbold

Semantic similarity between natural language texts is typically measured either by looking at the overlap between subsequences (e. g., BLEU) or by using embeddings (e. g., BERTScore, S-BERT).

Semantic Similarity Semantic Textual Similarity +2

AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays

no code implementations24 Apr 2023 Steffen Herbold, Annette Hautli-Janisz, Ute Heuer, Zlata Kikteva, Alexander Trautsch

The writing style of the AI models exhibits linguistic characteristics that are different from those of the human-written essays, e. g., it is characterized by fewer discourse and epistemic markers, but more nominalizations and greater lexical diversity.

Math

What are the Machine Learning best practices reported by practitioners on Stack Exchange?

no code implementations25 Jan 2023 Anamaria Mojica-Hanke, Andrea Bayona, Mario Linares-Vásquez, Steffen Herbold, Fabio A. González

In particular, Software Engineering (SE) is one of those disciplines in which ML has been used for multiple tasks, like software categorization, bugs prediction, and testing.

Studying the explanations for the automated prediction of bug and non-bug issues using LIME and SHAP

no code implementations15 Sep 2022 Benjamin Ledel, Steffen Herbold

Objective: We want to understand if machine learning models provide explanations for the classification that are reasonable to us as humans and align with our assumptions of what the models should learn.

Type prediction

Differential testing for machine learning: an analysis for classification algorithms beyond deep learning

1 code implementation25 Jul 2022 Steffen Herbold, Steffen Tunkel

The execution of the feasible tests revealed that there is a large amount of deviations for the scores and classes.

Predicting Issue Types with seBERT

1 code implementation3 May 2022 Alexander Trautsch, Steffen Herbold

Pre-trained transformer models are the current state-of-the-art for natural language models processing.

Type prediction

On the validity of pre-trained transformers for natural language processing in the software engineering domain

no code implementations10 Sep 2021 Julian von der Mosel, Alexander Trautsch, Steffen Herbold

Transformers are the current state-of-the-art of natural language processing in many domains and are using traction within software engineering research as well.

Automatic source localization and spectra generation from sparse beamforming maps

no code implementations16 Dec 2020 Armin Goudarzi, Carsten Spehr, Steffen Herbold

This paper presents two methods that enable the automated identification of aeroacoustic sources in sparse beamforming maps and the extraction of their corresponding spectra to overcome the manual definition of Regions Of Interest.

Clustering

Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects

1 code implementation3 Sep 2020 Steffen Herbold, Tobias Haar

Moreover, we found that these concepts can also be applied to hyperparameters, to further improve the quality of the smoke tests.

BIG-bench Machine Learning valid

A Multi-Objective Anytime Rule Mining System to Ease Iterative Feedback from Domain Experts

1 code implementation23 Dec 2018 Tobias Baum, Steffen Herbold, Kurt Schneider

In our research in this area, with data from open source projects as well as an industrial partner, we noticed several shortcomings of conventional data mining approaches for classification problems: (1) Domain experts' acceptance is of critical importance, and domain experts can provide valuable input, but it is hard to use this feedback.

An Industrial Case Study on Shrinking Code Review Changesets through Remark Prediction

no code implementations22 Dec 2018 Tobias Baum, Steffen Herbold, Kurt Schneider

To determine the importance of change parts, we extract data from software repositories and build prediction models for review remarks based on this data.

Cannot find the paper you are looking for? You can Submit a new open access paper.