1 code implementation • 22 Sep 2023 • Steffen Herbold
Semantic similarity between natural language texts is typically measured either by looking at the overlap between subsequences (e. g., BLEU) or by using embeddings (e. g., BERTScore, S-BERT).
no code implementations • 24 Apr 2023 • Steffen Herbold, Annette Hautli-Janisz, Ute Heuer, Zlata Kikteva, Alexander Trautsch
The writing style of the AI models exhibits linguistic characteristics that are different from those of the human-written essays, e. g., it is characterized by fewer discourse and epistemic markers, but more nominalizations and greater lexical diversity.
no code implementations • 25 Jan 2023 • Anamaria Mojica-Hanke, Andrea Bayona, Mario Linares-Vásquez, Steffen Herbold, Fabio A. González
In particular, Software Engineering (SE) is one of those disciplines in which ML has been used for multiple tasks, like software categorization, bugs prediction, and testing.
no code implementations • 15 Sep 2022 • Benjamin Ledel, Steffen Herbold
Objective: We want to understand if machine learning models provide explanations for the classification that are reasonable to us as humans and align with our assumptions of what the models should learn.
1 code implementation • 25 Jul 2022 • Steffen Herbold, Steffen Tunkel
The execution of the feasible tests revealed that there is a large amount of deviations for the scores and classes.
1 code implementation • 3 May 2022 • Alexander Trautsch, Steffen Herbold
Pre-trained transformer models are the current state-of-the-art for natural language models processing.
no code implementations • 10 Sep 2021 • Julian von der Mosel, Alexander Trautsch, Steffen Herbold
Transformers are the current state-of-the-art of natural language processing in many domains and are using traction within software engineering research as well.
no code implementations • 27 Feb 2021 • Armin Goudarzi, Carsten Spehr, Steffen Herbold
Second, clustering is performed based on these properties.
no code implementations • 16 Dec 2020 • Armin Goudarzi, Carsten Spehr, Steffen Herbold
This paper presents two methods that enable the automated identification of aeroacoustic sources in sparse beamforming maps and the extraction of their corresponding spectra to overcome the manual definition of Regions Of Interest.
1 code implementation • 12 Nov 2020 • Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu, Diego Marcilio, Omar Alam, Abdullah Aldaeej, Idan Amit, Burak Turhan, Simon Eismann, Anna-Katharina Wickert, Ivano Malavolta, Matus Sulir, Fatemeh Fard, Austin Z. Henley, Stratos Kourtzanidis, Eray Tuzun, Christoph Treude, Simin Maleki Shamasbi, Ivan Pashchenko, Marvin Wyrich, James Davis, Alexander Serebrenik, Ella Albrecht, Ethem Utku Aktas, Daniel Strüber, Johannes Erbel
Methods: We use a crowd sourcing approach for manual labeling to validate which changes contribute to bug fixes for each line in bug fixing commits.
Software Engineering
1 code implementation • 3 Sep 2020 • Steffen Herbold, Tobias Haar
Moreover, we found that these concepts can also be applied to hyperparameters, to further improve the quality of the smoke tests.
1 code implementation • 23 Dec 2018 • Tobias Baum, Steffen Herbold, Kurt Schneider
In our research in this area, with data from open source projects as well as an industrial partner, we noticed several shortcomings of conventional data mining approaches for classification problems: (1) Domain experts' acceptance is of critical importance, and domain experts can provide valuable input, but it is hard to use this feedback.
no code implementations • 22 Dec 2018 • Tobias Baum, Steffen Herbold, Kurt Schneider
To determine the importance of change parts, we extract data from software repositories and build prediction models for review remarks based on this data.