no code implementations • 30 Jul 2023 • Tiezhu Sun, Weiguo Pian, Nadia Daoudi, Kevin Allix, Tegawendé F. Bissyandé, Jacques Klein
This efficiency, coupled with its state-of-the-art performance, highlights LaFiCMIL's potential as a groundbreaking approach in the field of large file classification.
1 code implementation • 12 Dec 2022 • Tiezhu Sun, Kevin Allix, Kisub Kim, Xin Zhou, Dongsun Kim, David Lo, Tegawendé F. Bissyandé, Jacques Klein
Central to applying ML to software artifacts (like source or executable code) is converting them into forms suitable for learning.
no code implementations • 9 Dec 2022 • Xunzhu Tang, Tiezhu Sun, Rujie Zhu, Shi Wang
Recently, neural language representation models pre-trained on large corpus can capture rich co-occurrence information and be fine-tuned in downstream tasks to improve the performance.
no code implementations • 9 Dec 2022 • Xunzhu Tang, Rujie Zhu, Tiezhu Sun, Shi Wang
Recently, language representation techniques have achieved great performances in text classification.
1 code implementation • 3 Dec 2022 • Yinghua Li, Xueqi Dang, Haoye Tian, Tiezhu Sun, Zhijie Wang, Lei Ma, Jacques Klein, Tegawende F. Bissyande
In this paper, we conduct the most extensive empirical study on 56, 682 published AI apps from three perspectives: dataset characteristics, development issues, and user feedback and privacy.
1 code implementation • 13 Jun 2022 • Weiguo Pian, Hanyu Peng, Xunzhu Tang, Tiezhu Sun, Haoye Tian, Andrew Habib, Jacques Klein, Tegawendé F. Bissyandé
Representation learning of source code is essential for applying machine learning to software engineering tasks.