no code implementations • 2 Jan 2025 • Shuzheng Gao, Chaozheng Wang, Cuiyun Gao, Xiaoqian Jiao, Chun Yong Chong, Shan Gao, Michael Lyu
Test cases are essential for validating the reliability and quality of software applications.
1 code implementation • 19 Dec 2024 • Ruida Hu, Chao Peng, Jingyi Ren, Bo Jiang, Xiangxin Meng, Qinyun Wu, Pengfei Gao, Xinchen Wang, Cuiyun Gao
In this work, we introduce CodeRepoQA, a large-scale benchmark specifically designed for evaluating repository-level question-answering capabilities in the field of software engineering.
1 code implementation • 11 Dec 2024 • Xin-Cheng Wen, Zirui Lin, Cuiyun Gao, Hongyu Zhang, Yong Wang, Qing Liao
To evaluate RepoSPD, we employ two widely-used datasets in security patch detection: SPI-DB and PatchDB.
no code implementations • 9 Dec 2024 • Shuqing Li, Chenran Zhang, Cuiyun Gao, Michael R. Lyu
The rapid advancement of Extended Reality (XR, encompassing AR, MR, and VR) and spatial computing technologies forms a foundational layer for the emerging Metaverse, enabling innovative applications across healthcare, education, manufacturing, and entertainment.
no code implementations • 27 Nov 2024 • Xinchen Wang, Pengfei Gao, Xiangxin Meng, Chao Peng, Ruida Hu, Yun Lin, Cuiyun Gao
Manually writing reproduction scripts is a time-consuming task with high requirements for developers.
1 code implementation • 22 Aug 2024 • Shuzheng Gao, Cuiyun Gao, Wenchao Gu, Michael Lyu
First, complex optimization methods such as combinatorial ones are hard to be captured by LLMs.
no code implementations • 7 Aug 2024 • Qinyun Wu, Chao Peng, Pengfei Gao, Ruida Hu, Haoyu Gan, Bo Jiang, Jinhe Tang, Zhiwen Deng, Zhanming Guan, Cuiyun Gao, Xia Liu, Ping Yang
Our empirical evaluation on 6 state-of-the-art models shows that test argumentation is critical in improving the accuracy of the benchmark and RepoMasterEval is able to report difference in model performance in real-world scenarios.
1 code implementation • 27 Feb 2024 • Yuanhang Yang, shiyi qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu
To address this issue, we present \tool, a novel MoE designed to enhance both the efficacy and efficiency of sparse MoE models.
no code implementations • 7 Dec 2023 • Zongjie Li, Chaozheng Wang, Chaowei Liu, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao
With recent advancements in Large Multimodal Models (LMMs) across various domains, a novel prompting method called visual referring prompting has emerged, showing significant potential in enhancing human-computer interaction within multimodal systems.
no code implementations • 29 Sep 2023 • Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu
Specifically, PORTIA splits the answers into multiple segments, aligns similar content across candidate answers, and then merges them back into a single prompt for evaluation by LLMs.
1 code implementation • 21 Aug 2023 • Xin-Cheng Wen, Xinchen Wang, Cuiyun Gao, Shaohua Wang, Yang Liu, Zhaoquan Gu
In this paper, we focus on the Positive and Unlabeled (PU) learning problem for vulnerability detection and propose a novel model named PILOT, i. e., PositIve and unlabeled Learning mOdel for vulnerability deTection.
1 code implementation • 12 Jun 2023 • Xin-Cheng Wen, Cuiyun Gao, Feng Luo, Haoyu Wang, Ge Li, Qing Liao
(2) adaptive re-weighting module, which adjusts the learning weights for different types according to the training epochs and numbers of associated samples by a novel training loss.
no code implementations • 19 Dec 2022 • Zi Gong, Yinpeng Guo, Pingyi Zhou, Cuiyun Gao, Yasheng Wang, Zenglin Xu
On the other hand, there are few studies exploring the effects of multi-programming-lingual (MultiPL) pre-training for the code completion, especially the impact on low-resource programming languages.
no code implementations • 12 Dec 2022 • Qingfu Zhu, Xianzhen Luo, Fang Liu, Cuiyun Gao, Wanxiang Che
Natural language processing for programming aims to use NLP techniques to assist programming.
no code implementations • 10 Nov 2022 • Anguo Dong, Cuiyun Gao, Yan Jia, Qing Liao, Xuan Wang, Lei Wang, Jing Xiao
In this work, we propose a novel Syntax-guided Domain Adaptation Model, named SDAM, for more effective cross-domain ABSA.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2
1 code implementation • 11 Oct 2022 • Yuanhang Yang, shiyi qi, Chuanyi Liu, Qifan Wang, Cuiyun Gao, Zenglin Xu
Transformer-based models have achieved great success on sentence pair modeling tasks, such as answer selection and natural language inference (NLI).
no code implementations • 28 Sep 2022 • Xinni Zhang, Yankai Chen, Cuiyun Gao, Qing Liao, Shenglin Zhao, Irwin King
Incorporating knowledge graphs (KGs) as side information in recommendation has recently attracted considerable attention.
1 code implementation • 24 Jul 2022 • Chaozheng Wang, Yuanhang Yang, Cuiyun Gao, Yun Peng, Hongyu Zhang, Michael R. Lyu
Besides, the performance of fine-tuning strongly relies on the amount of downstream data, while in practice, the scenarios with scarce data are common.
no code implementations • 14 May 2022 • Jingya Zang, Cuiyun Gao, Yupan Chen, Ruifeng Xu, Lanjun Zhou, Xuan Wang
However, reviews of music songs are generally long in length and most of them are non-informative for users.
no code implementations • 8 Apr 2022 • Jiezhu Cheng, Cuiyun Gao, Zibin Zheng
Due to the complex interactions among multiple options and the high cost of performance measurement under a huge configuration space, it is challenging to study how different configurations influence the system performance.
1 code implementation • 14 Feb 2022 • Zi Gong, Cuiyun Gao, Yasheng Wang, Wenchao Gu, Yun Peng, Zenglin Xu
We further show that how the proposed SCRIPT captures the structural relative dependencies.
no code implementations • 9 Nov 2021 • Chaozheng Wang, Shuzheng Gao, Cuiyun Gao, Pengyun Wang, Wenjie Pei, Lujia Pan, Zenglin Xu
Real-world data usually present long-tailed distributions.
no code implementations • 18 Oct 2021 • Langzhang Liang, Cuiyun Gao, Shiyi Chen, Shishi Duan, Yu Pan, Junjin Zheng, Lei Wang, Zenglin Xu
Graph Convolutional Networks (GCNs) are powerful for processing graph-structured data and have achieved state-of-the-art performance in several tasks such as node classification, link prediction, and graph classification.
no code implementations • 19 Apr 2021 • Shuzheng Gao, Cuiyun Gao, Yulan He, Jichuan Zeng, Lun Yiu Nie, Xin Xia, Michael R. Lyu
Code summaries help developers comprehend programs and reduce their time to infer the program functionalities during software maintenance.
no code implementations • 23 Aug 2020 • Cuiyun Gao, Jichuan Zeng, Zhiyuan Wen, David Lo, Xin Xia, Irwin King, Michael R. Lyu
Experiments on popular apps from Google Play and Apple's App Store demonstrate the effectiveness of MERIT in identifying emerging app issues, improving the state-of-the-art method by 22. 3% in terms of F1-score.
1 code implementation • 14 Jul 2020 • Lun Yiu Nie, Cuiyun Gao, Zhicong Zhong, Wai Lam, Yang Liu, Zenglin Xu
In this paper, we propose a novel Contextualized code representation learning strategy for commit message Generation (CoreGen).
no code implementations • 25 Jun 2020 • Chao Liu, Cuiyun Gao, Xin Xia, David Lo, John Grundy, Xiaohu Yang
Experimental results show the importance of replicability and reproducibility, where the reported performance of a DL model could not be replicated for an unstable optimization process.
1 code implementation • 24 Apr 2020 • Bozhi Wu, Sen Chen, Cuiyun Gao, Lingling Fan, Yang Liu, Weiping Wen, Michael R. Lyu
In this paper, to fill this gap, we propose a novel and interpretable ML-based approach (named XMal) to classify malware with high accuracy and explain the classification result meanwhile.
1 code implementation • 10 Feb 2020 • Cuiyun Gao, Jichuan Zeng, Xin Xia, David Lo, Michael R. Lyu, Irwin King
Previous studies showed that replying to a user review usually has a positive effect on the rating that is given by the user to the app.
no code implementations • 10 Feb 2020 • Jichuan Zeng, Jing Li, Yulan He, Cuiyun Gao, Michael R. Lyu, Irwin King
In our world with full of uncertainty, debates and argumentation contribute to the progress of science and society.
no code implementations • 20 Dec 2019 • JingKai Siow, Cuiyun Gao, Lingling Fan, Sen Chen, Yang Liu
The hinge of accurate code review suggestion is to learn good representations for both code changes and reviews.
no code implementations • 6 Dec 2019 • Shangqing Liu, Cuiyun Gao, Sen Chen, Lun Yiu Nie, Yang Liu
Moreover, although generation models have the advantages of synthesizing commit messages for new code changes, they are not easy to bridge the semantic gap between code and natural languages which could be mitigated by retrieval models.
Software Engineering
no code implementations • WS 2019 • Fenglei Jin, Cuiyun Gao, Michael R. Lyu
In this paper, we propose a novel online topic tracking framework, named IEDL, for tracking the topic changes related to deep learning techniques on Stack Exchange and automatically interpreting each identified topic.
1 code implementation • TACL 2019 • Jichuan Zeng, Jing Li, Yulan He, Cuiyun Gao, Michael R. Lyu, Irwin King
This paper presents an unsupervised framework for jointly modeling topic content and discourse behavior in microblog conversations.
no code implementations • EMNLP 2018 • Jichuan Zeng, Jing Li, Yan Song, Cuiyun Gao, Michael R. Lyu, Irwin King
Many classification models work poorly on short texts due to data sparsity.