Search Results for author: Anh T. V. Dau

Found 3 papers, 2 papers with code

The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation

1 code implementation • 9 May 2023 • Dung Nguyen Manh, Nam Le Hai, Anh T. V. Dau, Anh Minh Nguyen, Khanh Nghiem, Jin Guo, Nghi D. Q. Bui

We present The Vault, a dataset of high-quality code-text pairs in multiple programming languages for training large language models to understand and generate code.

Code Generation • Code Search • +1

Class based Influence Functions for Error Detection

1 code implementation • 2 May 2023 • Thang Nguyen-Duc, Hoang Thanh-Tung, Quan Hung Tran, Dang Huu-Tien, Hieu Ngoc Nguyen, Anh T. V. Dau, Nghi D. Q. Bui

Influence functions (IFs) are a powerful tool for detecting anomalous examples in large-scale datasets.

Towards Using Data-Influence Methods to Detect Noisy Samples in Source Code Corpora

No code implementations • 25 May 2022 • Anh T. V. Dau, Thang Nguyen-Duc, Hoang Thanh-Tung, Nghi D. Q. Bui

Despite the recent trend of developing and applying neural source code models to software engineering tasks, the quality of such models is insufficient for real-world use.

Code Classification • Representation Learning
