1 code implementation • 1 Jul 2024 • Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, Lingming Zhang
However, the complexity of these agent-based approaches, together with the limited abilities of current LLMs, raises the following question: Do we really have to employ complex autonomous software agents?
no code implementations • 23 Apr 2024 • Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin
A fundamental skill among human developers is the ability to understand and reason about program execution.
1 code implementation • 28 Mar 2024 • Chunqiu Steven Xia, Yinlin Deng, Lingming Zhang
Such limitations inevitably lead us to inquire: Is the leaderboard performance on existing benchmarks reliable and comprehensive enough to measure the program synthesis ability of LLMs?
1 code implementation • 24 Oct 2023 • Chenyuan Yang, Yinlin Deng, Runyu Lu, Jiayi Yao, Jiawei Liu, Reyhaneh Jabbarvand, Lingming Zhang
To this end, we propose WhiteFox, the first white-box compiler fuzzer using LLMs with source-code information to test compiler optimization, with a spotlight on detecting deep logic bugs in the deep learning (DL) compilers.
no code implementations • 26 Jul 2023 • Kensen Shi, Joey Hong, Yinlin Deng, Pengcheng Yin, Manzil Zaheer, Charles Sutton
When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks.
1 code implementation • 21 Feb 2022 • Jiawei Liu, Yuxiang Wei, Sen yang, Yinlin Deng, Lingming Zhang
Our results show that Tzer substantially outperforms existing fuzzing techniques on tensor compiler testing, with 75% higher coverage and 50% more valuable tests than the 2nd-best technique.