Search Results for author: Lingming Zhang

Found 29 papers, 20 papers with code

KNighter: Transforming Static Analysis with LLM-Synthesized Checkers

1 code implementation12 Mar 2025 Chenyuan Yang, Zijie Zhao, Zichen Xie, Haoyu Li, Lingming Zhang

Our evaluation on the Linux kernel demonstrates that KNighter generates high-precision checkers capable of detecting diverse bug patterns overlooked by existing human-written analyzers.

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

no code implementations25 Feb 2025 Yuxiang Wei, Olivier Duchenne, Jade Copet, Quentin Carbonneaux, Lingming Zhang, Daniel Fried, Gabriel Synnaeve, Rishabh Singh, Sida I. Wang

The recent DeepSeek-R1 release has demonstrated the immense potential of reinforcement learning (RL) in enhancing the general reasoning capabilities of large language models (LLMs).

Math Reinforcement Learning (RL)

SelfCodeAlign: Self-Alignment for Code Generation

2 code implementations31 Oct 2024 Yuxiang Wei, Federico Cassano, Jiawei Liu, Yifeng Ding, Naman jain, Zachary Mueller, Harm de Vries, Leandro von Werra, Arjun Guha, Lingming Zhang

In our primary experiments, we use SelfCodeAlign with CodeQwen1. 5-7B to generate a dataset of 74k instruction-response pairs.

Code Generation HumanEval

Large Language Model-Based Agents for Software Engineering: A Survey

1 code implementation4 Sep 2024 Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou

The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i. e., LLM-based agents.

AI Agent Language Modeling +2

Evaluating Language Models for Efficient Code Generation

no code implementations12 Aug 2024 Jiawei Liu, Songrun Xie, Junhao Wang, Yuxiang Wei, Yifeng Ding, Lingming Zhang

We introduce Differential Performance Evaluation (DPE), a framework designed to reliably evaluate Large Language Models (LLMs) for efficient code generation.

Code Generation

Agentless: Demystifying LLM-based Software Engineering Agents

1 code implementation1 Jul 2024 Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, Lingming Zhang

However, the complexity of these agent-based approaches, together with the limited abilities of current LLMs, raises the following question: Do we really have to employ complex autonomous software agents?

Program Repair

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

1 code implementation23 Apr 2024 Yifeng Ding, Jiawei Liu, Yuxiang Wei, Terry Yue Zhuo, Lingming Zhang

We introduce XFT, a simple yet powerful training scheme, by simply merging upcycled Mixture-of-Experts (MoE) to unleash the performance limit of instruction-tuned code Large Language Models (LLMs).

HumanEval mbpp +1

Productively Deploying Emerging Models on Emerging Platforms: A Top-Down Approach for Testing and Debugging

no code implementations14 Apr 2024 Siyuan Feng, Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen

However, this traditional development approach fails to meet the productivity requirements when deploying emerging ML applications, with the testing and debugging part as a bottleneck.

Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM

1 code implementation28 Mar 2024 Chunqiu Steven Xia, Yinlin Deng, Lingming Zhang

Such limitations inevitably lead us to inquire: Is the leaderboard performance on existing benchmarks reliable and comprehensive enough to measure the program synthesis ability of LLMs?

Code Generation HumanEval +2

KernelGPT: Enhanced Kernel Fuzzing via Large Language Models

1 code implementation31 Dec 2023 Chenyuan Yang, Zijie Zhao, Lingming Zhang

As a result, a large body of research has been focused on kernel fuzzing, i. e., automatically generating syscall (system call) sequences to detect potential kernel bugs or vulnerabilities.

valid

Magicoder: Empowering Code Generation with OSS-Instruct

3 code implementations4 Dec 2023 Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang

Magicoder models are trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets to generate diverse instruction data for code.

Code Generation HumanEval +1

WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models

1 code implementation24 Oct 2023 Chenyuan Yang, Yinlin Deng, Runyu Lu, Jiayi Yao, Jiawei Liu, Reyhaneh Jabbarvand, Lingming Zhang

To this end, we propose WhiteFox, the first white-box compiler fuzzer using LLMs with source-code information to test compiler optimization, with a spotlight on detecting deep logic bugs in the deep learning (DL) compilers.

Code Generation Compiler Optimization

Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair

1 code implementation1 Sep 2023 Yuxiang Wei, Chunqiu Steven Xia, Lingming Zhang

Therefore, we propose Repilot, a general code generation framework to further copilot the AI "copilots" (i. e., LLMs) by synthesizing more valid patches during the repair process.

Code Generation Program Repair +1

Fuzz4All: Universal Fuzzing with Large Language Models

1 code implementation9 Aug 2023 Chunqiu Steven Xia, Matteo Paltenghi, Jia Le Tian, Michael Pradel, Lingming Zhang

Moreover, the inputs generated by existing fuzzers are often limited to specific features of the input language, and thus can hardly reveal bugs related to other or new features.

Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT

no code implementations1 Apr 2023 Chunqiu Steven Xia, Lingming Zhang

For earlier patches that failed to pass all tests, we combine the incorrect patches with their corresponding relevant test failure information to construct a new prompt for the LLM to generate the next patch.

Program Repair

Revisiting the Plastic Surgery Hypothesis via Large Language Models

no code implementations18 Mar 2023 Chunqiu Steven Xia, Yifeng Ding, Lingming Zhang

Traditional APR tools have largely leveraged the plastic surgery hypothesis by designing manual or heuristic-based approaches to exploit such existing code ingredients.

Program Repair

NeuRI: Diversifying DNN Generation via Inductive Rule Inference

1 code implementation4 Feb 2023 Jiawei Liu, Jinjun Peng, Yuyao Wang, Lingming Zhang

NeuRI finds 100 new bugs for PyTorch and TensorFlow in four months, with 81 already fixed or confirmed.

Decision Making Program Synthesis +1

Conversational Automated Program Repair

no code implementations30 Jan 2023 Chunqiu Steven Xia, Lingming Zhang

As such, we leverage the long-term context window of LLMs to not only avoid generating previously incorrect patches but also incorporate validation feedback to help the model understand the semantic meaning of the program under test.

Program Repair

NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers

1 code implementation26 Jul 2022 Jiawei Liu, JinKun Lin, Fabian Ruffy, Cheng Tan, Jinyang Li, Aurojit Panda, Lingming Zhang

In this work, we propose a new fuzz testing approach for finding bugs in deep-learning compilers.

valid

Two-Stream Graph Convolutional Network for Intra-oral Scanner Image Segmentation

1 code implementation19 Apr 2022 Yue Zhao, Lingming Zhang, Yang Liu, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen

The state-of-the-art deep learning-based methods often simply concatenate the raw geometric attributes (i. e., coordinates and normal vectors) of mesh cells to train a single-stream network for automatic intra-oral scanner image segmentation.

Graph Learning Image Segmentation +3

Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation

1 code implementation21 Feb 2022 Jiawei Liu, Yuxiang Wei, Sen yang, Yinlin Deng, Lingming Zhang

Our results show that Tzer substantially outperforms existing fuzzing techniques on tensor compiler testing, with 75% higher coverage and 50% more valuable tests than the 2nd-best technique.

3D Dental model segmentation with graph attentional convolution network

no code implementations Pattern Recognition Letters 2021 Yue Zhao, Lingming Zhang, Chongshi Yang, Yingyun Tan, Yang Liu, Pengcheng Li, Tianhao Huang, Chenqiang Gao

We have evaluated our network on a real-patient dataset of dental models acquired through 3D intraoral scanners, and experimental results show that our method outperforms state-of-the-art deep learning methods for 3D shape segmentation.

Segmentation

TSGCNet: Discriminative Geometric Feature Learning With Two-Stream Graph Convolutional Network for 3D Dental Model Segmentation

no code implementations CVPR 2021 Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen

State-of-the-art methods directly concatenate the raw attributes of 3D inputs, namely coordinates and normal vectors of mesh cells, to train a single-stream network for fully-automated tooth segmentation.

Graph Learning

TSGCNet: Discriminative Geometric Feature Learning with Two-Stream GraphConvolutional Network for 3D Dental Model Segmentation

1 code implementation26 Dec 2020 Lingming Zhang, Yue Zhao, Deyu Meng, Zhiming Cui, Chenqiang Gao, Xinbo Gao, Chunfeng Lian, Dinggang Shen

State-of-the-art methods directly concatenate the raw attributes of 3D inputs, namely coordinates and normal vectors of mesh cells, to train a single-stream network for fully-automated tooth segmentation.

Graph Learning

DeepBillboard: Systematic Physical-World Testing of Autonomous Driving Systems

no code implementations27 Dec 2018 Husheng Zhou, Wei Li, Yuankun Zhu, Yuqun Zhang, Bei Yu, Lingming Zhang, Cong Liu

Furthermore, DeepBillboard is sufficiently robust and resilient for generating physical-world adversarial billboard tests for real-world driving under various weather conditions.

Autonomous Driving DNN Testing

DeepRoad: GAN-based Metamorphic Autonomous Driving System Testing

1 code implementation7 Feb 2018 Mengshi Zhang, Yuqun Zhang, Lingming Zhang, Cong Liu, Sarfraz Khurshid

In this paper, we propose DeepRoad, an unsupervised framework to automatically generate large amounts of accurate driving scenes to test the consistency of DNN-based autonomous driving systems across different scenes.

Software Engineering

Cannot find the paper you are looking for? You can Submit a new open access paper.