Search Results for author: Pinjia He

Found 19 papers, 10 papers with code

Aligning LLMs for FL-free Program Repair

no code implementations • 13 Apr 2024 • Junjielong Xu, Ying Fu, Shin Hwei Tan, Pinjia He

Our core insight is that LLM's APR capability can be greatly improved by simply aligning the output to their training objective and allowing them to refine the whole program without first performing fault localization.

Fault localization

Paper
Add Code

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

no code implementations • 1 Jan 2024 • Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

In addition, the test cases of LogicAsker can be further used to design demonstration examples for in-context learning, which effectively improves the logical reasoning ability of LLMs, e. g., 10\% for GPT-4.

Code Generation In-Context Learning +2

Paper
Add Code

Retromorphic Testing: A New Approach to the Test Oracle Problem

1 code implementation • 10 Oct 2023 • Boxi Yu, Qiuyang Mang, Qingshuo Guo, Pinjia He

Inspired by the mathematical concept of inverse function, we present Retromorphic Testing, a novel black-box testing methodology.

Paper
Code

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

no code implementations • 18 Aug 2023 • Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu

Moreover, through retraining the models with the test cases generated by OASIS, the robustness of the moderation model can be improved without performance degradation.

Paper
Add Code

Automated Testing and Improvement of Named Entity Recognition Systems

no code implementations • 14 Aug 2023 • Boxi Yu, Yiyan Hu, Qiuyang Mang, Wenhan Hu, Pinjia He

For automated repairing, TIN achieves a high error reduction rate (26. 8%-50. 6%) over the four systems under test, which successfully repairs 1, 056 out of the 1, 877 reported NER errors.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

1 code implementation • 12 Aug 2023 • Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu

We propose a novel framework CipherChat to systematically examine the generalizability of safety alignment to non-natural languages -- ciphers.

Ethics

516

Paper
Code

Validating Multimedia Content Moderation Software via Semantic Fusion

no code implementations • 23 May 2023 • Wenxuan Wang, Jingyuan Huang, Chang Chen, Jiazhen Gu, Jianping Zhang, Weibin Wu, Pinjia He, Michael Lyu

To this end, content moderation software has been widely deployed on these platforms to detect and blocks toxic content.

Sentence

Paper
Add Code

BiasAsker: Measuring the Bias in Conversational AI System

1 code implementation • 21 May 2023 • Yuxuan Wan, Wenxuan Wang, Pinjia He, Jiazhen Gu, Haonan Bai, Michael Lyu

Particularly, it is hard to generate inputs that can comprehensively trigger potential bias due to the lack of data containing both social groups as well as biased properties.

Bias Detection

Paper
Code

MTTM: Metamorphic Testing for Textual Content Moderation Software

1 code implementation • 11 Feb 2023 • Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

In addition, we leverage the test cases generated by MTTM to retrain the model we explored, which largely improves model robustness (0% to 5. 9% EFR) while maintaining the accuracy on the original test set.

Sentence

Paper
Code

AEON: A Method for Automatic Evaluation of NLP Test Cases

1 code implementation • 13 May 2022 • Jen-tse Huang, Jianping Zhang, Wenxuan Wang, Pinjia He, Yuxin Su, Michael R. Lyu

However, in practice, many of the generated test cases fail to preserve similar semantic meaning and are unnatural (e. g., grammar errors), which leads to a high false alarm rate and unnatural test cases.

Paper
Code

Empirical Standards for Software Engineering Research

1 code implementation • 7 Oct 2020 • Paul Ralph, Nauman bin Ali, Sebastian Baltes, Domenico Bianculli, Jessica Diaz, Yvonne Dittrich, Neil Ernst, Michael Felderer, Robert Feldt, Antonio Filieri, Breno Bernard Nicolau de França, Carlo Alberto Furia, Greg Gay, Nicolas Gold, Daniel Graziotin, Pinjia He, Rashina Hoda, Natalia Juristo, Barbara Kitchenham, Valentina Lenarduzzi, Jorge Martínez, Jorge Melegati, Daniel Mendez, Tim Menzies, Jefferson Molleri, Dietmar Pfahl, Romain Robbes, Daniel Russo, Nyyti Saarimäki, Federica Sarro, Janet Siegmund, Diomidis Spinellis, Miroslaw Staron, Klaas Stol, Margaret-Anne Storey, Davide Taibi, Damian Tamburri, Marco Torchiano, Christoph Treude, Burak Turhan, XiaoFeng Wang, Sira Vegas

Empirical Standards are natural-language models of a scientific community's expectations for a specific kind of study (e. g. a questionnaire survey).

Software Engineering General Literature

277

Paper
Code

Loghub: A Large Collection of System Log Datasets for AI-driven Log Analytics

8 code implementations • 14 Aug 2020 • Shilin He, Jieming Zhu, Pinjia He, Michael R. Lyu

To fill this significant gap and facilitate more research on AI-driven log analytics, we have collected and released loghub, a large collection of system log datasets.

Software Engineering

1,518

Paper
Code

Testing Machine Translation via Referential Transparency

no code implementations • 22 Apr 2020 • Pinjia He, Clara Meister, Zhendong Su

Machine translation software has seen rapid progress in recent years due to the advancement of deep neural networks.

Machine Translation Medical Diagnosis +1

Paper
Add Code

Logzip: Extracting Hidden Structures via Iterative Clustering for Log Compression

1 code implementation • 24 Sep 2019 • Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, Michael R. Lyu

Data compression is essential to reduce the cost of log storage.

Databases Software Engineering

Paper
Code

Structure-Invariant Testing for Machine Translation

2 code implementations • 19 Jul 2019 • Pinjia He, Clara Meister, Zhendong Su

Despite its apparent importance, validating the robustness of machine translation systems is very difficult and has, therefore, been much under-explored.

Dependency Parsing Machine Translation +3

Paper
Code

Tools and Benchmarks for Automated Log Parsing

7 code implementations • 8 Nov 2018 • Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, Michael R. Lyu

Logs are imperative in the development and maintenance process of many software systems.

Software Engineering

1,518

Paper
Code

Testing Untestable Neural Machine Translation: An Industrial Case

no code implementations • 6 Jul 2018 • Wujie Zheng, Wenyu Wang, Dian Liu, Changrong Zhang, Qinsong Zeng, Yuetang Deng, Wei Yang, Pinjia He, Tao Xie

To fill the gap of lacking test oracle for in-vivo testing of an NMT system, in this paper, we propose a new approach for automatically identifying translation failures, without requiring reference translations for a translation task; our approach can directly serve as a test oracle for in-vivo testing.

Machine Translation NMT +2

Paper
Add Code

A Directed Acyclic Graph Approach to Online Log Parsing

no code implementations • 12 Jun 2018 • Pinjia He, Jieming Zhu, Pengcheng Xu, Zibin Zheng, Michael R. Lyu

A typical log-based system reliability management procedure is to first parse log messages because of their unstructured format; and apply data mining techniques on the parsed logs to obtain critical system behavior information.

Software Engineering

Paper
Add Code

Semantically Consistent Image Completion with Fine-grained Details

no code implementations • 26 Nov 2017 • Pengpeng Liu, Xiaojuan Qi, Pinjia He, Yikang Li, Michael R. Lyu, Irwin King

Image completion has achieved significant progress due to advances in generative adversarial networks (GANs).

Image Inpainting

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.