Search Results for author: Pinjia He

Found 19 papers, 10 papers with code

Aligning LLMs for FL-free Program Repair

no code implementations13 Apr 2024 Junjielong Xu, Ying Fu, Shin Hwei Tan, Pinjia He

Our core insight is that LLM's APR capability can be greatly improved by simply aligning the output to their training objective and allowing them to refine the whole program without first performing fault localization.

Fault localization

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

no code implementations1 Jan 2024 Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

In addition, the test cases of LogicAsker can be further used to design demonstration examples for in-context learning, which effectively improves the logical reasoning ability of LLMs, e. g., 10\% for GPT-4.

Code Generation In-Context Learning +2

Retromorphic Testing: A New Approach to the Test Oracle Problem

1 code implementation10 Oct 2023 Boxi Yu, Qiuyang Mang, Qingshuo Guo, Pinjia He

Inspired by the mathematical concept of inverse function, we present Retromorphic Testing, a novel black-box testing methodology.

2k

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

no code implementations18 Aug 2023 Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu

Moreover, through retraining the models with the test cases generated by OASIS, the robustness of the moderation model can be improved without performance degradation.

Automated Testing and Improvement of Named Entity Recognition Systems

no code implementations14 Aug 2023 Boxi Yu, Yiyan Hu, Qiuyang Mang, Wenhan Hu, Pinjia He

For automated repairing, TIN achieves a high error reduction rate (26. 8%-50. 6%) over the four systems under test, which successfully repairs 1, 056 out of the 1, 877 reported NER errors.

named-entity-recognition Named Entity Recognition +3

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

1 code implementation12 Aug 2023 Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu

We propose a novel framework CipherChat to systematically examine the generalizability of safety alignment to non-natural languages -- ciphers.

Ethics

Validating Multimedia Content Moderation Software via Semantic Fusion

no code implementations23 May 2023 Wenxuan Wang, Jingyuan Huang, Chang Chen, Jiazhen Gu, Jianping Zhang, Weibin Wu, Pinjia He, Michael Lyu

To this end, content moderation software has been widely deployed on these platforms to detect and blocks toxic content.

Sentence

BiasAsker: Measuring the Bias in Conversational AI System

1 code implementation21 May 2023 Yuxuan Wan, Wenxuan Wang, Pinjia He, Jiazhen Gu, Haonan Bai, Michael Lyu

Particularly, it is hard to generate inputs that can comprehensively trigger potential bias due to the lack of data containing both social groups as well as biased properties.

Bias Detection

MTTM: Metamorphic Testing for Textual Content Moderation Software

1 code implementation11 Feb 2023 Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

In addition, we leverage the test cases generated by MTTM to retrain the model we explored, which largely improves model robustness (0% to 5. 9% EFR) while maintaining the accuracy on the original test set.

Sentence

AEON: A Method for Automatic Evaluation of NLP Test Cases

1 code implementation13 May 2022 Jen-tse Huang, Jianping Zhang, Wenxuan Wang, Pinjia He, Yuxin Su, Michael R. Lyu

However, in practice, many of the generated test cases fail to preserve similar semantic meaning and are unnatural (e. g., grammar errors), which leads to a high false alarm rate and unnatural test cases.

Semantic Similarity Semantic Textual Similarity +1

Loghub: A Large Collection of System Log Datasets for AI-driven Log Analytics

8 code implementations14 Aug 2020 Shilin He, Jieming Zhu, Pinjia He, Michael R. Lyu

To fill this significant gap and facilitate more research on AI-driven log analytics, we have collected and released loghub, a large collection of system log datasets.

Software Engineering

Testing Machine Translation via Referential Transparency

no code implementations22 Apr 2020 Pinjia He, Clara Meister, Zhendong Su

Machine translation software has seen rapid progress in recent years due to the advancement of deep neural networks.

Machine Translation Medical Diagnosis +1

Logzip: Extracting Hidden Structures via Iterative Clustering for Log Compression

1 code implementation24 Sep 2019 Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, Michael R. Lyu

Data compression is essential to reduce the cost of log storage.

Databases Software Engineering

Structure-Invariant Testing for Machine Translation

2 code implementations19 Jul 2019 Pinjia He, Clara Meister, Zhendong Su

Despite its apparent importance, validating the robustness of machine translation systems is very difficult and has, therefore, been much under-explored.

Dependency Parsing Machine Translation +3

Tools and Benchmarks for Automated Log Parsing

7 code implementations8 Nov 2018 Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, Michael R. Lyu

Logs are imperative in the development and maintenance process of many software systems.

Software Engineering

Testing Untestable Neural Machine Translation: An Industrial Case

no code implementations6 Jul 2018 Wujie Zheng, Wenyu Wang, Dian Liu, Changrong Zhang, Qinsong Zeng, Yuetang Deng, Wei Yang, Pinjia He, Tao Xie

To fill the gap of lacking test oracle for in-vivo testing of an NMT system, in this paper, we propose a new approach for automatically identifying translation failures, without requiring reference translations for a translation task; our approach can directly serve as a test oracle for in-vivo testing.

Machine Translation NMT +2

A Directed Acyclic Graph Approach to Online Log Parsing

no code implementations12 Jun 2018 Pinjia He, Jieming Zhu, Pengcheng Xu, Zibin Zheng, Michael R. Lyu

A typical log-based system reliability management procedure is to first parse log messages because of their unstructured format; and apply data mining techniques on the parsed logs to obtain critical system behavior information.

Software Engineering

Semantically Consistent Image Completion with Fine-grained Details

no code implementations26 Nov 2017 Pengpeng Liu, Xiaojuan Qi, Pinjia He, Yikang Li, Michael R. Lyu, Irwin King

Image completion has achieved significant progress due to advances in generative adversarial networks (GANs).

Image Inpainting

Cannot find the paper you are looking for? You can Submit a new open access paper.