TripleAgent: Monitoring, Perturbation And Failure-obliviousness for Automated Resilience Improvement in Java Applications

27 Dec 2018  ·  Long Zhang, Martin Monperrus ·

In this paper, we present a novel system for fault injection in production for Java applications. The unique feature of this system is to combine automated monitoring, automated perturbation injection, and automated resilience improvement. The latter is achieved with ideas coming from the failure-oblivious literature. We design and implement the system as agents for the Java virtual machine. We evaluate the system on a real-world application for transferring files with the BitTorrent protocol. Our results shows that it is possible to automatically improve the resilience of Java applications with respect to uncaught exceptions.

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper