论文标题
通过系统调用,最大化错误注入现实主义的混乱工程
Maximizing Error Injection Realism for Chaos Engineering with System Calls
论文作者
论文摘要
在本文中,我们提出了一个新颖的故障注入框架,用于系统调用调用错误,称为菲比。菲比(Phoebe)是独一无二的,如下所示。首先,Phoebe使开发人员能够完全可观察到系统呼叫调用。其次,菲比(Phoebe)生成了错误模型,这些模型从模仿生产中自然发生的错误的意义上是现实的。第三,菲比能够自动进行实验,以系统地评估有关生产中系统调用调用错误的应用程序的可靠性。我们评估了菲比在生产环境中两个现实世界应用上的有效性和运行时间开销。结果表明,Phoebe成功生成了现实的错误模型,并且能够检测到有关系统调用调用错误的重要可靠性弱点。据我们所知,这个新颖的“现实误差注射”概念是从未研究过的,该概念包括在生产错误上进行故障注入。
In this paper, we present a novel fault injection framework for system call invocation errors, called Phoebe. Phoebe is unique as follows. First, Phoebe enables developers to have full observability of system call invocations. Second, Phoebe generates error models that are realistic in the sense that they mimic errors that naturally happen in production. Third, Phoebe is able to automatically conduct experiments to systematically assess the reliability of applications with respect to system call invocation errors in production. We evaluate the effectiveness and runtime overhead of Phoebe on two real-world applications in a production environment. The results show that Phoebe successfully generates realistic error models and is able to detect important reliability weaknesses with respect to system call invocation errors. To our knowledge, this novel concept of "realistic error injection", which consists of grounding fault injection on production errors, has never been studied before.