Paper title
Test Flakiness' Causes, Detection, Impact and Responses: A Multivocal Review
Paper authors
Paper abstract
Flaky tests (tests with non-deterministic outcomes) pose a major challenge for software testing. They are known to cause significant issues such as reducing the effectiveness and efficiency of testing and delaying software releases. In recent years, there has been an increased interest in flaky tests, with research focusing on different aspects of flakiness, such as identifying causes, detection methods and mitigation strategies. Test flakiness has also become a key discussion point for practitioners (in blog posts, technical magazines, etc.) as the impact of flaky tests is felt across the industry. This paper presents a multivocal review that investigates how flaky tests, as a topic, have been addressed in both research and practice. We cover a total of 651 articles (560 academic articles and 91 grey literature articles/posts), and structure the body of relevant research and knowledge using four different dimensions: causes, detection, impact and responses. For each of those dimensions we provide a categorisation, and classify existing research, discussions, methods and tools. With this, we provide a comprehensive and current snapshot of existing thinking on test flakiness, covering both academic views and industrial practices, and identify limitations and opportunities for future research.