论文标题

是否需要神经元的覆盖范围使人发现更强大?

Is Neuron Coverage Needed to Make Person Detection More Robust?

论文作者

Pavlitskaya, Svetlana, Yıkmış, Şiyar, Zöllner, J. Marius

论文摘要

深度神经网络(DNN)在安全和关键领域(如自主驾驶)的日益增长的使用增加了对其系统测试的需求。覆盖范围引导的测试(CGT)是一种根据预定义的覆盖率指标应用突变或模糊的方法,以查找导致行为不当的输入。随着神经元覆盖率的引入,最近也将CGT应用于DNN。在这项工作中,我们将CGT应用于拥挤的场景中人发现的任务。拟议的管道使用Yolov3进行人检测,包括通过采样和突变查找DNN错误,以及随后在更新的培训集中进行DNN再培训。要成为一个错误,我们需要一个突变的图像才能与干净的输入相比会导致大量性能下降。根据CGT,我们还考虑了增加错误定义中覆盖范围的附加要求。为了探索几种类型的鲁棒性,我们的方法包括Daedalus攻击产生的自然图像转换,腐败和对抗性示例。所提出的框架发现了数千例不正确的DNN行为。对于不同的鲁棒性类型而言,培训模型的地图性能的相对变化平均达到26.21 \%和64.24 \%。但是,我们没有发现证据表明可以有利地使用研究的覆盖范围来改善鲁棒性。

The growing use of deep neural networks (DNNs) in safety- and security-critical areas like autonomous driving raises the need for their systematic testing. Coverage-guided testing (CGT) is an approach that applies mutation or fuzzing according to a predefined coverage metric to find inputs that cause misbehavior. With the introduction of a neuron coverage metric, CGT has also recently been applied to DNNs. In this work, we apply CGT to the task of person detection in crowded scenes. The proposed pipeline uses YOLOv3 for person detection and includes finding DNN bugs via sampling and mutation, and subsequent DNN retraining on the updated training set. To be a bug, we require a mutated image to cause a significant performance drop compared to a clean input. In accordance with the CGT, we also consider an additional requirement of increased coverage in the bug definition. In order to explore several types of robustness, our approach includes natural image transformations, corruptions, and adversarial examples generated with the Daedalus attack. The proposed framework has uncovered several thousand cases of incorrect DNN behavior. The relative change in mAP performance of the retrained models reached on average between 26.21\% and 64.24\% for different robustness types. However, we have found no evidence that the investigated coverage metrics can be advantageously used to improve robustness.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源