通过更高级的梯度混淆技术来缓解先进的对抗攻击

论文标题

通过更高级的梯度混淆技术来缓解先进的对抗攻击

Mitigating Advanced Adversarial Attacks with More Advanced Gradient Obfuscation Techniques

论文作者

Qiu, Han, Zeng, Yi, Zheng, Qinkai, Zhang, Tianwei, Qiu, Meikang, Memmi, Gerard

论文摘要

深度神经网络（DNN）众所周知，很容易受到对抗性例子的影响（AES）。已经花费了大量的努力来发射和加热攻击者和防守者之间的军备竞赛。最近，提出了基于先进的梯度攻击技术（例如BPDA和EOT），这些技术已经击败了大量现有的防御方法。直到今天，仍然没有令人满意的解决方案可以有效，有效地防御这些攻击。在本文中，我们迈出了稳定的一步，以减轻基于两个主要贡献的基于高级梯度的攻击。首先，我们对这些攻击的根本原因进行了深入的分析，并提出了四个可以打破这些攻击基本假设的属性。其次，我们确定一组可以符合这些属性的操作。通过集成这些操作，我们设计了两个可以使这些强大攻击无效的预处理功能。广泛的评估表明，我们的解决方案可以有效地减轻所有现有的标准和高级攻击技术，并击败过去两年中顶级会议上发表的11种最先进的国防解决方案。防守者可以利用我们的解决方案来限制攻击成功率在7％以下的最强攻击率以下，即使对手已经花费了数十个GPU小时。

Deep Neural Networks (DNNs) are well-known to be vulnerable to Adversarial Examples (AEs). A large amount of efforts have been spent to launch and heat the arms race between the attackers and defenders. Recently, advanced gradient-based attack techniques were proposed (e.g., BPDA and EOT), which have defeated a considerable number of existing defense methods. Up to today, there are still no satisfactory solutions that can effectively and efficiently defend against those attacks. In this paper, we make a steady step towards mitigating those advanced gradient-based attacks with two major contributions. First, we perform an in-depth analysis about the root causes of those attacks, and propose four properties that can break the fundamental assumptions of those attacks. Second, we identify a set of operations that can meet those properties. By integrating these operations, we design two preprocessing functions that can invalidate these powerful attacks. Extensive evaluations indicate that our solutions can effectively mitigate all existing standard and advanced attack techniques, and beat 11 state-of-the-art defense solutions published in top-tier conferences over the past 2 years. The defender can employ our solutions to constrain the attack success rate below 7% for the strongest attacks even the adversary has spent dozens of GPU hours.

下载PDF全文

下载文献需遵守相关版权规定

论文标题