Paper Title
Abutting Grating Illusion: Cognitive Challenge to Neural Network Models
Authors
Abstract
Even state-of-the-art deep learning models lack fundamental abilities that humans have. Multiple comparison paradigms have been proposed to explore the distinctions between humans and deep learning. While most comparisons rely on corruptions inspired by mathematical transformations, very few are based on human cognitive phenomena. In this study, we propose a novel corruption method based on the abutting grating illusion, a visual phenomenon widely observed in humans and in a wide range of animal species. The corruption method destroys gradient-defined boundaries and generates the perception of illusory contours using line gratings that abut each other. We applied the method to MNIST, high-resolution MNIST, and silhouette object images. Various deep learning models were tested on the corruption, including models trained from scratch and 109 models pretrained with ImageNet or various data augmentation techniques. Our results show that the abutting grating corruption is challenging even for state-of-the-art deep learning models: most models perform no better than random guessing. We also found that the DeepAugment technique can greatly improve robustness against the abutting grating illusion. Visualisation of early layers indicates that better-performing models exhibit stronger end-stopping properties, which is consistent with neuroscience findings. To validate the corruption method, 24 human subjects were recruited to classify samples from the corrupted datasets.
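The corruption described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; it only assumes that the shape is given as a binary mask and that the image is drawn with horizontal line gratings. The function name `abutting_grating` and the parameters `period` and `phase_shift` are hypothetical and chosen for illustration. Inside and outside the mask the same grating is drawn, offset by a phase shift, so no intensity edge marks the boundary; the shape is signalled only by where the lines end.

```python
def abutting_grating(mask, period=4, phase_shift=None):
    """Render a binary mask as two abutting horizontal line gratings.

    mask        -- 2D list of 0/1 values; 1 marks the shape (assumed input format)
    period      -- number of rows between grating lines (illustrative parameter)
    phase_shift -- row offset of the grating inside the shape;
                   defaults to half a period (illustrative parameter)
    """
    if phase_shift is None:
        phase_shift = period // 2
    image = []
    for r, row in enumerate(mask):
        # Grating value for this row outside the shape.
        background = 1 if r % period == 0 else 0
        # Same grating, shifted in phase, inside the shape.
        foreground = 1 if (r + phase_shift) % period == 0 else 0
        image.append([foreground if inside else background for inside in row])
    return image


if __name__ == "__main__":
    # An 8x8 mask containing a 4x4 square, standing in for a digit or silhouette.
    square = [[1 if 2 <= r < 6 and 2 <= c < 6 else 0 for c in range(8)]
              for r in range(8)]
    for line in abutting_grating(square):
        print("".join("#" if v else "." for v in line))
```

With the default half-period shift, each line of one grating terminates exactly where the other grating begins, which is the line-ending cue that the abstract links to end-stopping in early visual processing.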