论文标题
可解释的少数图像分类的区域比较网络
Region Comparison Network for Interpretable Few-shot Image Classification
论文作者
论文摘要
尽管深度学习已成功地应用于许多现实世界的计算机视觉任务,但培训强大的分类器通常需要大量标记的数据。但是,注释通常昂贵且耗时。因此,很少有人提议有效使用有限数量的标记示例来培训新类型的模型。基于可转移的公制学习方法的最新作品通过学习查询和支持集的样品特征之间的相似性,实现了有希望的分类性能。但是,很少有明确认为模型的解释性,这实际上可以在训练阶段揭示。 为此,在这项工作中,我们提出了一个基于公制的方法名称区域比较网络(RCN),该方法能够揭示几乎没有射击的学习在神经网络中的工作方式,以及在来自查询和支持集的图像中找到与彼此相关的特定区域。此外,我们还提出了一个名为“区域激活映射”(RAM)的可视化策略,以直观地解释我们的方法通过可视化网络中的中间变量所学到了什么。我们还提出了一种将可解释性从任务级别概括为类别的新方法,该方法也可以视为找到用于支持RCN最终决定的原型零件的方法。四个基准数据集的广泛实验清楚地表明了我们方法对现有基准的有效性。
While deep learning has been successfully applied to many real-world computer vision tasks, training robust classifiers usually requires a large amount of well-labeled data. However, the annotation is often expensive and time-consuming. Few-shot image classification has thus been proposed to effectively use only a limited number of labeled examples to train models for new classes. Recent works based on transferable metric learning methods have achieved promising classification performance through learning the similarity between the features of samples from the query and support sets. However, rare of them explicitly considers the model interpretability, which can actually be revealed during the training phase. For that, in this work, we propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works as in a neural network as well as to find out specific regions that are related to each other in images coming from the query and support sets. Moreover, we also present a visualization strategy named Region Activation Mapping (RAM) to intuitively explain what our method has learned by visualizing intermediate variables in our network. We also present a new way to generalize the interpretability from the level of tasks to categories, which can also be viewed as a method to find the prototypical parts for supporting the final decision of our RCN. Extensive experiments on four benchmark datasets clearly show the effectiveness of our method over existing baselines.