论文标题
其中之一(几)不像其他东西
One of these (Few) Things is Not Like the Others
论文作者
论文摘要
为了表现良好,大多数基于深度学习的图像分类系统都需要大量数据和计算资源。这些限制使得很难在相当强大的机器之外快速个性化的个体用户或训练模型。为了解决这些问题,已经进行了大量的研究教学机器研究,以学习基于少数培训示例(一个被称为少数学习的领域)来对图像进行分类。传统上,很少有学习的学习研究简化地假设所有图像属于固定数量的先前看到的组之一。但是,许多图像数据集(例如手机上的摄像头滚动)会很嘈杂,并且包含可能与任何清晰组无关或不适合的图像。我们提出了一个模型,该模型既可以根据少量示例对新图像进行分类,又可以识别不属于任何先前看到的组的图像。我们适应了以前的几次学习工作,包括一种简单的机制,用于学习一个截断,该机制确定是否应排除或分类图像。我们检查了我们的方法在现实环境中的性能,在图像的嘈杂和模棱两可的数据集上基准了该方法。我们评估了模型架构范围的性能,包括足够小的设置,可以在低功率设备(例如手机或Web浏览器)上运行。我们发现,这项排除无关图像的任务超出了传统的少量射击任务,这带来了巨大的额外困难。我们分解了此错误的来源,并提出了未来的改进,以减轻这一困难。
To perform well, most deep learning based image classification systems require large amounts of data and computing resources. These constraints make it difficult to quickly personalize to individual users or train models outside of fairly powerful machines. To deal with these problems, there has been a large body of research into teaching machines to learn to classify images based on only a handful of training examples, a field known as few-shot learning. Few-shot learning research traditionally makes the simplifying assumption that all images belong to one of a fixed number of previously seen groups. However, many image datasets, such as a camera roll on a phone, will be noisy and contain images that may not be relevant or fit into any clear group. We propose a model which can both classify new images based on a small number of examples and recognize images which do not belong to any previously seen group. We adapt previous few-shot learning work to include a simple mechanism for learning a cutoff that determines whether an image should be excluded or classified. We examine how well our method performs in a realistic setting, benchmarking the approach on a noisy and ambiguous dataset of images. We evaluate performance over a spectrum of model architectures, including setups small enough to be run on low powered devices, such as mobile phones or web browsers. We find that this task of excluding irrelevant images poses significant extra difficulty beyond that of the traditional few-shot task. We decompose the sources of this error, and suggest future improvements that might alleviate this difficulty.