论文标题
建模通用专家方法来培训高质量的弹性快照集合
Modeling Generalized Specialist Approach To Train Quality Resilient Snapshot Ensemble
论文作者
论文摘要
卷积神经网络(CNN)非常适合食物图像识别,因为能够学习歧视性视觉特征。然而,识别扭曲的图像对于现有的CNN具有挑战性。因此,该研究为培训质量弹性合奏的广义专家方法建模。该方法有助于整体框架中的模型保留了识别干净图像和浅色技能的一般技能,这些技能是将嘈杂图像与特定失真的一个深厚专业知识区域进行分类。随后,新型的数据增强随机质量混合(RQMIXUP)与快照结合起来训练G专家。在G专家的每个训练周期中,在RQMixup生成的合成图像上进行了微调,将特定变形的清洁和变形图像在随机选择的水平上进行了混合。结果,合奏中的每个快照都获得了几个失真级别的专业知识,并具有其他质量扭曲的较浅技能。接下来,将来自不同专家的过滤器输出融合为更高的精度。学习过程没有额外的成本,因为单个培训过程可以培训专家,该过程与广泛的监督CNN兼容用于转移学习的CNN。最后,对三种现实世界食品和马来西亚食品数据库的实验分析显示,在原始食品图像上具有竞争性分类性能的扭曲图像有了显着改善。
Convolutional neural networks (CNNs) apply well with food image recognition due to the ability to learn discriminative visual features. Nevertheless, recognizing distorted images is challenging for existing CNNs. Hence, the study modelled a generalized specialist approach to train a quality resilient ensemble. The approach aids the models in the ensemble framework retain general skills of recognizing clean images and shallow skills of classifying noisy images with one deep expertise area on a particular distortion. Subsequently, a novel data augmentation random quality mixup (RQMixUp) is combined with snapshot ensembling to train G-Specialist. During each training cycle of G-Specialist, a model is fine-tuned on the synthetic images generated by RQMixup, intermixing clean and distorted images of a particular distortion at a randomly chosen level. Resultantly, each snapshot in the ensemble gained expertise on several distortion levels, with shallow skills on other quality distortions. Next, the filter outputs from diverse experts were fused for higher accuracy. The learning process has no additional cost due to a single training process to train experts, compatible with a wide range of supervised CNNs for transfer learning. Finally, the experimental analysis on three real-world food and a Malaysian food database showed significant improvement for distorted images with competitive classification performance on pristine food images.