在面部表情识别中打击不确定性和阶级失衡

论文标题

在面部表情识别中打击不确定性和阶级失衡

Combating Uncertainty and Class Imbalance in Facial Expression Recognition

论文作者

Fan, Jiaxiang, Zhou, Jian, Deng, Xiaoyu, Wang, Huabin, Tao, Liang, Kwan, Hon Keung

论文摘要

在计算机视觉方面，识别面部表情是一个挑战。主要原因是由于数据收集和由于固有的噪声（例如模糊的面部表情和标签不一致）而引起的不确定性引起的类不平衡。但是，当前的研究集中在阶级失衡问题或不确定性问题上，而忽略了如何解决这两个问题的交集。因此，在本文中，我们提出了一个基于重新设定和注意的框架，以解决上述问题。我们为每个班级设计重量。通过惩罚机制，我们的模型将更多地关注训练过程中小样本的学习，并且可以通过卷积块注意模块（CBAM）提高模型准确性的降低。同时，我们的骨干网络还将为每个样本学习一个不确定的功能。通过将样品之间的不确定特征混合，该模型可以更好地学习可以用于分类的特征，从而抑制不确定性。实验表明，我们的方法在面部表达数据集的准确性方面超过了大多数基本方法（例如，AffectNet，RAF-DB），并且还可以很好地解决类不平衡的问题。

Recognition of facial expression is a challenge when it comes to computer vision. The primary reasons are class imbalance due to data collection and uncertainty due to inherent noise such as fuzzy facial expressions and inconsistent labels. However, current research has focused either on the problem of class imbalance or on the problem of uncertainty, ignoring the intersection of how to address these two problems. Therefore, in this paper, we propose a framework based on Resnet and Attention to solve the above problems. We design weight for each class. Through the penalty mechanism, our model will pay more attention to the learning of small samples during training, and the resulting decrease in model accuracy can be improved by a Convolutional Block Attention Module (CBAM). Meanwhile, our backbone network will also learn an uncertain feature for each sample. By mixing uncertain features between samples, the model can better learn those features that can be used for classification, thus suppressing uncertainty. Experiments show that our method surpasses most basic methods in terms of accuracy on facial expression data sets (e.g., AffectNet, RAF-DB), and it also solves the problem of class imbalance well.

下载PDF全文

下载文献需遵守相关版权规定

论文标题