论文标题
ACFD:非对称卡通脸检测器
ACFD: Asymmetric Cartoon Face Detector
论文作者
论文摘要
由于涉及许多困难的场景,卡通面部检测比人脸检测更具挑战性的任务。针对卡通面的特征,例如在脸部内部的巨大差异,在本文中,我们提出了一个名为ACFD的不对称卡通脸检测器。具体而言,它由以下模块组成:一种新型的骨干VOVNETV3,由几个不对称的单发聚合模块(AOSA),不对称的双向特征pyramid网络(ABI-FPN),动态锚固策略(DAM)和相应的二元分类损失(MBC)。特别是,为了生成具有不同接收场的特征,VovNETV3提取了多尺度的金字塔特征,然后通过ABI-FPN同时融合和增强,以以某些极端的姿势处理面部,并且具有不同的长宽比。此外,大坝可用于每张脸部足够的高质量锚点,而MBC则是强大的歧视力量。凭借这些模块的有效性,我们的ACFD在模型尺寸200MB的限制下,每图像50ms的推理时间50ms以及没有任何预告片的模型,在2020 Icartoon Face挑战的检测轨道上获得了第一名。
Cartoon face detection is a more challenging task than human face detection due to many difficult scenarios is involved. Aiming at the characteristics of cartoon faces, such as huge differences within the intra-faces, in this paper, we propose an asymmetric cartoon face detector, named ACFD. Specifically, it consists of the following modules: a novel backbone VoVNetV3 comprised of several asymmetric one-shot aggregation modules (AOSA), asymmetric bi-directional feature pyramid network (ABi-FPN), dynamic anchor match strategy (DAM) and the corresponding margin binary classification loss (MBC). In particular, to generate features with diverse receptive fields, multi-scale pyramid features are extracted by VoVNetV3, and then fused and enhanced simultaneously by ABi-FPN for handling the faces in some extreme poses and have disparate aspect ratios. Besides, DAM is used to match enough high-quality anchors for each face, and MBC is for the strong power of discrimination. With the effectiveness of these modules, our ACFD achieves the 1st place on the detection track of 2020 iCartoon Face Challenge under the constraints of model size 200MB, inference time 50ms per image, and without any pretrained models.