2D自组织的手写文本识别模型

论文标题

2D自组织的手写文本识别模型

2D Self-Organized ONN Model For Handwritten Text Recognition

论文作者

Mohammed, Hanadi Hassen, Malik, Junaid, Al-Madeed, Somaya, Kiranyaz, Serkan

论文摘要

深度卷积神经网络（CNN）最近达到了最先进的手写文本识别（HTR）性能。但是，最近的研究表明，典型的CNN的学习性能是有限的，因为它们是具有简单（线性）神经元模型的同质网络。由于它们的异质网络结构融合了非线性神经元，最近提出了操作神经网络（ONNS）来解决这一缺点。自我结合是具有生成神经元模型的ONN的自组织变化，可以使用泰勒近似产生任何非线性函数。在这项研究中，为了提高HTR的最新性能水平，提出了新型网络模型核心的2D自组织ONNS（自我强调）。此外，本研究中使用了可变形的卷积，最近被证明可以更好地解决写作风格的变化。 IAM英语数据集和Hadara80p阿拉伯数据集中的结果表明，具有自我影响的操作层的拟议模型显着提高了字符错误率（CER）和单词错误率（WER）。与同行CNN相比，Hadara80p中的自我强调将CER和3.4％的CER降低，而IAM数据集中的CER和0.199％和1.244％。基准IAM上的结果表明，与自我相处的操作层的拟议模型通过显着的边缘优于最近的深CNN模型，而使用具有可变形卷积的自我强调则表明了出色的结果。

Deep Convolutional Neural Networks (CNNs) have recently reached state-of-the-art Handwritten Text Recognition (HTR) performance. However, recent research has shown that typical CNNs' learning performance is limited since they are homogeneous networks with a simple (linear) neuron model. With their heterogeneous network structure incorporating non-linear neurons, Operational Neural Networks (ONNs) have recently been proposed to address this drawback. Self-ONNs are self-organized variations of ONNs with the generative neuron model that can generate any non-linear function using the Taylor approximation. In this study, in order to improve the state-of-the-art performance level in HTR, the 2D Self-organized ONNs (Self-ONNs) in the core of a novel network model are proposed. Moreover, deformable convolutions, which have recently been demonstrated to tackle variations in the writing styles better, are utilized in this study. The results over the IAM English dataset and HADARA80P Arabic dataset show that the proposed model with the operational layers of Self-ONNs significantly improves Character Error Rate (CER) and Word Error Rate (WER). Compared with its counterpart CNNs, Self-ONNs reduce CER and WER by 1.2% and 3.4 % in the HADARA80P and 0.199% and 1.244% in the IAM dataset. The results over the benchmark IAM demonstrate that the proposed model with the operational layers of Self-ONNs outperforms recent deep CNN models by a significant margin while the use of Self-ONNs with deformable convolutions demonstrates exceptional results.

下载PDF全文

下载文献需遵守相关版权规定

论文标题