野外文本检测和认可：评论

论文标题

野外文本检测和认可：评论

Text Detection and Recognition in the Wild: A Review

论文作者

Raisi, Zobeir, Naiel, Mohamed A., Fieguth, Paul, Wardell, Steven, Zelek, John

论文摘要

自然图像中文本的检测和识别是计算机视觉领域的两个主要问题，这些问题在分析体育视频，自动驾驶，工业自动化等方面都有多种应用。他们面临着普遍的挑战性问题，这是文本如何表现和受到多种环境条件影响的因素。当前的最新场景文本检测和/或识别方法已利用了深度学习体系结构中的见证进步，并在处理多分辨率和多个面向多的文本时报告了基准数据集的卓越精度。但是，由于模型无法概括地看不见的数据和标记的数据不足，因此仍有几个影响野外图像中文本的挑战，导致现有方法表现不佳。因此，与该领域的以前的调查不同，该调查的目标如下：首先，不仅为场景文本检测和认可的最新进步提供了综述，而且还提供了使用统一的评估框架进行广泛实验的结果，从而对所选方法进行了对挑战性案例进行评估案例和应用这些评估的方法，并应用了这些技术，并应用了这些技术。其次，确定在野生图像中检测或识别文本的几个挑战，即平面旋转，多分辨率和多分辨率文本，透视扭曲，照明反射，部分遮挡，复杂字体和特殊字符。最后，本文还深入了解了该领域的潜在研究方向，以解决一些仍在遇到场景文本检测和识别技术的挑战。

Detection and recognition of text in natural images are two main problems in the field of computer vision that have a wide variety of applications in analysis of sports videos, autonomous driving, industrial automation, to name a few. They face common challenging problems that are factors in how text is represented and affected by several environmental conditions. The current state-of-the-art scene text detection and/or recognition methods have exploited the witnessed advancement in deep learning architectures and reported a superior accuracy on benchmark datasets when tackling multi-resolution and multi-oriented text. However, there are still several remaining challenges affecting text in the wild images that cause existing methods to underperform due to there models are not able to generalize to unseen data and the insufficient labeled data. Thus, unlike previous surveys in this field, the objectives of this survey are as follows: first, offering the reader not only a review on the recent advancement in scene text detection and recognition, but also presenting the results of conducting extensive experiments using a unified evaluation framework that assesses pre-trained models of the selected methods on challenging cases, and applies the same evaluation criteria on these techniques. Second, identifying several existing challenges for detecting or recognizing text in the wild images, namely, in-plane-rotation, multi-oriented and multi-resolution text, perspective distortion, illumination reflection, partial occlusion, complex fonts, and special characters. Finally, the paper also presents insight into the potential research directions in this field to address some of the mentioned challenges that are still encountering scene text detection and recognition techniques.

下载PDF全文

下载文献需遵守相关版权规定

论文标题