Loading…

An Advanced Pyramid Network Technology for Optical Character Recognition

Optical Character Recognition (OCR) technology can quickly convert text and digital information in pictures into text information, which has been widely used in actual scenes. However, in some complex scenes, such as different light, angle or occlusion, the existing OCR systems still cannot reach th...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of physics. Conference series 2019-08, Vol.1302 (2), p.22042
Main Authors:	Zhang, Haoruo, Cheng, Zhongquan
Format:	Article
Language:	English
Subjects:	Misalignment Occlusion Optical character recognition Parameters Physics
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Optical Character Recognition (OCR) technology can quickly convert text and digital information in pictures into text information, which has been widely used in actual scenes. However, in some complex scenes, such as different light, angle or occlusion, the existing OCR systems still cannot reach the required accuracy. This paper proposes an advanced pyramid network structure, which uses multiple different scales and parallel pyramid network structures to deal with the problem of different character scales and misalignment. Each pyramid network structure has at least four convolutional layers. At the same time, in each pyramid network connection, the proportionally discarding parameters is introduced to increase the calculation speed further. The advantage of this network structure is very robust to more differently sized characters and increases the number of valid parameters obtained through training. Experiments on open source data sets show that the method has better recognition accuracy and speed.
ISSN:	1742-6588 1742-6596
DOI:	10.1088/1742-6596/1302/2/022042