Loading…
An Advanced Pyramid Network Technology for Optical Character Recognition
Optical Character Recognition (OCR) technology can quickly convert text and digital information in pictures into text information, which has been widely used in actual scenes. However, in some complex scenes, such as different light, angle or occlusion, the existing OCR systems still cannot reach th...
Saved in:
Published in: | Journal of physics. Conference series 2019-08, Vol.1302 (2), p.22042 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Optical Character Recognition (OCR) technology can quickly convert text and digital information in pictures into text information, which has been widely used in actual scenes. However, in some complex scenes, such as different light, angle or occlusion, the existing OCR systems still cannot reach the required accuracy. This paper proposes an advanced pyramid network structure, which uses multiple different scales and parallel pyramid network structures to deal with the problem of different character scales and misalignment. Each pyramid network structure has at least four convolutional layers. At the same time, in each pyramid network connection, the proportionally discarding parameters is introduced to increase the calculation speed further. The advantage of this network structure is very robust to more differently sized characters and increases the number of valid parameters obtained through training. Experiments on open source data sets show that the method has better recognition accuracy and speed. |
---|---|
ISSN: | 1742-6588 1742-6596 |
DOI: | 10.1088/1742-6596/1302/2/022042 |