Certificate image text recognition method and system based on deep learning
A deep learning and text recognition technology, applied in the field of document image recognition, can solve problems such as irregular text distribution, large text background noise, and unsatisfactory detection rate of OCR technology.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0058] A kind of document image text recognition method based on deep learning of the present invention, comprises the following steps:
[0059] S100. Perform preprocessing on the document image to remove noise, and obtain a preprocessed image;
[0060] S200. Perform text detection on the preprocessed image based on the CTPN algorithm to obtain the text area of the document image;
[0061] S300. The relative position of the font in the document image is fixed, and an image position template is made based on the above principles, and the text area of the document image is screened through the image position template to obtain a target text area of the document image;
[0062] S400. Reconstructing the VGG16 model based on the category of Chinese characters to obtain a text recognition model, using the target text area of the document image as input, and using the TensorFlow Slim algorithm to train the text recognition model to obtain a trained text recognition model;
...
Embodiment 2
[0080] The document image text recognition system based on deep learning of the present invention includes a preprocessing module, a text detection module, a text area module, a model training module and a testing module. The processed image; the text detection module is used to perform text detection on the preprocessed image based on the CTPN algorithm, and output the text area of the document image; the text area module is used to make an image position template based on the principle that the relative position of the font in the document image is fixed, and Filter the text area of the document image through the image position template, and output the target text area of the document image; the model training module is used to reconstruct the VGG16 model based on the category of Chinese characters to obtain a text recognition model, and take the target text area of the document image as input, through The TensorFlow Slim algorithm trains the text recognition model an...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com