OCR identification method based on model fusion
A recognition method and model fusion technology, applied in character and pattern recognition, neural learning methods, biological neural network models, etc., can solve the problems of low recognition accuracy and positioning deviation, and achieve complementary advantages, improve accuracy, and improve Effects of working with sequence data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] This example provides an OCR recognition method based on model fusion. This method uses several recognition models to build a deep learning neural network model. The recognition models train the character positioning data respectively, and obtain the pre-output recognition results after the model converges. The above pre-output recognition results are fused and optimized to output the final recognition results.
[0041] The method flow is as follows figure 1 As shown, it specifically includes the following steps:
[0042] (1) Obtain the picture data to be recognized, and preprocess the picture data.
[0043] (1.1) Preprocess the picture: correct the tilt, grayscale, sharpen, adjust the contrast and brightness of the picture, and scale the picture to a uniform size.
[0044] (1.2) Normalize the picture. m=M / 255. Among them, M is the original data, and m is the result after normalization. After normalization, the image pixel value range is [0, 1], which is more conduc...
Embodiment 2
[0062] This example uses the OCR recognition method based on model fusion in Embodiment 1 to perform recognition processing on a picture to be recognized.
[0063] The complete process steps are as follows:
[0064] (1) Input a picture to be recognized and preprocess it, including correcting the skewed and distorted picture, and removing noise from the blurred picture to make it clearer.
[0065] (2) Use the pixel-link neural network to locate the area containing characters in the picture, take a screenshot and save it.
[0066] (3) Perform random data enhancement on the character picture to be recognized, and randomly enhance and expand from one original picture to be recognized to 30 pictures. Specific enhancement methods include: random up and down translation, left and right translation, random rotation at a certain angle, random addition of salt and pepper noise, random adjustment of luminosity, brightness, contrast, etc.
[0067] (4) Randomly assign 10 pictures to each...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com