Form recognition method, recognition system and computer device
A form recognition method and related technology, applied in computer components, computing, character and pattern recognition, etc., that addresses problems such as complex table structure, time-consuming processing, and the heavy cost in manpower, materials, money and time.
Examples
Embodiment 1
[0086] As shown in Figure 1, the form recognition method provided by Embodiment 1 of the present invention first determines the format of the input document. If it is a PDF file, the format conversion module converts it into a JPG image and saves it. The RGB image is then converted into a binary image, using nonlinear contrast enhancement based on weighted RC threshold iteration together with a LoG-operator binarization method, and saved. Next, a tilt correction algorithm based on perspective transformation corrects the skew of the image according to four selected perspective corner points. At the same time, the frame lines of the table are extracted by morphological image processing, and each cell is segmented. Finally, a dedicated character database is built according to the characteristics of the form's application field, and a customized neural network is trained to recognize the characters.
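The frame-line extraction and cell segmentation step described above can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes a binary image given as a list of rows of 0/1 values, a regular grid whose lines span the whole table, and hypothetical function names (keep_long_runs, extract_frame_lines, segment_cells). A morphological opening with a flat line-shaped structuring element is equivalent to keeping only runs of foreground pixels at least as long as that element, which is what the code does in plain Python.

```python
def keep_long_runs(seq, min_len):
    """1-D opening with a flat line of length min_len:
    keep only pixels lying in a run of at least min_len consecutive 1s."""
    seq = list(seq)
    out = [0] * len(seq)
    start = None
    for i, v in enumerate(seq + [0]):  # trailing 0 closes a run at the end
        if v and start is None:
            start = i
        elif not v and start is not None:
            if i - start >= min_len:
                out[start:i] = [1] * (i - start)
            start = None
    return out


def extract_frame_lines(img, min_len):
    """Return (horizontal, vertical) line masks of a binary image."""
    horizontal = [keep_long_runs(row, min_len) for row in img]
    vertical_t = [keep_long_runs(col, min_len) for col in zip(*img)]
    vertical = [list(row) for row in zip(*vertical_t)]  # transpose back
    return horizontal, vertical


def line_spans(flags):
    """Group consecutive truthy indices into (first, last) spans,
    so that a thick line counts as one line."""
    spans, start = [], None
    for i, f in enumerate(list(flags) + [False]):
        if f and start is None:
            start = i
        elif not f and start is not None:
            spans.append((start, i - 1))
            start = None
    return spans


def segment_cells(img, min_len):
    """Assumed regular grid: cells are the boxes between adjacent lines.
    Each cell is (top, left, bottom, right) with bottom/right exclusive."""
    horizontal, vertical = extract_frame_lines(img, min_len)
    row_spans = line_spans(any(r) for r in horizontal)
    col_spans = line_spans(any(c) for c in zip(*vertical))
    cells = []
    for (_, top_end), (bottom_start, _) in zip(row_spans, row_spans[1:]):
        for (_, left_end), (right_start, _) in zip(col_spans, col_spans[1:]):
            cells.append((top_end + 1, left_end + 1, bottom_start, right_start))
    return cells


def demo_image():
    """A 9x13 toy table: 2x2 cells plus one isolated 'text' pixel."""
    h, w = 9, 13
    img = [[0] * w for _ in range(h)]
    for r in (0, 4, 8):
        img[r] = [1] * w
    for c in (0, 6, 12):
        for r in range(h):
            img[r][c] = 1
    img[2][3] = 1  # stands in for a character stroke inside a cell
    return img


if __name__ == "__main__":
    for cell in segment_cells(demo_image(), min_len=5):
        print(cell)
```

The min_len parameter plays the role of the structuring-element length: it should be longer than any character stroke but shorter than the shortest frame line. Tables with merged cells or partial lines would need a contour- or intersection-based segmentation instead of this grid assumption.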
[0087] In pr...
Embodiment 2
[0183] As shown in Figure 11, Embodiment 2 of the present invention provides a method for character recognition by training a dedicated neural network. Specifically:
[0184] First, count the high-frequency characters and character strings that appear in the forms to be recognized in the specialized domain, and collect character images as the sample set for the neural network. Then binarize each image, segment the individual characters, and normalize them so that all images share one format and size. Next, extract features from the preprocessed images, such as character structure point features and character projection features. Finally, train the network using ten-fold cross-validation and use the tuned network to recognize characters; from the recognition results, calculate the edit distance to each string in the string database, and compare the minimum edit distance with the credibil...
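The edit-distance lookup in the final step can be sketched as follows. The paragraph is cut off before it states how the comparison is used, so the accept/reject rule here is an assumption: the recognized string is replaced by its nearest database entry only when the minimum edit distance does not exceed a hypothetical max_distance parameter. The function names and the sample database are also illustrative.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (row-by-row DP)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        prev = cur
    return prev[-1]


def correct(recognized, database, max_distance):
    """Return (string, distance). The nearest database entry is used when
    it is within max_distance (assumed rule); otherwise the raw recognition
    result is kept unchanged."""
    best = min(database, key=lambda s: edit_distance(recognized, s))
    dist = edit_distance(recognized, best)
    if dist <= max_distance:
        return best, dist
    return recognized, dist


if __name__ == "__main__":
    domain_strings = ["invoice", "total", "date"]  # hypothetical database
    print(correct("lnvoice", domain_strings, max_distance=2))
    print(correct("xyzxyz", domain_strings, max_distance=2))
```

A fixed absolute threshold is the simplest choice; a threshold relative to string length would be a reasonable alternative when the database mixes short and long entries.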