Method for extracting information from error OCR result
An information extraction, error-prone technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of difficult data acquisition, error-prone writers, and inability to exhaustively, so as to improve the matching effect and improve the final result. Effect, effect of reducing typo penalty
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] This embodiment is based on the above-mentioned method for extracting information from OCR results with errors, and takes extracting gender and ethnicity from an ID card as an example to provide a technical solution for specific implementation.
[0076] Such as figure 1 As shown, the main steps of the method provided by the present invention include: obtaining an image OCR recognition result; post-processing the OCR result; obtaining several lines of text strings after OCR post-processing; inputting a well-written sequence character template; inputting a pre-generated Table of near-words; perform sequence alignment and matching on each line of text strings; select the line with the highest matching score to extract information. The specific process is as follows:
[0077] 1. Recognize the text in the target image through OCR technology and obtain the OCR result.
[0078] 2. Post-process the OCR recognition results and merge the text of each line. The specific method i...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com