Unsupervised learning method for Chinese OCR post-processing
An unsupervised learning and unsupervised technology, applied in the direction of instruments, biological neural network models, character and pattern recognition, etc., can solve problems such as the influence of dossier information extraction, poor recognition results, and poor picture quality
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0055] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0056] The purpose of the present invention is to solve the problem of OCR post-processing of scanned legal files. It is an unsupervised learning method for Chinese OCR post-processing. Standing on the shoulders of predecessors, it proposes an OCR recognition model and an OCR error correction model. The recognition model combines the results of current classic models and mature OCR systems (Tesseract, Baidu OCR). The OCR error correction model, based on the results of the OCR recognition system, proposes an unsupervised multi-input OCR error correction method, which can avoid a large number of artificial marks. The entire model adopts the classic network model in the industry, and does not adopt a particularly complicated network hierarchy. The ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com