A method and a terminal for creating paper document structured data based on a deep learning model
A technology for creating structured data from paper documents, applied in biological neural network models, neural architectures, character and pattern recognition, etc. It addresses problems such as bill displacement, low accuracy, and content exceeding the set range, improving efficiency and accuracy and saving resources.
Examples
Embodiment 1
[0065] As shown in Figure 3, the present invention provides a method for creating paper document structured data based on a deep learning model, including:
[0066] S1. Preset a document training sample set; each sample in the training sample set includes a paper document OCR recognition result and an annotated document corresponding to that OCR recognition result; the annotated document records the position and category information of each key field in the paper document OCR recognition result.
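The sample structure described in S1 can be sketched as follows. This is only an illustrative sketch, not the format used by the invention: the class names `OcrToken`, `KeyFieldAnnotation`, and `Sample`, the pixel bounding-box representation of "position", and the helper `label_tokens` are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed position format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


@dataclass
class OcrToken:
    """One piece of text from the OCR recognition result (hypothetical type)."""
    text: str
    box: Box


@dataclass
class KeyFieldAnnotation:
    """Position and category of one key field in the annotated document."""
    category: str
    box: Box


@dataclass
class Sample:
    """One training sample: an OCR result plus its annotated document."""
    ocr_result: List[OcrToken]
    annotations: List[KeyFieldAnnotation]


def label_tokens(sample: Sample, default: str = "other") -> List[str]:
    """Give each OCR token the category of the key field containing its centre."""
    labels = []
    for token in sample.ocr_result:
        cx = (token.box[0] + token.box[2]) / 2
        cy = (token.box[1] + token.box[3]) / 2
        label = default
        for ann in sample.annotations:
            x0, y0, x1, y1 = ann.box
            if x0 <= cx <= x1 and y0 <= cy <= y1:
                label = ann.category
                break
        labels.append(label)
    return labels


# Made-up example data: two OCR tokens, one annotated key field.
example = Sample(
    ocr_result=[
        OcrToken("Total", (10, 10, 60, 30)),
        OcrToken("128.50", (70, 10, 130, 30)),
    ],
    annotations=[KeyFieldAnnotation("amount", (65, 5, 135, 35))],
)
example_labels = label_tokens(example)
```

Pairing each OCR token with a category in this way is one possible way to turn the annotated document into per-token training targets; the source does not state how the annotations are consumed.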
[0067] Paper documents include, but are not limited to, text documents and bill documents. For example, 1000 bill images are collected and processed into samples; some of the samples are used as training samples and the rest as test samples. Each bill contains a certain number of fields, including the key fields of interest. Each sample includes the OCR recognition result of a paper document and a document in which the key fields are annotated. The annotation document r...
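The division of the collected samples into training and test samples can be sketched as below. The text only says that part of the samples is used for each purpose, so the 80/20 ratio, the fixed random seed, and the function name `split_samples` are assumptions made for this sketch.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_samples(
    samples: Sequence[T],
    train_fraction: float = 0.8,  # assumed ratio; not specified in the source
    seed: int = 0,                # fixed seed so the split is reproducible
) -> Tuple[List[T], List[T]]:
    """Shuffle the samples and split them into training and test subsets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Stand-in for 1000 processed bill samples, identified here by index only.
train_samples, test_samples = split_samples(range(1000))
```

Shuffling before the cut avoids a split that follows collection order, which matters if bills of the same type were collected together.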
Embodiment 2
[0101] As shown in Figure 6, the present invention also provides a terminal for creating paper document structured data based on a deep learning model, including one or more processors 1 and a memory 2; the memory 2 stores a program configured to be executed by the one or more processors 1 to perform the following steps:
[0102] S1. Preset a document training sample set; each sample in the document training sample set includes a paper document OCR recognition result and an annotated document corresponding to that OCR recognition result; the annotated document records the position information and category information of each key field in the paper document OCR recognition result.
[0103] For example, 1000 bill images are collected and processed into samples; some of the samples are used as training samples and the rest as test samples. Each bill contains a certain number of fields, including key fields...