Method for extracting and organizing unstructured sheet document data under big data environment
A structured data, unstructured technology, applied in electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as inability to extract structured table documents, lack of flexibility, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0096] combine figure 2 An actual unstructured tabular document is given to illustrate the specific implementation of the method for extracting and organizing unstructured tabular document data in a big data environment proposed by the present invention. The steps are as follows: (1) define the basic characteristics of the tabular document and extraction rules;
[0097] (1.1) Define the structural features of the table document;
[0098] (1.1.1) If figure 2 As shown, according to the rule that a title area of a single value area corresponds to a data area, figure 2 (a) is a single-value area; according to the rule that one title area of a multi-value area corresponds to one or more data areas, figure 2 (b) is a multi-valued area;
[0099] (1.1.2) If figure 2 as shown, figure 2 (a) "Name" is the title area, and "Chen" is the data area; figure 2 (b) "Start and end time" is the title area, and "2009.12.14-12.16" is the data area;
[0100] (1.2) Define the data ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com