Document parsing method, system and device applied in big-data analysis technology
A technology of analysis processing and big data, applied in the field of big data analysis, can solve the problems of restricting the acquisition channels of data sources for document analysis, the inapplicability of document analysis solutions, reducing the compatibility and comprehensiveness of document analysis applications, etc., and achieving high accuracy , Improve the effect of application compatibility
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] Such as figure 1 As shown, the present embodiment provides a document parsing and processing method applied in big data analysis, the method includes the following steps:
[0031] Regular expression rules for constructing financial indicators;
[0032] Obtain the start characteristic index and end characteristic index of the financial statement;
[0033] Use regular expression rules of financial indicators, start feature indicators, and end feature indicators to locate documents in different formats for financial statements;
[0034] After positioning the data in the financial statement, record the financial data and the name and time of the indicators corresponding to the financial data;
[0035] After unit conversion is performed on numerical data, record the converted data.
[0036] Further as a preferred implementation of this embodiment, the step of constructing regular expression rules for financial indicators specifically includes:
[0037] Obtaining a standa...
Embodiment 2
[0063] Such as figure 2 As shown, this embodiment provides a document parsing and processing system applied in big data analysis, the system includes:
[0064] Construction unit, regular expression rules for constructing financial indicators;
[0065] An acquisition unit, configured to acquire the start characteristic index and the end characteristic index of the financial statement;
[0066] The first positioning unit is configured to use the regular expression rule of the financial index, the start feature index and the end feature index to perform positioning processing of financial statements for documents in different formats;
[0067] The second positioning unit is used to record the financial data and the index name and time corresponding to the financial data after positioning the data in the financial statement;
[0068] The conversion unit is used to perform unit conversion on numerical data and record the converted data.
[0069] Further as a preferred implement...
Embodiment 3
[0095] This embodiment provides a document parsing and processing device applied in big data analysis, the device comprising:
[0096] at least one processor;
[0097] at least one memory for storing at least one program;
[0098] When the at least one program is executed by the at least one processor, the at least one processor is made to implement the steps of the document parsing and processing method applied in big data analysis as described in Embodiment 1 above.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com