Configurable data analysis method and computer readable storage medium
A configurable data parsing technology applied in the field of Internet data crawling. It addresses the inability of existing approaches to parse JSON-format web pages, their limited flexibility in adapting to different web pages, and their inability to adapt to encapsulation modes. Its effects are improved parsing efficiency and flexibility, a reduced amount of data to parse, and easier machine recognition.
Examples
Embodiment 1
[0039] Referring to Figure 1, a configurable data parsing method comprises the following steps. A parsing configuration page is created, and on it the URL of the target webpage to be captured, the parsing type, the parsing attribute, and the name of the logical table used to save the parsing results are configured and then submitted. The logical table is generally a two-dimensional table, such as an Excel sheet. The parsing attribute includes a parsing area and row positioning information, or includes row positioning information only. A field configuration page is then created, and on it the field name of each field in the logical table is configured. Each field name corresponds to a data object to be extracted: for example, the field name reg_code corresponds to the number string of the organization code, and the field name law_person corresponds to the name of the legal representative; the field name...
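The configuration described above can be pictured as a small record. The following is only a minimal sketch: the class name `ParseConfig`, its attribute names, the example URL, the locator strings, and the `"html"` type value are all assumptions made for illustration and do not come from the patent text; only the field names `reg_code` and `law_person` are taken from it.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ParseConfig:
    """Hypothetical container for one parsing configuration."""
    url: str                      # URL of the target webpage to capture
    parse_type: str               # parsing type, e.g. "html" or "json" (assumed values)
    row_locator: str              # row positioning information (always required)
    table_name: str               # name of the logical table that stores the results
    area_locator: Optional[str] = None  # parsing area; may be omitted
    field_names: List[str] = field(default_factory=list)  # one entry per logical-table column

    def has_area(self) -> bool:
        # True when the parsing attribute includes a parsing area as well as rows.
        return self.area_locator is not None


# Parsing attribute with both a parsing area and row positioning information.
full = ParseConfig(
    url="https://example.com/companies",   # placeholder URL
    parse_type="html",
    row_locator="tr",
    table_name="company_info",
    area_locator="table#result",
    field_names=["reg_code", "law_person"],
)

# Parsing attribute with row positioning information only.
rows_only = ParseConfig(
    url="https://example.com/companies",
    parse_type="html",
    row_locator="tr",
    table_name="company_info",
    field_names=["reg_code", "law_person"],
)
```

Keeping the parsing area optional mirrors the two variants of the parsing attribute named in the text.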
Embodiment 2
[0062] This embodiment differs from Embodiment 1 in that the field configuration further includes configuring a field identifier for each field. When data objects are extracted, each data object is matched to its field by the field identifier and then mapped into the logical table.
[0063] The data parsing method performs the following steps: on the parsing configuration page, configure the URL of the target webpage to be captured, the parsing type, the parsing attribute, and the name of the logical table used to save the parsing result, and submit after completion; the parsing attribute includes a parsing area and row positioning information, or includes row positioning information only; as shown in Figure 14, configure the field name and field identifier of each field in the logical table on the field configuration page; create a blank logical table according to the logical table name, and write each field name and corresponding fie...
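Matching by field identifier can be sketched as follows. The text does not say what a field identifier looks like, so this sketch assumes it is the key under which a value appears in a JSON record; the identifiers `orgCode` and `legalRep`, the sample payload, and the function name `map_rows` are hypothetical.

```python
import json

# Field name -> field identifier. The identifiers are invented for illustration.
FIELD_IDENTIFIERS = {
    "reg_code": "orgCode",
    "law_person": "legalRep",
}


def map_rows(rows, field_identifiers):
    """Map each raw record into a logical-table row keyed by field name.

    Values are looked up by identifier, so the order of keys in the source
    record does not matter; a missing identifier yields None.
    """
    table = []
    for row in rows:
        table.append({name: row.get(ident) for name, ident in field_identifiers.items()})
    return table


# Sample payload: keys arrive in a different order and include an unused key.
payload = json.loads(
    '{"data": [{"legalRep": "Zhang San", "orgCode": "12345678-9", "extra": "x"},'
    ' {"orgCode": "98765432-1"}]}'
)
table = map_rows(payload["data"], FIELD_IDENTIFIERS)
```

Because lookup is by identifier rather than by position, unused keys are ignored and a reordered source record still lands in the right columns.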
Embodiment 3
[0069] A computer-readable storage medium on which a computer program is stored; when the program is executed by a processor, the following steps are performed: on the parsing configuration page, configure the URL of the target webpage to be captured, the parsing type, the parsing attribute, and the name of the logical table used to save the parsing result, and submit after completion; the parsing attribute includes a parsing area and row positioning information, or includes row positioning information only; configure the field name of each field in the logical table on the field configuration page, each field name corresponding to a data object to be extracted; create a blank logical table according to the logical table name and write each field name into it, the order of the fields being consistent with the order in which the data objects are extracted during parsing; capture the target webpage corresponding to ...
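The step of creating a blank logical table and filling it by position might look like the sketch below. It assumes the logical table is held as a list of rows and exported as CSV; the function names and the sample values are hypothetical, and the patent text does not prescribe any storage format.

```python
import csv
import io


def create_logical_table(field_names):
    """Return a blank logical table: a header row holding the field names in order."""
    return [list(field_names)]


def append_row(table, data_objects):
    """Append one row of extracted data objects.

    Values are matched to fields purely by position, so the extraction
    order must be the same as the field order in the header.
    """
    if len(data_objects) != len(table[0]):
        raise ValueError("number of data objects does not match number of fields")
    table.append(list(data_objects))


def to_csv(table):
    """Serialize the two-dimensional table as CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(table)
    return buf.getvalue()


table = create_logical_table(["reg_code", "law_person"])
append_row(table, ["12345678-9", "Zhang San"])
csv_text = to_csv(table)
```

The length check is a cheap guard for the positional scheme: if a row yields more or fewer data objects than there are fields, the mismatch is reported instead of silently shifting columns.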