Method for automatically generating wrappers based on DOM (Document Object Model) tree abstraction
A technique for automatically generating wrappers from DOM trees, applied in the field of cloud computing, that achieves good accuracy and time performance while reducing the use of storage resources.
Embodiment Construction
[0035] The technical solution of the present invention is described in detail below with reference to the accompanying drawings.
[0036] The method of the present invention for automatically generating wrappers based on DOM tree abstraction comprises the following steps:
[0037] Step S1, wrapper generation phase:
[0038] Step S11, the user inputs a set of web pages; web page preprocessing removes impurity information from the source code, and each page is parsed into a DOM tree, yielding a DOM tree set;
[0039] Step S12, the DOM trees are merged: each DOM tree is traversed, child nodes with the same label are merged, and each node is marked with its path features, so that the DOM tree set is converted into a merged tree set;
[0040] Step S13, an abstraction operation is performed on the merged tree set to obtain an abstract tree, and the abstract tree is stored in the database;
[0041] Step S14, the path features of the structured data in the merged tree are determined according to the confi...
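Steps S11 and S12 can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions that the text above does not confirm: "impurity information" is taken to mean script and style content (comments are also ignored), "label" is taken to mean the HTML tag name, and the "path feature" of a node is taken to be the slash-separated chain of tag names from the root. The names Node, TreeBuilder, parse_page, merge_same_tag_children and path_texts are hypothetical. The abstraction operation of step S13 is not sketched, because the text above does not describe it.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser

# Assumption: these tags stand in for the "impurity information" of step S11.
SKIPPED_TAGS = {"script", "style"}
# Void elements never receive a closing tag, so they are not pushed on the stack.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


@dataclass
class Node:
    tag: str
    path: str  # assumed path feature: "/html/body/ul/li"
    children: list = field(default_factory=list)
    texts: list = field(default_factory=list)


class TreeBuilder(HTMLParser):
    """Builds a simplified DOM tree while dropping skipped tags (step S11)."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root", "")
        self.stack = [self.root]
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self.skip_depth += 1
            return
        if self.skip_depth:
            return
        parent = self.stack[-1]
        node = Node(tag, f"{parent.path}/{tag}")
        parent.children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth:
            return
        # Pop back to the nearest matching open tag; ignore stray closers.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text and not self.skip_depth:
            self.stack[-1].texts.append(text)


def parse_page(html):
    """Step S11 (sketch): preprocess one page and parse it into a tree."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def merge_same_tag_children(node):
    """Step S12 (sketch): merge sibling nodes that share a tag, recursively."""
    groups = {}  # tag -> merged child; dicts keep first-seen order
    for child in node.children:
        group = groups.setdefault(child.tag, Node(child.tag, child.path))
        group.texts.extend(child.texts)
        group.children.extend(child.children)
    merged = Node(node.tag, node.path, texts=list(node.texts))
    merged.children = [merge_same_tag_children(g) for g in groups.values()]
    return merged


def path_texts(node, out=None):
    """Collect the text found under each path feature of a merged tree."""
    out = {} if out is None else out
    if node.texts:
        out.setdefault(node.path, []).extend(node.texts)
    for child in node.children:
        path_texts(child, out)
    return out
```

Under these assumptions, a merged tree set would be obtained by applying merge_same_tag_children(parse_page(page)) to every page in the input set. Repeated records such as list items collapse into a single node per path, which is what lets a path feature identify all instances of one kind of structured data.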