Method for automatically generating wrappers for complex pages
An automatic wrapper generation technology, applied in fields such as instruments, program control devices, and special data processing applications. It addresses problems such as the high skill level required to build wrappers, unsuitability for large-scale web data integration, and wrapper failures, and it achieves high extraction accuracy.
Examples
Embodiment 1
[0039] Embodiment 1: Figure 1 shows the basic flow of the automatic wrapper generation system. The system consists of three main parts: the data-rich area (DS) identification sub-module, the data record (DR) identification sub-module, and the wrapper generator sub-module.
[0040] Data-rich area (DS) identification sub-module: from a data point of view, the DS is the collection of data records on a Web page. A category list page contains not only the data record area but also regions such as advertisement bars and navigation bars. By comparing the HTML tag trees of two pages (here, list pages) generated from the same template, the data-rich area that the user is interested in can be located quickly. Because a list page is generated from a predefined template, DRs often appear on the page in repeated form. Observation shows that the data-rich area is often accompanied by paging navigation nearby. We ...
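The tag-tree comparison described in [0040] can be illustrated with a minimal sketch. This is not the patented procedure, whose details are truncated above. It assumes that two list pages from the same template have identical tag structure everywhere except in one subtree (the data-rich area, where the number of records differs), and it ignores text and attributes. All names here (`Node`, `TagTreeBuilder`, `build_tag_tree`, `locate_differing_subtree`) are hypothetical and chosen for illustration only.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so the parser must not descend into them.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class Node:
    """One element of a tag-only tree (text and attributes are ignored)."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []


class TagTreeBuilder(HTMLParser):
    """Builds a tree of tag names from an HTML string."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Climb to the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent


def build_tag_tree(html):
    builder = TagTreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def signature(node):
    """Nested tuple describing the tag structure below a node."""
    return (node.tag, tuple(signature(c) for c in node.children))


def locate_differing_subtree(a, b, path=()):
    """Return the tag path of the deepest node that contains all structural
    differences between the two trees, or None if they are identical.

    Assumption (not from the source): the differences sit in one subtree.
    """
    if signature(a) == signature(b):
        return None
    same_layout = len(a.children) == len(b.children) and all(
        x.tag == y.tag for x, y in zip(a.children, b.children)
    )
    if same_layout:
        diffs = [(x, y) for x, y in zip(a.children, b.children)
                 if signature(x) != signature(y)]
        if len(diffs) == 1:
            x, y = diffs[0]
            return locate_differing_subtree(x, y, path + (x.tag,))
    return path


# Two made-up list pages: same navigation bar and pager, different record counts.
PAGE_1 = ("<html><body><div><a></a></div>"
          "<ul><li><a></a></li><li><a></a></li></ul>"
          "<div><a></a><a></a></div></body></html>")
PAGE_2 = ("<html><body><div><a></a></div>"
          "<ul><li><a></a></li><li><a></a></li><li><a></a></li></ul>"
          "<div><a></a><a></a></div></body></html>")

ds_path = locate_differing_subtree(build_tag_tree(PAGE_1), build_tag_tree(PAGE_2))
print("/" + "/".join(ds_path))  # candidate data-rich area
```

In this toy input the navigation and paging blocks are structurally identical across both pages, so only the record list is reported. Real pages would need a more tolerant comparison, since advertisements and other regions can also vary between pages.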