Multi-record type dynamic webpage information extraction method based on visual block
A webpage information and extraction method technology, applied in digital data information retrieval, website content management, network data retrieval, etc., can solve problems such as data extraction, complexity of webpage layout, and low accuracy of webpage layout
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0067] The main purpose of web page data record extraction is to obtain effective data records from different web pages. The present invention creates a four-layer multi-record dynamic web page data extraction model, such as figure 1 As shown, and according to this data model, a multi-record dynamic web page data extraction scheme is proposed, and its process state diagram is as follows figure 1 shown.
[0068] The present invention will be described in further detail below in conjunction with the accompanying drawings.
[0069] figure 2 It is a flowchart of a method for extracting dynamic multi-record web page information according to one aspect of the present invention. As shown in the figure, the method includes the following steps:
[0070] Step1: Web page parsing and rendering;
[0071] First determine the target webpage, and obtain the link address of the target webpage. Through the browser kernel or interface, parse and render the target webpage to obtain its visu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap