Webpage splitting method
A webpage segmentation and webpage technology, applied in the Internet field, can solve the problems of destroying the webpage structure and inconvenience for users to browse, and achieve the effect of high execution efficiency, avoiding separation, and easy realization
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0087] Consistent with the previous embodiment, the web page segmentation method of this embodiment includes segmentation based on multi-line blocks and merging based on topic blocks. The two steps in this embodiment are introduced respectively below.
[0088] 1. Segmentation based on multi-line blocks. Such as Figure 4 As shown, the segmentation method based on multi-line blocks includes the following steps:
[0089] Step 1: Traverse the DOM tree from bottom to top. Set the node's multiline block attribute value.
[0090] Each web page can be represented by a DOM tree, and the DOM tree representation of the web page can be obtained through the interface provided by the browser. A bottom-up traversal of the DOM tree is performed, and if a leaf node is encountered during the traversal, its multi-line block attribute is set to zero. If a non-leaf node is encountered, its child nodes are traversed first, and the y coordinate value of the child node is recorded. After all t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 