Unlock instant, AI-driven research and patent intelligence for your innovation.

Web page cutting method and system

A webpage, the only technology, applied in the field of webpage clipping methods and systems, can solve the problems of difficult clipping, poor versatility, low efficiency, etc.

Active Publication Date: 2021-05-11
CHINA MOBILE GRP GUANGDONG CO LTD +1
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a web page clipping method and system that overcomes the above problems or at least partially solves the above problems, and solves the problems of poor versatility, difficult clipping, and low efficiency in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web page cutting method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0036] Such as figure 1 As shown in the figure, a webpage clipping method is shown, including:

[0037] Obtain the user's target element according to the webpage element clicked or searched by the user, obtain the unique identifier of the target element, obtain the clipping rule set of the target element according to the unique identifier, and extract the webpage layer by layer based on the clipping rule order of the clipping rule set Content, clipping the target element corresponding to the unique identifier;

[0038] Wherein, the clipping rule set includes a unique identifier of the target element, and the unique identifier is a starting point clipping rule of the clip...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a webpage clipping method and system, the method comprising: obtaining a unique identifier of a target element, obtaining a clipping rule set of the target element according to the unique identifier, and layer by layer based on the sequence of clipping rules of the clipping rule set Extracting the content of the webpage, and clipping the target element corresponding to the unique identifier; wherein, the clipping rule set includes the unique identifier of the target element, and the unique identifier is the starting clipping rule of the clipping rule set . When the user clicks or searches for a matching web page element, the clipping rule set of the element is automatically generated according to the rule retrieval function, and the clipping rule set is stored in a unified format. In the subsequent application integration, the clipping tool can be used to follow the clipping rule set instruction process to finally obtain it. web element. Through reverse positioning, while ensuring the success rate, the HTML nodes that need to be traversed to locate a specific element are minimized, and the clipping efficiency is improved.

Description

technical field [0001] The present invention relates to the technical field of communications, and more specifically, to a method and system for cutting out webpages. Background technique [0002] Web page information clipping, that is, using the web page as the information source, and then extracting the target information from the information source. Most of the data on the web page is described in a semi-structured Hypertext Markup Language (HTML), but due to the lack of description of the data itself, the application cannot directly parse and use the web page. A large amount of information results in a great waste of resources. Web page information clipping The purpose of clipping is to extract the hidden target information in the semi-structured HTML page, and express it in a more structured and semantically clearer form, so that users can query data in the web page and applications directly Make use of data in webpages for convenience. [0003] When clipping webpage...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/958
CPCG06F16/986
Inventor 何应腾陈晓鸿林湧双过松周剑雄文永江陈俊儒董灿佳蒋业
Owner CHINA MOBILE GRP GUANGDONG CO LTD