Web page element searching method and device, and computing equipment

A technology of page elements and current pages, applied in the field of crawler, can solve the problems of loss of selected content, no memory function, and inability to implement persistence, and achieve the effect of improving the probability of successful extraction, reducing development workload, and improving versatility

Pending Publication Date: 2022-01-11
海南车智易通信息技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, in some cases, the XPath collected by XPath Helper contains randomly generated styles, which will change after the website is updated and released, causing the XPath to fail
In addition, the XPath Helper Chrome plug-in also has the following problems: it cannot be persisted on the ground, has no memory function, the selected content is lost after the page is refreshed, multiple fields cannot be selected at the same time, manual copying is required, and "crowded elements" cannot be selected , the element pointed to by the mouse is the topmost element
[0013] In summary, the above-mentioned existing crawler technology has problems such as difficulty in developing web page element plug-ins, and insufficient adaptability to dynamic pages.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web page element searching method and device, and computing equipment
  • Web page element searching method and device, and computing equipment
  • Web page element searching method and device, and computing equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0050] Aiming at problems such as high plug-in development difficulty and insufficient adaptability to dynamic pages in existing web page element search methods, the present invention provides a web page element search method, which can reduce plug-in development difficulty and expand the applicable scope of plug-ins.

[0051] figure 1 A schematic diagram of a web page element search system 100 according to an embodiment of the pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a web page element searching method and device, and computing equipment. The web page element searching method comprises the following steps: in response to a request of a user for searching web page elements on a Chrome browser, sending a domain name and a URL of a current site to a server side, and then obtaining a template returned by the server side, wherein a target field and an extraction rule of the target field are recorded in the template; when the extraction rule of the target field is absolute positioning, extracting the target field in the current page by using one of an Xpath selector, a CSS selector and an ID selector; when the extraction rule of the target field is relative positioning, extracting a target container in the current page, and extracting the target field in the target container; and taking the extracted target field as a search result of the web page element. The invention also discloses corresponding computing equipment and device.

Description

technical field [0001] The invention relates to the field of reptiles, in particular to a web page element search method and device and computing equipment. Background technique [0002] Grabbing data from Internet websites is a common requirement. The usual way is to write a program or script to obtain the response from the target website, and then parse out the required element fields from the response according to your own needs. When there are many fields, many target websites, and target elements are sporadic according to data changes, the difficulty and workload of parsing element fields will increase accordingly. [0003] One of the ways to find web page elements is manual parsing. This method requires programmers to customize a set of crawling programs for each target website, obtain the response of the target website through network requests, and then parse them one by one from the response HTML. required fields. Before parsing the fields, the developer needs to a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/951G06F16/955
CPCG06F16/951G06F16/9566
Inventor 刘毅邢万祥
Owner 海南车智易通信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products