Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Webpage path navigation method and device, electronic equipment and storage medium

A path navigation and webpage technology, applied in the Internet field, can solve problems such as wasteful execution time-consuming, invalid crawler technology, etc., and achieve the effect of improving efficiency and reducing the number of traversals

Pending Publication Date: 2021-01-15
MIGU CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Knowledge can be obtained from the Internet through crawler technology, but the application of crawler technology usually requires clear web page addresses
However, the address of the web page is sometimes updated irregularly, which leads to the invalidation of crawler technology.
And although there are some depth-first crawlers based on a certain topic, which determine the web page address by traversal, the exhaustive link method is adopted, which wastes a lot of time-consuming execution of URLs that have nothing to do with the topic.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Webpage path navigation method and device, electronic equipment and storage medium
  • Webpage path navigation method and device, electronic equipment and storage medium
  • Webpage path navigation method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The specific embodiments of the present invention will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solution of the present invention more clearly, but not to limit the protection scope of the present invention.

[0041] The web page path navigation method, device, electronic device and storage medium according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0042] figure 1 A flowchart showing a web page path navigation method provided in an embodiment of the present invention, as shown in figure 1 As shown, the web page path navigation method provided in the embodiments of the present invention specifically includes the following content:

[0043] S101: Receive an input request from a user.

[0044] Wherein, the input request includes a set goal, for example, it is necessary to collect award information of f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a webpage path navigation method and device, electronic equipment and a storage medium, and the webpage path navigation method comprises the steps: receivingan input request of a user; querying at least one access task corresponding to the input request from a pre-obtained access task set based on the input request, with the at least one access task comprising a webpage navigation sequence from the initial webpage to the target webpage; obtaining a frequent item set of the URL regular pattern of at least one access task, wherein the frequent item setcomprises the URL regular pattern of each access task corresponding to a plurality of webpage navigation sequences from the initial webpage to the target webpage; and determining a navigation path ofthe target webpage according to the frequent item set. According to the webpage path navigation method disclosed by the invention, the path navigation of a crawler can be automatically and intelligently analyzed according to the association rule mining algorithm, so that the traversal times of the webpage path can be reduced, and the knowledge extraction efficiency in the webpage can be improved.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a web page path navigation method, device, electronic equipment and storage medium. Background technique [0002] At present, HTML, the standard language of Web pages, cannot meet the needs of knowledge representation. Usually, standard language specifications, RDF, RDFS and OWL are embedded in HTML to represent the knowledge of web pages. In order to facilitate knowledge sharing, it is necessary to extract knowledge from web pages (such as the Semantic Web) and form a knowledge graph. Knowledge can be obtained from the Internet through crawler technology, but the application of crawler technology usually requires a clear web page address. However, the web page address is sometimes updated irregularly, causing the crawler technology to fail. And although there are some depth-first crawlers based on a certain topic, which determine the web page address by traversal, the exh...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/955G06F16/36
CPCG06F16/9566G06F16/367
Inventor 徐晶霍振坤王军宁李琳张晓颖
Owner MIGU CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products