Method and device for determining link level in website

A website and level technology, applied in the level field, can solve problems such as failure to access normally, dead links, invalid URLs, etc.

Inactive Publication Date: 2017-04-12
BEIJING QIHOO TECH CO LTD +1
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The third aspect, as time goes by, the URL will become invalid and become a dead link, which cannot be accessed normally
[0008] These spam, duplicate, and invalid links are mixed with valid links. If search engines include them indiscriminately, on the one hand, the original tight site crawling quota will be heavily occupied; on the other hand, from the user's point of view , most of the webpages crawled by search engines have no reference value

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for determining link level in website
  • Method and device for determining link level in website
  • Method and device for determining link level in website

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0092] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0093] Such as figure 1 As shown in FIG. 1 , it is a schematic flow chart of the method for determining the link level in a website provided by Embodiment 1 of the present invention. The subject of execution of the method provided in this embodiment may be a device for determining a link level in a website provided on the server side. Such as figure 1 As shown, the method for determining the link level within the website includes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method and apparatus for determining the grades of links within a website are disclosed. The method comprises: determining the category which the link belongs to on the basis of the link address of the link within a website; acquiring the category quality grade corresponding to the category which the link belongs to; extracting the link-value of the link; determining the grade of the link on the basis of the category quality grade and the link-value of the link. The technical solution provided in the present invention can accurately identify the valuable links within the website, and provide guiding crawl references for search engines, so that the search engines can reasonably allocate crawl flow among a number of links within the website, and ensure that the high valuable links are recorded.

Description

technical field [0001] The invention relates to a computer information processing technology, in particular to a method and device for determining the link level in a website. Background technique [0002] A web crawler (also known as a web spider) is a program or script that automatically obtains information on the World Wide Web according to certain rules. Search engines use web crawlers to download all web pages from hundreds of millions of sites on the Internet for analyzing web page data and building indexes. The Internet is always generating new web pages and updating old web pages, so web crawlers also need to work non-stop to ensure that search engines can have the latest Internet web page mirrors. For the sake of search effect, crawlers always hope to index web pages faster. However, crawling webpages by crawlers will occupy server resources of the website. If the frequency of crawling exceeds the tolerance range of the website, it will affect the normal visit of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/951
Inventor 魏少俊
Owner BEIJING QIHOO TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products