Method and device for establishing relevant-webpage data base

A technology of associating webpages and establishing methods, which is applied in the field of databases, can solve the problems of easy misoperation, low practicability, and low recall rate of identification methods, so as to avoid repeated crawling of webpages, increase coverage, and high recognition accuracy Effect

Inactive Publication Date: 2014-03-05
BEIJING QIHOO TECH CO LTD +1
View PDF8 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the recall rate of this identification method is low, and the page turning of many websites does not have these keywords, such as "http: / / cq.ABC.com / lvshi / o12 / ", "http: / / bbs.BCA .com / t661_10", "http: / / china.BCD.com / product / 20110617 / 2647", but these pages are still turning pages, which makes these identification methods easy to cause misuse and low practicability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for establishing relevant-webpage data base
  • Method and device for establishing relevant-webpage data base
  • Method and device for establishing relevant-webpage data base

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0071] refer to figure 1 , which shows a flow chart of the steps of an embodiment of a method for establishing an associated webpage database according to an embodiment of the present invention, which may specifically include the following steps:

[0072] Step 101, judging whether the webpage captured includes the associated webpage URL pattern; if so, then execute step 102;

[0073] It should be noted that, the function of the se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and device for establishing a relevant-webpage data base. The method comprises the steps that whether a grabbed webpage comprises a relevant-webpage URL mode is judged; if yes, the relevant-webpage URL mode is obtained; a relevant webpage corresponding to the relevant-webpage URL mode is obtained; the relevant-webpage data base is established through the relevant webpage corresponding to the relevant-webpage URL mode. According to the method and device for establishing the relevant-webpage data base, the relevant-webpage URL mode is extracted based on the currently-grabbed webpage, the relevant-webpage data base is established through the relevant webpage corresponding to the relevant-webpage URL mode, repeated grabbing web pages is avoided, occupied system resources are reduced, and the establishment efficiency of the data base is greatly improved.

Description

technical field [0001] The invention relates to the technical field of databases, in particular to a method for establishing an associated web page database and a device for establishing an associated web page database. Background technique [0002] With the development of the Internet, more and more information is presented on the Internet through webpages for users to query. Similarly, querying data on the Internet through search engines has become the most commonly used data search method. [0003] Search engines need to adopt different scheduling strategies for different types of web pages when indexing web pages. The identification of web page types is a basic task, and the identification of page turning (Page turning) web pages is a relatively critical task. The so-called page-turning webpage refers to viewing the previous page, the next page or any existing non-current page of the paging file. Flipping the web page can change the content in the physical book or the m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/955
Inventor 王智广
Owner BEIJING QIHOO TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products