Industry-oriented topic search method

A topic search method, applied in the field of information retrieval, that addresses problems such as reduced globality, topic drift, and the inability to guarantee content reliability, and that improves the accuracy, reliability, and coverage of search results.

Active Publication Date: 2017-07-25
UNIV OF ELECTRONICS SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

This method can build a topic database with low redundancy; however, when results are sorted by degree of relevance, the retrieval results are highly relevant to the topic but it reduces t...


Embodiment Construction

[0044] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0045] Figure 1 shows a schematic flowchart of the industry-oriented topic search method of the present invention. The method comprises the following steps:

[0046] A. Initialize the crawl seed sites seedUrls, the crawler's crawling time t_1, the topic keyword vector vector_topic, and the time interval t_2 after which the crawler crawls again; build the initial to-be-crawled queue Url_queue from the seed sites seedUrls;

[0047] B. Determine whether the crawling time t_1 has been reached; if so, end the operation; if not, further determine whether the queue Url_que...
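
Steps A and B can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes t_1 is a crawl duration in seconds, the function names `init_crawl` and `crawl_once` and the `fetch_links` callback are hypothetical, and because the text of step B is truncated, the sketch simply stops when Url_queue is empty.

```python
import time
from collections import deque


def init_crawl(seed_urls, t1_seconds, vector_topic, t2_seconds):
    """Step A: collect the initial parameters and build Url_queue from seedUrls."""
    return {
        "url_queue": deque(seed_urls),               # Url_queue, seeded from seedUrls
        "deadline": time.monotonic() + t1_seconds,   # t_1, assumed to be a duration
        "vector_topic": list(vector_topic),          # topic keyword vector
        "recrawl_interval": t2_seconds,              # t_2, interval before re-crawling
    }


def crawl_once(state, fetch_links, now=time.monotonic):
    """Step B: stop once t_1 is reached; otherwise keep taking URLs from Url_queue.

    fetch_links is a hypothetical callback returning the outgoing links of a URL.
    Stopping on an empty queue is an assumption; the source text is cut off here.
    """
    visited = []
    seen = set(state["url_queue"])
    while now() < state["deadline"] and state["url_queue"]:
        url = state["url_queue"].popleft()
        visited.append(url)
        for link in fetch_links(url):
            if link not in seen:          # enqueue each URL at most once
                seen.add(link)
                state["url_queue"].append(link)
    return visited
```

A deque gives constant-time removal from the front of the queue, and a monotonic clock keeps the deadline check unaffected by system clock adjustments.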


Abstract

The invention discloses an industry-oriented topic search method. The method comprises the steps that an initial to-be-crawled queue is initialized and established, whether crawling time of a crawler is due and whether the to-be-crawled queue is empty are judged, a Shark-Search-Advanced algorithm is adopted to calculate a relevancy value between a webpage and a topic, a PageRank-Advanced algorithm is adopted to calculate a webpage connection value and a webpage ranking value, and whether a secondary crawling time interval of the crawler is due is judged. Through the method, the accuracy and reliability of a search result can be effectively improved, therefore, a high-accuracy high-coverage retrieval result is acquired efficiently, and it is guaranteed that a search engine can respond to a search demand of a user for a specific industry with high efficiency, high accuracy and high coverage.
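
The abstract names two scoring components without giving their formulas in this excerpt. As a rough point of reference only, the sketch below shows two well-known baselines that such components are commonly built on: cosine similarity between a page's term weights and the topic keyword vector, and textbook PageRank by power iteration. Neither is the Shark-Search-Advanced or PageRank-Advanced algorithm of the invention; the function names, the dictionary-based inputs, and the damping factor of 0.85 are assumptions made for illustration.

```python
import math


def cosine_relevance(page_terms, topic_terms):
    """Cosine similarity between two {term: weight} dicts (baseline relevance)."""
    dot = sum(w * topic_terms.get(t, 0.0) for t, w in page_terms.items())
    norm_p = math.sqrt(sum(w * w for w in page_terms.values()))
    norm_t = math.sqrt(sum(w * w for w in topic_terms.values()))
    if norm_p == 0.0 or norm_t == 0.0:
        return 0.0  # an empty vector has no defined direction
    return dot / (norm_p * norm_t)


def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration over {page: [outgoing links]}."""
    pages = set(links) | {q for outs in links.values() for q in outs}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for p in pages:
            outs = links.get(p, [])
            if outs:
                share = damping * rank[p] / len(outs)
                for q in outs:
                    new_rank[q] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                for q in pages:
                    new_rank[q] += damping * rank[p] / n
        rank = new_rank
    return rank
```

For example, `cosine_relevance({"steel": 1.0}, {"steel": 2.0})` gives a similarity of 1 because the two vectors point in the same direction, while pages sharing no terms with the topic score 0.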

Description

Technical field

[0001] The invention belongs to the technical field of information retrieval, and in particular relates to an industry-oriented subject search method.

Background technique

[0002] The Internet has become the most important way for people to disseminate information and acquire content. The general search engines represented by Google, Baidu, and Bing provide great convenience for people to obtain information quickly and accurately on the Internet. However, general-purpose search engines need to build a huge search database, and the search content is oriented to the entire network. When users need to conduct vertical searches on specific industries, their accuracy rate is relatively low and resource consumption is large. At the same time, the vertical search engines represented by Qunar and Sogou Shopping have built their own databases for special fields, with heavy industry constraints, insufficient application flexibility, and unsatisfactory recall rates. ...


Application Information

IPC(8): G06F17/30
CPC: G06F16/9535
Inventor: 刘道桂, 韦云凯, 刘强, 李源颢, 蒲勇全, 陈怡瑾
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA