Method and device for sequencing search results

A field and attribute technology, applied in the field of sorting search results, can solve problems that affect the correct ranking of other URLs, difficult semantic analysis, unfair high ranking, etc., to avoid inaccurate correlation calculations, improve sorting accuracy, The effect of improving search quality

Inactive Publication Date: 2012-10-10
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Due to the short length of the query, it is difficult to perform semantic analysis on it, so the calculation of relevance generally depends on literal matching. There is also a literal match between url and query, and there will also be escapes, so that it is actually irrelevant For example, the query is "Mopan", and the subject of the url is "Mopanshan". The two are not related, but because of the literal match, the url with the subject "Mopanshan" also becomes the search result , if the url still has a higher PageRank at this time, it will make the url have a higher ranking, thereby affecting the correct ranking of other urls;
[0005] 2. At present, there is a common situation on the domestic Internet: Hackers use various technical means to attack authoritative sites such as governments and companies, and then inject low-quality pages in the fields of online games, gaming, and medical trea

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for sequencing search results
  • Method and device for sequencing search results

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to improve the accuracy of the ranking of the search results of the search engine, in the embodiment of the present invention, in the ranking algorithm of the transmission search engine, a third type of parameter—site domain—besides the two parameters of relevance and authority is introduced. to adjust the sorting of search results.

[0023] For the convenience of description, the following briefly introduces the meanings of the parameters in this embodiment.

[0024] Relevance (weight), set to a real number in the interval [0, 1], calculated based on literal matching and limited semantic analysis.

[0025] Authority, set to a real number belonging to the interval [0, ∞), calculated based on PageRank and SiteRank.

[0026] Site domain, also known as site domain attribute, is set to a binary group calculated based on PLSA (Latent Layer Semantic Analysis) and probability density function, where D represents the domain involved in the site, which can also be cal...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of internet information treatment and discloses a method and a device for sequencing search results. The method comprises the following steps of: acquiring corresponding search results and preset field attributions corresponding to the search results according to search keywords input by a user; calculating the field attribution of the search keywords according to the field attribution of each search result; respectively regulating the correlation of each search result with the search keywords and the authority of each search result according to the field attribution of each search result and the field attribution of the search keywords; and sequencing the search results according to the correlation and authority of each search result after regulation. Therefore, the sequencing accuracy of the search results are improved at the aspects of the correlation and the authority, the search quality is effectively improved, and the system performance is promoted.

Description

technical field [0001] The invention relates to the field of Internet information processing, in particular to a method and device for sorting retrieval results. Background technique [0002] With the development of Internet technology, the scope of application of search engine technology is becoming more and more extensive. When traditional search engines respond to users' retrieval needs, they mainly sort the retrieval results according to the following two parameters: one is the retrieval results (also known as url, uniform resource location), and the correlation between the search keywords (also known as query) input by the user, and the second is the authority of the url itself. The so-called authority can be based on the PageRank (page level) of the url or the url's It is determined by the SiteRank (site level) of the site. Generally speaking, the URL with higher relevance to the query ranks higher. In the case of similar correlation, the URL with higher authority rank...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 张子云
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products