Method and device for generating searching result

A technology for generating search results from search terms, applied in the field of Internet applications. It addresses problems such as poor relevance of search results, the underrating of sites in industries with a low degree of Internet adoption, and the crowding out of web pages that are well known within their field. Its effects are to improve domain-relevance ranking, reduce the number of user interactions, and reduce server load.

Active Publication Date: 2013-07-03
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

Measuring the authority of websites and URLs through hyperlinks and similar relationships usually reflects only how popular a web page is across the entire Internet. However, some non-mainstream or traditional industries have a low degree of Internet adoption. Examples include petroleum sites, professional dance sites, poetry sites, and other sites that are popular only within their professional field. Within their own field these sites should be highly authoritative, but existing methods do not reflect that authority accurately.
As a result, some globally authoritative web pages rank too high, while web pages that are well known within the relevant field are crowded out.
For example, for the search "the first lesson of elementary school Chinese", educational sites should rank relatively high, but under existing methods document, video, and blog sites often rank near the top. In short, site authority is currently measured mainly by popularity signals such as hyperlinks, and not by domain expertise. This leads to poor relevance of search results and makes it difficult for users to find the results they want, especially for niche and popular specialized sites. It inevitably increases the number of interactions between the user and the system, putting greater pressure on the server.

Examples

Embodiment 1

[0066] Figure 1 is a flow chart of the method for generating search results provided in this embodiment. As shown in Figure 1, the method includes:

[0067] Step S101: in advance, use the anchor text of web pages or the click text of users to obtain the terms of each site and the weight of each term, and establish a site model for each site.

[0068] A site usually includes multiple web pages, and a web page includes multiple anchor texts. Anchor text (hyperlink text) describes and annotates the hyperlink (URL, uniform resource locator) it corresponds to. From the crawled network resources, the anchor text in each web page and its corresponding URL are obtained as anchor text data.
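
The collection of anchor text data described in [0068] could be sketched roughly as follows. This is only an illustration under stated assumptions, not the patent's implementation: the names `AnchorCollector` and `anchor_text_by_site` are hypothetical, the crawled pages are assumed to be available as a mapping from page URL to HTML, and each anchor text is assumed to be credited to the site (host) of the URL it points to.

```python
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class AnchorCollector(HTMLParser):
    """Collects (anchor text, absolute target URL) pairs from one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.pairs = []
        self._href = None   # target of the <a> element currently open, if any
        self._text = []     # text fragments seen inside that element

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace inside the anchor text.
            text = " ".join("".join(self._text).split())
            if text:
                self.pairs.append((text, self._href))
            self._href = None


def anchor_text_by_site(pages):
    """Group anchor texts by the host of the URL they point to.

    Assumption: `pages` maps page URL -> HTML source of a crawled page.
    """
    by_site = defaultdict(list)
    for page_url, html in pages.items():
        collector = AnchorCollector(page_url)
        collector.feed(html)
        collector.close()
        for text, target in collector.pairs:
            by_site[urlparse(target).netloc].append(text)
    return dict(by_site)
```

Using the host name as the site key is a simplification; a real system would likely need its own rule for deciding which URLs belong to the same site.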

[0069] Collect statistics on the user's historical behavior to obtain user click data. For example, a user enters the search term (query) "Shantou Telecom" on a search engine, and clicks the text titled "China Telecom Online Business Hall·Guangdong | Fanfang Electronic S...

Embodiment 2

[0176] Figure 3 is a structure diagram of the device for generating search results provided by this embodiment. As shown in Figure 3, the device includes:

[0177] The site model building module 10 is used to obtain, in advance, the terms of each site and the weight of each term from the anchor text of web pages or the click text of users, and to establish a site model for each site.

[0178] The site model includes at least terms of the site and weights of each term.
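
A site model of this kind, holding terms and their weights, could be represented as a plain mapping from term to weight. The sketch below is an assumption-laden illustration: the function name `build_site_model` is hypothetical, and the weighting rule (term frequency normalized so the weights sum to 1) is chosen only for simplicity, since the excerpt does not state how the weights are computed.

```python
from collections import Counter


def build_site_model(texts, tokenize=str.split):
    """Return {term: weight} for one site.

    Assumptions: `texts` holds the anchor texts or click texts collected
    for the site, and the weight is the normalized term frequency.
    """
    counts = Counter()
    for text in texts:
        counts.update(term.lower() for term in tokenize(text))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {term: n / total for term, n in counts.items()}
```

The default whitespace tokenizer only suits space-delimited text; for Chinese anchor text a word segmenter would have to be passed as `tokenize`.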

[0179] A site usually includes multiple web pages, and a web page includes multiple anchor texts. The anchor text describes and annotates its corresponding URL. From the crawled network resources, the anchor text in each web page and its corresponding URL are obtained as anchor text data.

[0180] Collect statistics on the user's historical behavior and obtain user click data. For example, a user enters the search term "Shantou Telecom" on a search engine, and clicks on the text titled "China Tel...

Abstract

The invention provides a method and a device for generating search results. The method comprises the following steps: S1, using the anchor text of web pages or the click text of users in advance to obtain the terms of each site and the weight of each term, and establishing a site model for each site; S2, acquiring a search term from the user and retrieving the web pages that match it; S3, computing the domain relevance between the search term and the site of each matched web page through a correlation calculation that uses the search term and the site models established in step S1; and S4, ranking the matched web pages according to the domain relevance between the search term and the site of each matched web page to generate the search results. Compared with the prior art, the method improves the domain-relevance ranking of search results, helps users find results quickly, improves the efficiency of both the user and the system, reduces the number of interactions, and lightens the load on the server.
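
Steps S3 and S4 can be illustrated with a minimal sketch. The abstract does not say which correlation calculation is used, so cosine similarity between the query terms and the site model is assumed here purely for illustration. The names `domain_relevance` and `rank_pages` are hypothetical, site models are assumed to be `{term: weight}` mappings keyed by host name, and the matched pages from S2 are assumed to arrive as a list of URLs.

```python
import math
from urllib.parse import urlparse


def domain_relevance(query_terms, site_model):
    """Cosine similarity between a query and a site model (assumed measure).

    Each distinct query term is given weight 1; `site_model` maps
    term -> weight.
    """
    terms = set(query_terms)
    if not terms or not site_model:
        return 0.0
    dot = sum(site_model.get(t, 0.0) for t in terms)
    q_norm = math.sqrt(len(terms))
    s_norm = math.sqrt(sum(w * w for w in site_model.values()))
    return dot / (q_norm * s_norm) if s_norm else 0.0


def rank_pages(query, matched_urls, site_models):
    """Order matched pages by the domain relevance of their site (S3, S4)."""
    terms = query.lower().split()

    def score(url):
        model = site_models.get(urlparse(url).netloc, {})
        return domain_relevance(terms, model)

    # sorted() is stable, so pages with equal scores keep retrieval order.
    return sorted(matched_urls, key=score, reverse=True)
```

This sketch sorts on domain relevance alone; how that score would be combined with other ranking factors is not covered by the excerpt.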

Description

Technical field

[0001] The invention relates to the technical field of Internet applications, in particular to a method and device for generating search results.

Background

[0002] With the continuous development of information and network technology, search engines have become an important way for people to obtain information. By entering a search term (query) in a search engine, the user obtains the search results that the engine returns for that term. Search results are usually produced by a series of scoring strategies and ranking algorithms. Apart from keyword factors, the main factor affecting the ranking of search results is the authority of the site (website).

[0003] Existing measures of authority mainly consider objective factors such as the hyperlink relationships of web pages, the degree of access by Internet users, and the authority level of the site itself. This way of using hyperlinks and other relationship...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 李战胜, 许恬菁, 林涛
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD