Method and device for excavating search log and page search method and device

A timeliness and log technology, applied in the Internet field, can solve the problems of users being unable to find, understand, and identify the timeliness needs of users.

Active Publication Date: 2014-08-13
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the existing search technology, it is impossible to identify the timeliness requirement of the query entered by the user. For example, the user wants to obtain relevant information about an event that just happened, but the search engine will not understand the timeliness requirement of the user. The returned search results are only based on the previous search history, and the search results are sorted according to the preset weights of each attribute. Users may not be able to quickly and accurately find the desired page from the search results.
For example, if a user wants to obtain network information about the recent explosion in Hebei, he enters the query of "Hebei explosion". Since the event has just occurred and there are few network resources, in the search results, the information about the recent explosion in Hebei The page may be submerged in the massive pages of historical events related to the Hebei explosion, and users cannot quickly and accurately find the desired page from the search results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for excavating search log and page search method and device
  • Method and device for excavating search log and page search method and device
  • Method and device for excavating search log and page search method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0059] figure 1 The flow chart of the method for mining search logs provided by the present invention, such as figure 1 As shown, the method can include the following steps:

[0060] Step 101: Perform word segmentation processing on the query captured from the search log.

[0061] When fetching a query from the search log, the fetching strategy can adopt one or any combination of the following strategies:

[0062] Fetching strategy 1: Fetch queries in which the proportion of pages clicked by the user within the first time period in the most recent first time period in the search results exceeds the preset first proportion threshold. For example, suppose that the most recent first time period is within the last 2 days, and the preset first ratio threshold is 50%. If a query’s search results are published within the last 2 days, pages that are If the percentage of clicks on the total page is 70%, the query can be captured. For another example, if the publication time of the page clic...

Embodiment 2

[0092] figure 2 The flow chart of the page search method provided by the present invention, such as figure 2 As shown, the method can include the following steps:

[0093] Step 201: Perform word segmentation processing on the query input by the user.

[0094] Step 202: Utilize the combination of each word and / or the attribute of each word obtained after word segmentation processing, and the distribution probability of each combination to summarize the type corresponding to the query.

[0095] The processing method of the query input by the user in step 201 to step 202 is the same as the processing method of the captured query in step 101 to step 102, and will not be repeated here.

[0096] Step 203: Look up the timeliness probability table, and determine the timeliness probability corresponding to the type summarized in step 202.

[0097] Step 204: If the highest value of the determined timeliness probability exceeds the preset timeliness probability threshold, it is determined that t...

Embodiment 3

[0110] image 3 It is a structural diagram of a search log mining device provided by an embodiment of the present invention, such as image 3 As shown, the mining device may include: a grabbing unit 300, a first word segmentation unit 310, a first type determination unit 320, a screening unit 330, and a probability calculation unit 340.

[0111] The grabbing unit 300 is used to grab the query from the search log.

[0112] The first word segmentation unit 310 is configured to perform word segmentation processing on the query captured by the capture unit 300.

[0113] The word segmentation processing method adopted by the first word segmentation unit 310 may include, but is not limited to: a word segmentation method for string matching, a word meaning word segmentation method, and a statistical word segmentation method.

[0114] The first type determining unit 320 is configured to use the combination of each word and / or attribute composition of each word and the distribution probability ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a search log excavating method, a timeliness requirement identification method and a corresponding device. By the method for excavating the search log, the timeliness probabilities of types corresponding to queries can be counted and can reflect the timeliness requirements of the queries, so that whether a query input by a user has the timeliness requirement or not is identified and a search result corresponding to the query input by the user is optimized when the query has the timeliness requirement in the page search method, namely the sorting weight of a time attribute in the search result is improved; therefore, the user can quickly and accurately find the required page from the search result, and the timeliness requirement of the user on the search result is met.

Description

【Technical Field】 [0001] The invention belongs to the field of Internet technology, and specifically relates to a method for mining search logs, a method for identifying timeliness requirements, and a corresponding device. 【Background technique】 [0002] With the continuous development of Internet technology and the continuous expansion of information, people's demand for the use of network information is increasing, and search engines have become an important tool for people to obtain network information. When the user enters a search term (query), the search engine usually includes the page containing the search term in the search results and returns it to the user. [0003] However, in the existing search technology, it is impossible to identify the timeliness requirements of the query entered by the user. For example, the user wants to obtain relevant information about the recent event, but the search engine does not understand the timeliness requirements of the user. The retu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 辜斯缪
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products