Method and device for excavating badcase of search engine

A search engine and confidence technology, applied in special data processing applications, instruments, electrical and digital data processing, etc., can solve problems such as failure to timely and accurately find badcase, low efficiency, etc., and achieve the effect of improving efficiency and accuracy
CN103577464AActive Publication Date: 2014-02-12BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Publication Date
2014-02-12

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a method and a device for excavating a badcase (badcase) of a search engine, wherein the method comprises the following steps of a preprocessing procedure: extracting a certain number of sessions as samples from a session (session) log, and extracting a feature vector describing the search quality from each session of the samples; clustering the samples by utilizing the feature vector of each session; determining confidence coefficient of each category obtained by clustering the samples, wherein the confidence coefficient represents the low degree of the search quality; an excavating procedure: determining an action sequence in the same query in a session log to be excavated, and extracting a feature vector describing the search quality from the action sequence; determining the category of the query by computing the distance between the feature vector of the query and the feature vector of each category; if the confidence coefficient of the category of the query is beyond a preset high threshold, determining that the search engine has the badcase to the query. According to the method and device for excavating the badcase of the search engine, which are disclosed by the invention, the automatic excavation of the badcase of the search engine can be realized, so that the badcase of the search engine can be timely and exactly found out.
Need to check novelty before this filing date? Find Prior Art

Description

【Technical field】

[0001] The invention relates to the technical field of computer applications, in particular to a method and device for mining badcases of search engines. 【Background technique】

[0002] With the continuous development of computer technology, the network has become the main channel for people to obtain information. Among them, the search engine can understand the user's query needs and intentions through analysis, and search for the webpage that best matches the user's query within the entire network. However, due to the vast amount of web pages on the Internet, the content of web pages varies greatly, and the expressions of user needs are also diverse. Therefore, the biggest difficulty for search engines is to be able to return the most relevant search results regardless of the user's query. result.

[0003] The interior of the search engine is composed of many complex coupled correlation strategies, the number and complexity of which, as well as the mutu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More