Automatic mining method for demand identification template, demand identification method and corresponding device

An automatic mining and demand technology, applied in the computer field, can solve the problems of low query recall rate, consumption of human resources, low efficiency of demand identification templates, etc., and achieve the effect of improving recall rate and saving human resources.

Active Publication Date: 2016-06-15
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] First, it consumes human resources, and the efficiency of establishing demand identification templates is low
[0005] Second, the recall rate for queries is low, that is to say, the number of queries that can be covered is limited, and the scope of application is narrow

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic mining method for demand identification template, demand identification method and corresponding device
  • Automatic mining method for demand identification template, demand identification method and corresponding device
  • Automatic mining method for demand identification template, demand identification method and corresponding device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0068] figure 1 The flow chart of the mining method for the requirement identification template provided by Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes the following steps:

[0069] Step 101: Determine the query set corresponding to when a preset type of webpage is clicked in the search log.

[0070] In this step, the following two methods can be used:

[0071] The first method: After the webpage type is determined by the existing webpage type identification method, webpages of the preset type are collected, and all queries corresponding to these webpages are determined in the search log to form a query set when they are clicked.

[0072] Among them, the web page classification method based on text features, or the method of calculating the similarity between the web page text feature vector and the feature vector of the preset type can be used to determine the type of each web page in the search log, and then collect the pre-set...

Embodiment 2

[0106] figure 2 The flow chart of the demand identification method provided by Embodiment 2 of the present invention, such as figure 2 As shown, the method includes the following steps:

[0107] Step 201: Match the query to be recognized with each preset type of dictionaries, replace the words in the query to be recognized that match the dictionary with the attribute tags of the corresponding words in the dictionary, and obtain the semantic annotation of the query to be recognized.

[0108] The query to be identified may be a query input by a user on a search interface provided by a search engine.

[0109] In this step, the replacement of the attribute tags for the query to be recognized is similar to the replacement of the attribute tags for the seed query in step 103 of the first embodiment, the difference is that the query to be recognized needs to be matched with the dictionary of each preset type.

[0110] For example: for the query to be identified "China Food Market...

Embodiment 3

[0122] image 3 The structural diagram of the automatic excavation device of the demand identification template provided by Embodiment 3 of the present invention, as shown in image 3 As shown, the apparatus may include: a first selection unit 301 , a second selection unit 302 , a marker replacement unit 303 and a template determination unit 304 .

[0123] The first selection unit 301 determines a set of queries corresponding to when a webpage of a preset type is clicked in the search log.

[0124] Specifically, the query set can be determined in the following two ways:

[0125] The first way: the first selection unit 301 determines the type of the webpage in the search log, collects the webpages of the preset type, and determines that all queries corresponding to the webpage of the preset type are clicked to form a query set. When determining the type of web pages in the search log, a web page classification method based on text features, or an existing method such as calcu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an automatic mining method of a requirement identification template, a requirement identification method and a corresponding device. The automatic mining method comprises the following steps of: determining a corresponding query set when a webpage of a preset type is clicked in a search log; selecting a query that the total clicking number of the webpage of the corresponding preset type exceeds a preset frequency threshold and / or the clicking rate of the webpage of the corresponding preset type exceeds the preset clicking rate threshold from the query set; taking the selected query as a seed query of the preset type; matching each seed query with a dictionary of the preset type respectively; substituting words in the seed query matched with the dictionary into attribute marks of the corresponding words in the dictionary to obtain a template set of the preset type; and determining the requirement identification template of the preset type by using the template set of the preset type. According to the automatic mining method, the manpower resource can be saved, the query range capable of being covered by search identification is expanded, and the recall rate is improved.

Description

【Technical field】 [0001] The invention relates to the field of computer technology, in particular to an automatic mining method for a demand recognition template, a demand recognition method and a corresponding device. 【Background technique】 [0002] With the rapid development and maturity of the Internet on a global scale, the information resources on the network are constantly enriched, and the amount of information data is also expanding rapidly. Obtaining information through search engines has become the main way for modern people to obtain information. In order to provide users with more convenient and accurate query services is the current and future development direction of search engine technology. [0003] In search engine technology, identifying the user's search needs is an important part of improving search accuracy and effectiveness, especially in structured search (ie, vertical search). For example, when the user enters the query "how to do the bus from Baidu ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 黄际洲柴春光
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products