Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and equipment for searching by using word information entropy

A technology of search equipment and search method, applied in the field of computer network, to achieve the effect of accurate word information entropy value and improved accuracy

Active Publication Date: 2013-03-13
ALIBABA GRP HLDG LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] On the one hand, the present application also provides a search method to solve the problem of how to improve the accuracy of the search results when the search request does not have an exact matching search result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and equipment for searching by using word information entropy
  • Method and equipment for searching by using word information entropy
  • Method and equipment for searching by using word information entropy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] Embodiment 1 of the present application provides a method for determining word information entropy, the schematic diagram of which is as follows figure 1 shown, including the following steps:

[0028] Step 101: Receive multiple search requests input by the user, and determine the category to which each search request belongs.

[0029] In this embodiment, the search request input by the user is a short text containing only about 2-3 words on average.

[0030] This embodiment does not limit the solution for determining the category of the search request. Two available solutions are given below:

[0031] The first solution: use user behavior data to automatically mine the category of the search request.

[0032] In the web log (web log), the direct click behavior from the search request to the category is often disturbed by the page layout, and the data is relatively sparse. Therefore, an indirect method is needed to obtain the category to which the search request belon...

Embodiment 2

[0068] Embodiment 2 of the present application is based on Embodiment 1, using the determined word information entropy value of each word to search, such as figure 2 shown, including the following steps:

[0069] Step 201: According to a search request input by a user, determine whether there is a search result matching the search request.

[0070] In this step, if there is a search result that directly matches a search request input by the user, the corresponding search result is directly returned to the user; otherwise, step 202 is executed.

[0071] Step 202: According to the saved words and word information entropy values ​​corresponding to each word, select at least one word whose word information entropy value is smaller than a set threshold value among the words obtained after word segmentation of the search request.

[0072] In this step, the previously received search requests can be grouped according to the category they belong to according to the method in Embodim...

Embodiment 3

[0081] The schemes of Embodiment 1 and Embodiment 2 of the present application will be described in detail below in conjunction with specific examples.

[0082] The specific implementation process of the third embodiment is as follows:

[0083] Step 1: Determine the category to which the input search request 1, search request 2... search request n belongs.

[0084] Assuming that the number of input search requests is 2, which are "new mobile phone" and "new dress", it is determined that the category of "new mobile phone" is "mobile phone", and the category of "new dress" is "skirt".

[0085] Step 2: Group search request 1 to search request n by category.

[0086] Classify "new phone" into group 1 and "new dress" into group 2.

[0087] Step 3: Determine the frequency information of search requests in each group, that is, determine D={, ...}, where: D represents a group, Q1 represents a search request, and QC1 represents The number of search requests identical to Q1 in group D....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Determining and using word information entropies includes: determining one or more categories that correspond to a plurality of queries; sorting the plurality of queries into one or more groups based at least in part on the determined categories of the plurality of queries; segmenting queries that correspond to each of the one or more groups into a first plurality of phrases, wherein each phrase includes one or more words; determining occurrence probabilities for the plurality of phrases; and determining word information entropies for the plurality of phrases based at least in part on the determined occurrence probabilities.

Description

technical field [0001] The present application relates to the field of computer networks, in particular to a method and device for determining word information entropy, and a method and device for searching using the determined word information entropy. Background technique [0002] Search request (Query) is a unique short text in the search engine scenario. Users describe the information they want to retrieve through the search request, and the search engine retrieves the database through the information described in the search request and returns the results that the user wants. A search request initiated by a user consists of an average of 2.4 words (for example: silk dress, candy bar mobile phone). Generally, users use natural text as a search request instead of using statements such as and, or, and not, so search engines based on When retrieving the received search request, it is necessary to determine the user's intention to search according to the amount of informatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F17/30684G06F16/319G06F16/355G06F16/353G06F16/3331G06F16/951G06F16/35G06F16/3325G06F16/3344G06F16/90335
Inventor 金凯民
Owner ALIBABA GRP HLDG LTD