Method and equipment for searching by using word information entropy
A technology of search equipment and search method, applied in the field of computer network, to achieve the effect of accurate word information entropy value and improved accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0027] Embodiment 1 of the present application provides a method for determining word information entropy, the schematic diagram of which is as follows figure 1 shown, including the following steps:
[0028] Step 101: Receive multiple search requests input by the user, and determine the category to which each search request belongs.
[0029] In this embodiment, the search request input by the user is a short text containing only about 2-3 words on average.
[0030] This embodiment does not limit the solution for determining the category of the search request. Two available solutions are given below:
[0031] The first solution: use user behavior data to automatically mine the category of the search request.
[0032] In the web log (web log), the direct click behavior from the search request to the category is often disturbed by the page layout, and the data is relatively sparse. Therefore, an indirect method is needed to obtain the category to which the search request belon...
Embodiment 2
[0068] Embodiment 2 of the present application is based on Embodiment 1, using the determined word information entropy value of each word to search, such as figure 2 shown, including the following steps:
[0069] Step 201: According to a search request input by a user, determine whether there is a search result matching the search request.
[0070] In this step, if there is a search result that directly matches a search request input by the user, the corresponding search result is directly returned to the user; otherwise, step 202 is executed.
[0071] Step 202: According to the saved words and word information entropy values corresponding to each word, select at least one word whose word information entropy value is smaller than a set threshold value among the words obtained after word segmentation of the search request.
[0072] In this step, the previously received search requests can be grouped according to the category they belong to according to the method in Embodim...
Embodiment 3
[0081] The schemes of Embodiment 1 and Embodiment 2 of the present application will be described in detail below in conjunction with specific examples.
[0082] The specific implementation process of the third embodiment is as follows:
[0083] Step 1: Determine the category to which the input search request 1, search request 2... search request n belongs.
[0084] Assuming that the number of input search requests is 2, which are "new mobile phone" and "new dress", it is determined that the category of "new mobile phone" is "mobile phone", and the category of "new dress" is "skirt".
[0085] Step 2: Group search request 1 to search request n by category.
[0086] Classify "new phone" into group 1 and "new dress" into group 2.
[0087] Step 3: Determine the frequency information of search requests in each group, that is, determine D={, ...}, where: D represents a group, Q1 represents a search request, and QC1 represents The number of search requests identical to Q1 in group D....
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 