Method and equipment for determining word information entropy and searching by using word information entropy
A technology of information entropy and information entropy value, which is applied in the field of computer networks, can solve the problem of deviation in the determination of word information entropy, and achieve the effect of improving accuracy and accurate word information entropy value
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] Embodiment 1 of the present application provides a method for determining word information entropy, the schematic diagram of which is as follows figure 1 shown, including the following steps:
[0041] Step 101: Receive multiple search requests input by the user, and determine the category to which each search request belongs.
[0042] In this embodiment, the search request input by the user is a short text containing only about 2-3 words on average.
[0043] This embodiment does not limit the solution for determining the category of the search request. Two available solutions are given below:
[0044] The first solution: use user behavior data to automatically mine the category of the search request.
[0045] In the web log (web log), the direct click behavior from the search request to the category is often disturbed by the page layout, and the data is relatively sparse. Therefore, an indirect method is needed to obtain the category to which the search request belon...
Embodiment 2
[0081] Embodiment 2 of the present application is based on Embodiment 1, using the determined word information entropy value of each word to search, such as figure 2 shown, including the following steps:
[0082] Step 201: According to a search request input by a user, determine whether there is a search result matching the search request.
[0083] In this step, if there is a search result that directly matches a search request input by the user, the corresponding search result is directly returned to the user; otherwise, step 202 is executed.
[0084] Step 202: According to the saved words and word information entropy values corresponding to each word, select at least one word whose word information entropy value is smaller than a set threshold value among the words obtained after word segmentation of the search request.
[0085] In this step, the previously received search requests can be grouped according to the category they belong to according to the method in Embodim...
Embodiment 3
[0094] The schemes of Embodiment 1 and Embodiment 2 of the present application will be described in detail below in conjunction with specific examples.
[0095] The specific implementation process of the third embodiment is as follows:
[0096] Step 1: Determine the category to which the input search request_1, search request_2...search request_n belongs.
[0097] Assuming that the number of input search requests is 2, which are "new mobile phone" and "new dress", it is determined that the category of "new mobile phone" is "mobile phone", and the category of "new dress" is "skirt".
[0098] Step 2: Group search request 1 to search request n by category.
[0099] Classify "new phone" into group 1 and "new dress" into group 2.
[0100] Step 3: Determine the frequency information of the search request in each group, that is, determine D={, ...}, where: D represents a group, and Q1 represents a search request, QC1 indicates the number of search requests identical to Q1 in group...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 