Word meaning extracting method based on search interactive information and user search intention

A technology of interactive information and search intent, applied in the field of word meaning extraction based on search interaction information and user search intent, can solve the problems of limiting the text field, dividing the number of word meanings, and high manual labeling costs, so as to avoid poor results and avoid costs. effect of the problem

Active Publication Date: 2012-02-01
北京牡丹电子集团有限责任公司数字科技中心
View PDF2 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among the various methods of word sense disambiguation mentioned above, the human and material costs of manual labeling are very high, and the current word sense disambiguation does not have a method to divide the number of word senses based on user clicks
Faced with the current situation, many companies are also starting to provide personalized search services, but many are still stuck in theories with weak concepts and operability
[0006] There are various word meaning extraction methods in the prior art, but most of the existing methods are to analyze static texts, or manually mark static texts for processing. The former usually needs to limit the text domain, and the general domain The effect is not good; the cost of manual labeling in the latter is very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word meaning extracting method based on search interactive information and user search intention
  • Word meaning extracting method based on search interactive information and user search intention
  • Word meaning extracting method based on search interactive information and user search intention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to avoid the problem of data sparseness, the method of the present invention only uses high-frequency query keywords when processing query keywords, that is, only query keywords with the top K1 ranking in the query ranking frequency of all users for processing; similarly, different users The use frequency of each query keyword is high or low. To avoid accidental factors, only the top K2 query keywords of each user's own query frequency are selected for processing.

[0037] figure 1 Shown is a flowchart of the method of the present invention. The steps of the method of the present invention are as follows:

[0038] Step 1: Record the historical interaction information of each user. The historical interaction information includes query keywords, query time, and corresponding clicks. The query keywords are used to construct the query keyword vector, and the corresponding clicks are used to construct the corresponding click vector. , The query time is used to lock the c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a word meaning extracting method based on search interactive information and user search intention. The method comprises the following steps in order: recording historic interactive information of each user; using the query keywords in the first K1 rank of the user query frequency to construct a query keyword vector; using corresponding click results obtained after querying the query keywords in the first K2 ranks of the query frequency of each user to construct a corresponding click vector, and setting items corresponding to the query keywords in the first K2 ranks ofthe query frequency of each user in the query keyword vector as 1, and setting other items as zero; extracting a meaning item number of anyone of high-frequency query keywords; clustering the users; and computing the preference rank of the meaning item corresponding to each query keyword of users in the same class. The method can avoid the cost problem caused by manual marking; meanwhile, the problems that the derivative free method has poor effect and is limited by the field. A personalized search service can be provided for a single user according to the analysis result obtained by the method provided by the invention.

Description

[0001] technical field [0002] The invention belongs to the technical field of information retrieval and word meaning disambiguation, and in particular relates to a word meaning extraction method based on search interaction information and user search intention. Background technique [0003] In recent years, the research and application of information retrieval and word sense disambiguation technology are very common, but the research and application of the combination of information retrieval and word sense disambiguation technology is less. [0004] Since the establishment of Google in 1998, information retrieval has gradually become a mainstream technology. Initially, information retrieval provided manual-edited directory retrieval, and the typical company was Yahoo. However, with the explosion of Internet information, manual editing can no longer meet the needs of users. Since then, automatic processing by machines has increasingly become the mainstream. Nowadays, all ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
Inventor 姬东鸿孙程吕晨滕冲
Owner 北京牡丹电子集团有限责任公司数字科技中心
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products