Method, device and server for obtaining similarity of key words

A keyword similarity acquisition method, applied in the field of information technology. It addresses the uneven proportion of documents in which keywords appear, similarity values that cannot accurately describe the connection between keywords, and the resulting low accuracy and success rate of recommended information. Its effect is to improve that accuracy and success rate.

Active Publication Date: 2014-10-08
SHENZHEN TENCENT COMP SYST CO LTD

AI Technical Summary

Problems solved by technology

[0005] The similarity between any two keywords depends entirely on their document frequency in the corpus, yet the ratio of the number of documents in which a keyword appears to the total number of documents in the corpus is extremely uneven. For example, two keywords may each appear in very few documents while appearing together in a high proportion of those documents. The resulting similarity therefore cannot accurately describe the connection between the two keywords, and information subsequently recommended to users has low accuracy and a low success rate.
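The skew described in [0005] can be illustrated with a small sketch. The visible text does not give the exact document-frequency formula, so the Jaccard-style ratio below (documents containing both keywords divided by documents containing either) is an assumption chosen only for illustration, and the corpus and keyword names are hypothetical.

```python
from typing import List, Set


def doc_frequency_similarity(docs: List[Set[str]], kw_a: str, kw_b: str) -> float:
    """Assumed stand-in measure: docs containing both / docs containing either."""
    has_a = {i for i, doc in enumerate(docs) if kw_a in doc}
    has_b = {i for i, doc in enumerate(docs) if kw_b in doc}
    either = has_a | has_b
    if not either:
        return 0.0
    return len(has_a & has_b) / len(either)


# Hypothetical corpus: each document is reduced to its set of keywords.
corpus: List[Set[str]] = (
    [{"music", "guitar"}] * 500    # common pair, often together
    + [{"music"}] * 300
    + [{"guitar"}] * 200
    + [{"theremin", "ondes"}] * 2  # rare pair, seen in only two documents
)

common_score = doc_frequency_similarity(corpus, "music", "guitar")
rare_score = doc_frequency_similarity(corpus, "theremin", "ondes")
print(common_score, rare_score)
```

Under this assumed measure the rare pair receives the maximum score from only two documents, while the well-attested pair scores lower, which is the kind of distortion the paragraph describes.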


Embodiment Construction

[0030] To make the objectives, technical solutions and advantages of the present invention clearer, embodiments of the present invention are described in further detail below with reference to the accompanying drawings.

[0031] Figure 1 is a flowchart of a method for acquiring keyword similarity provided by an embodiment of the present invention. Referring to Figure 1, the method in this embodiment is executed by a server and includes:

[0032] 101. Obtain user tag keywords and interest category keywords.

[0033] 102. Search the preset database according to the user tag keywords and interest category keywords, and obtain the word vector corresponding to each keyword in the user tag keywords and the word vector corresponding to each keyword in the interest category keywords. The preset database stores the correspondence between keywords and word vectors, and the word vectors are determined by the keywords and the keywords in the keyword contex...
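Steps 101 and 102 can be sketched as a simple lookup. This is a minimal illustration, assuming the preset database behaves like an in-memory mapping from keyword to word vector. The names `PRESET_DB` and `lookup_word_vectors`, the sample keywords, and the vector values are all hypothetical, and skipping keywords that are missing from the database is a choice made here, not something the source specifies.

```python
from typing import Dict, List

# Hypothetical stand-in for the preset database: keyword -> word vector.
PRESET_DB: Dict[str, List[float]] = {
    "basketball": [0.9, 0.1, 0.0],
    "football": [0.8, 0.2, 0.1],
    "sports": [0.85, 0.15, 0.05],
    "cooking": [0.0, 0.2, 0.9],
}


def lookup_word_vectors(
    keywords: List[str], db: Dict[str, List[float]]
) -> Dict[str, List[float]]:
    """Return {keyword: vector} for every keyword found in the database.

    Keywords absent from the database are skipped (an assumption).
    """
    return {kw: db[kw] for kw in keywords if kw in db}


# Step 101: obtain user tag keywords and interest category keywords.
user_tag_keywords = ["basketball", "football", "unknown-word"]
interest_category_keywords = ["sports", "cooking"]

# Step 102: look up the word vector for each keyword.
user_vectors = lookup_word_vectors(user_tag_keywords, PRESET_DB)
interest_vectors = lookup_word_vectors(interest_category_keywords, PRESET_DB)
print(user_vectors)
print(interest_vectors)
```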

Abstract

The invention discloses a method, device and server for obtaining keyword similarity, and belongs to the field of information technology. The method comprises: obtaining user tag keywords and interest category keywords; searching a preset database according to the user tag keywords and the interest category keywords to obtain the word vector of each keyword in the user tag keywords and the word vector of each keyword in the interest category keywords; computing, from these word vectors, the distance between the word vector of each keyword in the user tag keywords and the word vector of each keyword in the interest category keywords; and taking the distance between the word vector of a first keyword and the word vector of a second keyword as the similarity of the first keyword and the second keyword. Because word vectors are used to obtain keyword similarity, the accuracy of recommended information is improved.
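The pairwise computation described in the abstract might look like the sketch below. The visible text does not say which distance is used, so cosine similarity is an assumption here, and the function names and sample vectors are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def cosine_similarity(u: List[float], v: List[float]) -> float:
    """Cosine of the angle between u and v; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def pairwise_similarity(
    user_vectors: Dict[str, List[float]],
    interest_vectors: Dict[str, List[float]],
) -> Dict[Tuple[str, str], float]:
    """Score every (user tag keyword, interest category keyword) pair."""
    return {
        (tag, category): cosine_similarity(tag_vec, category_vec)
        for tag, tag_vec in user_vectors.items()
        for category, category_vec in interest_vectors.items()
    }


# Hypothetical two-dimensional word vectors.
user_vectors = {"basketball": [1.0, 0.0], "recipes": [0.0, 1.0]}
interest_vectors = {"sports": [1.0, 0.0], "food": [0.0, 2.0]}

scores = pairwise_similarity(user_vectors, interest_vectors)
for pair, score in scores.items():
    print(pair, round(score, 3))
```

Cosine similarity ignores vector length, so `recipes` and `food` score as identical in direction even though their magnitudes differ. A Euclidean distance would treat them differently, and the choice between the two is left open by the visible text.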

Description

Technical field

[0001] The present invention relates to the field of information technology, in particular to a method, device and server for acquiring keyword similarity.

Background technique

[0002] With the continuous development of information technology, how to recommend information to the users who are interested in it is an urgent problem to be solved. When recommending information to a user, it is generally necessary to obtain the similarity between the keywords in the user's tags and the keywords in the user's interest categories. Based on that similarity, the interest value of each keyword in the interest category is obtained, and information is recommended to the user according to the size of the interest value.

[0003] In the process of obtaining similarity, you can use the full text of Soso Encyclopedia and Q&A content as a corpus, use each entry in the full text of Encyclopedia or the content of Q&A in Q&A as a document, and count the keywords in user t...

Application Information

Patent Type & Authority Applications (China)
IPC IPC(8): G06F17/30, G06F17/27
CPC G06F16/2462
Inventor 汤煌
Owner SHENZHEN TENCENT COMP SYST CO LTD