Entity word mining method, information recommendation method and device

A technology of entity words and corresponding entities, which is applied in special data processing applications, instruments, electrical digital data processing, etc.

Active Publication Date: 2017-01-04
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF7 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The present invention provides an entity word mining method, information recommendation method and device to sol

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity word mining method, information recommendation method and device
  • Entity word mining method, information recommendation method and device
  • Entity word mining method, information recommendation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0089] The present invention provides an entity word mining method, information recommendation method and device to solve the technical problem in the prior art that user interest features can only be obtained through manual marking.

[0090] The technical solution in the embodiment of the present application is to solve the above-mentioned technical problems, and the general idea is as follows:

[0091] First, M feature words are obtained from the feature word corpus, and M is a positive integer; then the scarcity of each feature word in the M feature words, the distribution between categories of each feature word, and the class of each feature word are calculated. The number of occurrences within; finally, based on the degree of scarcity, the distribution between categories, and the number of occurrences within a category, N1 of the M feature words are determined as entity words, and N1 is a positive integer. That is to say, the scheme realizes the automatic mining mechanism...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of data mining, and discloses an entity word mining method, an information recommendation method and device in order to solve the technical problem existing in the prior art that user interest characteristics can be obtained only through a manual marking manner. The entity word mining method comprises that a number M of characteristic words are obtained from a characteristic word corpus, and M is a positive integer; the scarcity degree, the between-class distribution and the within-class frequency of occurrence of each characteristic word in the number M of characteristic words are calculated; based on the scarcity degree, the between-class distribution and the within-class frequency of occurrence, a number N1 of characteristic words in the number M of characteristic words are determined to be entity words, and N1 is a positive integer. The user interest characteristics can be determined with no need of the manual marking manner.

Description

technical field [0001] The invention relates to the field of data mining, in particular to a method for mining entity words, an information recommendation method and a device. Background technique [0002] In the past ten years, the development of personalization has been in full swing, and the reason is very simple-the irreconcilable contradiction between the explosive growth of information on the Internet and the limited information needs of people is becoming more and more intense. Then personalized recommendation came into being and was applied to various fields: shopping, news reading and even various application apps (Application: application program) and so on. Among them, the personalized recommendation refers to that the computer recommends to the user the information that the user most wants to see at this moment through various technical means. [0003] In the prior art, in order to determine the user's interest characteristics, a tag library is often established...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/9535
Inventor 商胜
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products