Label extracting method and device

A label extraction and labeling technology, applied in the field of information processing, that addresses the short text sparsity problem which existing algorithms do not handle well, with the effects of avoiding dependence and improving accuracy.

Inactive Publication Date: 2016-05-25
TCL CORPORATION
5 Cites, 19 Cited by

AI Technical Summary

Problems solved by technology

[0007] In view of this, the embodiment of the present invention provides a label extraction method and device that address the inability of existing label extraction algorithms to handle short text sparsity well, and that improve both the accuracy of computing the similarity between commodity evaluations and the accuracy of product review mining.


Examples


Embodiment Construction

[0027] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0028] In the embodiment of the present invention, multiple pieces of evaluation information for a commodity are obtained, and the candidate labels in each piece of evaluation information are extracted according to preset label grammar rules. Topic analysis is then performed on each candidate label with the latent Dirichlet allocation (LDA) model to obtain the topic probability distribution corresponding to each candidate label, where the topic probability distribution includes the probability that the candidate label belongs to each specified topic. The candidate label set corresponding to each specified...
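The rule-based extraction step can be pictured with a minimal sketch. The source does not say what the preset label grammar rules are, so the part-of-speech patterns in `RULES`, the tag names, and the function `extract_candidates` are all hypothetical, and the review is assumed to be tokenized and tagged already.

```python
from typing import List, Tuple

# (word, part-of-speech tag); tagging is assumed to happen upstream
Token = Tuple[str, str]

# Hypothetical grammar rules: each rule is a sequence of POS tags that,
# when matched in order, yields one candidate label.
RULES: List[Tuple[str, ...]] = [
    ("NOUN", "ADJ"),         # e.g. "price fair"
    ("NOUN", "ADV", "ADJ"),  # e.g. "screen very clear"
]


def extract_candidates(tokens: List[Token], rules=RULES) -> List[str]:
    """Return every token span whose POS sequence matches one of the rules."""
    candidates = []
    for start in range(len(tokens)):
        for rule in rules:
            span = tokens[start:start + len(rule)]
            if len(span) != len(rule):
                continue  # not enough tokens left for this rule
            if all(tag == want for (_, tag), want in zip(span, rule)):
                candidates.append(" ".join(word for word, _ in span))
    return candidates


# Made-up review, already tokenized and tagged
review = [("screen", "NOUN"), ("very", "ADV"), ("clear", "ADJ"),
          ("and", "CONJ"), ("price", "NOUN"), ("fair", "ADJ")]
print(extract_candidates(review))
```

Keeping the rules as plain data means the rule set can be swapped without touching the matching loop, which fits the idea of rules that are preset rather than hard-coded.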

Abstract

The invention belongs to the technical field of information processing, and provides a label extraction method and device. The label extraction method comprises the following steps: obtaining a plurality of pieces of evaluation information for a commodity; extracting the candidate labels in each piece of evaluation information according to a preset label grammar rule; performing topic analysis on each candidate label with the latent Dirichlet allocation (LDA) model to obtain the topic probability distribution corresponding to each candidate label, wherein the topic probability distribution comprises the probability that the candidate label belongs to each specified topic; and determining the candidate label set corresponding to each specified topic according to the topic probability distribution, and determining the representative label corresponding to each specified topic according to the weight of each candidate label in the candidate label set. The method and device address the problem that existing label extraction algorithms do not adequately handle short text sparsity, and improve the accuracy of commodity evaluation similarity calculation and of commodity evaluation mining.
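The last two steps of the abstract (grouping candidate labels by specified topic, then choosing a representative label by weight) might look like the sketch below. It makes several assumptions that are not in the source: the topic probability distributions are taken as given (for example from any LDA implementation), each candidate is assigned to its most probable topic, and the weight is a stand-in defined as occurrence count times topic probability. The function names and all numbers are illustrative only.

```python
from collections import defaultdict
from typing import Dict, List


def group_by_topic(topic_probs: Dict[str, List[float]]) -> Dict[int, List[str]]:
    """Assign each candidate label to the topic where its probability is highest
    (an assumed assignment rule; the source does not specify one)."""
    groups = defaultdict(list)
    for label, probs in topic_probs.items():
        best_topic = max(range(len(probs)), key=probs.__getitem__)
        groups[best_topic].append(label)
    return dict(groups)


def representative_labels(topic_probs: Dict[str, List[float]],
                          counts: Dict[str, int]) -> Dict[int, str]:
    """Pick, per topic, the candidate label with the largest weight.
    Weight is an assumed stand-in: occurrence count x topic probability."""
    result = {}
    for topic, labels in group_by_topic(topic_probs).items():
        result[topic] = max(
            labels, key=lambda lab: counts[lab] * topic_probs[lab][topic]
        )
    return result


# Made-up distributions over two specified topics (index 0 and index 1)
topic_probs = {
    "screen clear": [0.9, 0.1],
    "display sharp": [0.8, 0.2],
    "price fair": [0.2, 0.8],
    "cheap": [0.3, 0.7],
}
# Made-up occurrence counts of each candidate label across the reviews
counts = {"screen clear": 12, "display sharp": 5, "price fair": 7, "cheap": 9}
print(representative_labels(topic_probs, counts))
```

With these made-up numbers, "screen clear" and "display sharp" fall under the same topic even though they share no words, which is the kind of grouping a topic model is meant to provide for short texts.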

Description

Technical field

[0001] The invention belongs to the technical field of information processing, and in particular relates to a label extraction method and device.

Background

[0002] Online shopping malls offer products ranging from small daily necessities to large, expensive home appliances, which greatly saves consumers' shopping time. When shopping online, consumers learn about a product's overall quality and how it performs in use mainly through product reviews. As the number of reviews grows, consumers spend more and more time and energy browsing them, so it is necessary to mine product reviews.

[0003] However, consumers' comments on commodities are generally short and concise, so labeling these comments falls into the category of short text mining. Existing label extraction algorithms, such as those based on TF*IDF, information gain, and chi-square selection, all have the following deficiencies: [0004...
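The short text sparsity problem that the document raises against term-based methods such as TF*IDF can be illustrated with a small sketch. The reviews are made up, the functions `tfidf_vectors` and `cosine` are hypothetical helpers, and the weighting uses one common TF*IDF variant (relative term frequency times log(N/df)); none of this comes from the source.

```python
import math
from collections import Counter
from typing import Dict, List


def tfidf_vectors(docs: List[List[str]]) -> List[Dict[str, float]]:
    """Compute TF*IDF weights per document, with idf = log(N / df)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm = (math.sqrt(sum(w * w for w in a.values()))
            * math.sqrt(sum(w * w for w in b.values())))
    return dot / norm if norm else 0.0


# Made-up short reviews; the first two express a similar opinion
reviews = [
    ["screen", "clear"],
    ["display", "sharp"],
    ["price", "fair"],
]
vecs = tfidf_vectors(reviews)
print(cosine(vecs[0], vecs[1]))  # the two reviews share no terms
```

Because the first two reviews have no term in common, their term-weight vectors do not overlap and the similarity is zero, even though a reader would consider them close in meaning.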


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F16/951, G06F40/30
Inventor: 吴成龙
Owner: TCL CORPORATION