Automatic classification system and automatic classification method for Web comment viewpoint on the basis of association rule

An automatic classification technology based on association rules, applied in the field of semantic processing

Active Publication Date: 2013-12-25
珠海市颢腾智胜科技有限公司

AI Technical Summary

Problems solved by technology

Opinion classification is less accurate than ordinary topic classification because opinion texts on the Web express human emotion. They are a very particular kind of content, and they are semantically more ambiguous than objective descriptive texts. Positive words can be used ironically, and negative words can likewise carry a positive meaning. Such patterns are difficult for statistical learning methods to judge.

Examples

Embodiment Construction

[0031] In order to better understand the present invention, some basic concepts are firstly explained.

[0032] Confidence: Confidence indicates how likely item B is to appear given that item A appears. If the confidence is 100%, B always accompanies A, so in a retail setting A and B can be bundled for sale. If the confidence is very low, the presence of A says little about whether B is present.

[0033] Support: Support is the probability that item A and item B appear together. If that probability is small, A and B have little relationship; if A and B appear together very frequently, they are strongly related.
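The two definitions above can be illustrated with a minimal sketch. It assumes each comment is represented as a set of tokens that also contains its class label; the function names `support` and `confidence` and the sample data are hypothetical and are not taken from the patent.

```python
from typing import FrozenSet, Iterable, List


def support(transactions: List[FrozenSet[str]], items: Iterable[str]) -> float:
    """Fraction of transactions that contain every item in `items`."""
    wanted = frozenset(items)
    hits = sum(1 for t in transactions if wanted <= t)
    return hits / len(transactions)


def confidence(
    transactions: List[FrozenSet[str]],
    antecedent: Iterable[str],
    consequent: Iterable[str],
) -> float:
    """Among transactions containing the antecedent, the fraction that also contain the consequent."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    n_a = sum(1 for t in transactions if a <= t)
    if n_a == 0:
        return 0.0  # the antecedent never occurs, so the rule has no evidence
    n_both = sum(1 for t in transactions if both <= t)
    return n_both / n_a


# Hypothetical toy data: words of a comment plus its class label (POS / NEG).
comments = [
    frozenset({"good", "cheap", "POS"}),
    frozenset({"good", "fast", "POS"}),
    frozenset({"good", "broken", "NEG"}),
    frozenset({"slow", "broken", "NEG"}),
]

sup = support(comments, {"good", "POS"})         # 2 of 4 comments
conf = confidence(comments, {"good"}, {"POS"})   # 2 of the 3 comments containing "good"
```

Here the rule "good" → POS has support 0.5 and confidence 2/3, which shows why both measures are needed: a rule can be frequent without being reliable.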

[0034] AD-Sup: AD-Sup can be regarded as a mean square error of support. In the above formula, Sup(t)_i is the support count of term t in category i, Sup(FS_i)_j is the local support count of FS_i in category j, and Ave(Sup(FS_i)) is Sup(...
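Since the formula itself is not reproduced in this excerpt, the following is only a rough sketch of the idea of a "mean square error of support". It assumes AD-Sup behaves like the mean squared deviation of a frequent set's per-category support counts around their average; the name `ad_sup_sketch` and the sample counts are hypothetical, and the patent's actual formula may differ.

```python
from typing import Sequence


def ad_sup_sketch(local_supports: Sequence[float]) -> float:
    """Mean squared deviation of per-category support counts (an assumed reading of AD-Sup)."""
    if not local_supports:
        raise ValueError("at least one category is required")
    mean = sum(local_supports) / len(local_supports)
    return sum((s - mean) ** 2 for s in local_supports) / len(local_supports)


# Hypothetical support counts of one frequent set in three categories.
even = ad_sup_sketch([10, 10, 10])   # identical counts in every category
skewed = ad_sup_sketch([28, 1, 1])   # concentrated in a single category
```

Under this reading, a frequent set that is spread evenly across categories scores 0, while one concentrated in a single category scores high and would therefore be the more useful feature for telling categories apart.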

Abstract

The invention discloses an automatic classification system and an automatic classification method for Web comment viewpoints based on association rules. The system comprises four modules: a frequent feature word extraction module, a frequent feature word optimization module, an association rule extraction and mining module, and a viewpoint classification module. The invention overcomes the defects of traditional systems (such as those based on machine learning or sentiment classification), which have low precision, require extensive manual work, or depend excessively on natural language processing and professional knowledge. The association rule set is optimized during extraction: redundant association rules with a low degree of distinction are removed, which improves the operating efficiency of the whole system and the speed at which results are obtained. The invention provides a precise and convenient solution for scenarios such as e-commerce product reviews, e-government feedback, and surveys of netizen opinion.
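The four-module flow described in the abstract can be sketched roughly as follows. This is a simplified illustration and not the patented method: it assumes single-word rules of the form word → class, uses a plain confidence threshold as a stand-in for the optimization and pruning steps, and all names (`frequent_words`, `mine_rules`, `classify`) and data are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Set, Tuple

Comment = Tuple[Set[str], str]      # (words in the comment, class label)
Rules = Dict[Tuple[str, str], float]  # (word, label) -> confidence


def frequent_words(data: List[Comment], min_count: int) -> Set[str]:
    """Keep words that occur in at least `min_count` comments."""
    counts = Counter(w for words, _ in data for w in words)
    return {w for w, c in counts.items() if c >= min_count}


def mine_rules(data: List[Comment], vocab: Set[str], min_conf: float) -> Rules:
    """Mine word -> label rules; drop rules whose confidence is below `min_conf`."""
    word_total: Counter = Counter()
    word_label: Counter = Counter()
    for words, label in data:
        for w in words & vocab:
            word_total[w] += 1
            word_label[(w, label)] += 1
    rules: Rules = {}
    for (w, label), n in word_label.items():
        conf = n / word_total[w]
        if conf >= min_conf:  # assumed stand-in for removing low-distinction rules
            rules[(w, label)] = conf
    return rules


def classify(words: Set[str], rules: Rules, default: str) -> str:
    """Sum the confidences of all matching rules per label and pick the best label."""
    scores: Dict[str, float] = defaultdict(float)
    for (w, label), conf in rules.items():
        if w in words:
            scores[label] += conf
    return max(scores, key=scores.get) if scores else default


# Hypothetical training comments.
train: List[Comment] = [
    ({"great", "battery"}, "positive"),
    ({"great", "screen"}, "positive"),
    ({"poor", "battery"}, "negative"),
    ({"poor", "screen"}, "negative"),
]

vocab = frequent_words(train, min_count=2)
rules = mine_rules(train, vocab, min_conf=0.8)
label = classify({"great", "battery"}, rules, default="neutral")
```

In this toy data, "battery" and "screen" occur equally often in both classes, so their rules have confidence 0.5 and are pruned; only "great" → positive and "poor" → negative survive and drive the classification.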

Description

Technical field

[0001] The invention relates to an automatic classification system and classification method of Web comments based on association rules, and belongs to the technical field of semantic processing.

Background technique

[0002] Traditional text opinion classification methods include opinion classification based on machine learning and opinion classification based on sentiment analysis.

[0003] Machine learning-based methods use machine learning algorithms in text classification directly for opinion classification, and the accuracy of opinion classification tasks is usually lower than that of text classification tasks for other categories of topics. The reason is that opinion texts on the Web involve the expression of human emotions, and are text content with a very special theme, and their semantic obscurity is higher than that of objective descriptive texts. There are positive words that express irony, and the opposite also exists. These special patterns are...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 袁满, 欧阳元新, 皇甫垚, 熊璋
Owner: 珠海市颢腾智胜科技有限公司