Method and system for sentiment classification of Chinese comment text based on ensemble learning

An ensemble learning and sentiment classification technique, applied to sentiment classification methods and to systems built on them. It addresses three problems: the fuzzy or probabilistic uncertainty of classifier output is not considered quantitatively; the degree to which the classifier output indicates that a sample does not belong to a category is ignored; and a classifier cannot capture the role of emerging vocabulary. The stated benefits are a good recognition rate and stability, fast training, and reduced requirements for initial corpus preparation.

Active Publication Date: 2014-08-06
钱钢

AI Technical Summary

Problems solved by technology

This largely limits the scope of application of the technology.
[0006] (2) Although ensemble learning can mitigate the instability of a single classifier, traditional ensemble learning methods consider only each classifier's degree of support for a sample's category. They ignore the degree to which the classifier output indicates that the sample does not belong to the category, and they do not quantitatively account for the fuzzy or probabilistic uncertainty of the classifier output.
However, corpus preparation is difficult.
More importantly, a classifier trained only once cannot capture the role that emerging vocabulary plays in expressing emotion.

Examples

Detailed Description of the Embodiments

[0030] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0031] The ensemble-learning-based sentiment classification method for Chinese comment text of the present invention is shown in Figure 1 and includes the following steps:

[0032] Step 101: Obtain Chinese comment text from the network and preprocess it;

[0033] Step 102: Train the base classifiers of the multi-classifier system in parallel, using a sequential learning strategy;

[0034] Step 103: Classify the Chinese comment text to be classified with each base classifier, and convert each classification output into an intuitionistic fuzzy number;

[0035] Step 104: Combine the weights of the base classifiers with the guide variables, fuse the sentiment tendencies obtained for the Chinese comment text to be classified, and make a classification decision.
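Steps 103 and 104 can be illustrated with a minimal Python sketch. It is not the patented procedure: this excerpt does not give the conversion rule or define the guide variables, so the sketch makes its own assumptions. Each base classifier is assumed to output a probability of the positive class together with a hypothetical reliability value in [0, 1]; the conversion in `to_ifn` is an illustrative choice; and the fusion uses the standard intuitionistic fuzzy weighted averaging (IFWA) operator with base-classifier weights only, omitting the guide variables. All names (`IFN`, `to_ifn`, `ifwa`, `classify`) are hypothetical.

```python
from dataclasses import dataclass
from math import prod


@dataclass(frozen=True)
class IFN:
    """Intuitionistic fuzzy number: membership mu, non-membership nu, mu + nu <= 1."""
    mu: float
    nu: float

    def __post_init__(self):
        in_range = 0.0 <= self.mu <= 1.0 and 0.0 <= self.nu <= 1.0
        if not in_range or self.mu + self.nu > 1.0 + 1e-9:
            raise ValueError("need 0 <= mu, nu <= 1 and mu + nu <= 1")

    @property
    def hesitancy(self) -> float:
        # The part of the evidence assigned to neither class.
        return 1.0 - self.mu - self.nu

    @property
    def score(self) -> float:
        # Score function mu - nu: positive values favour the class.
        return self.mu - self.nu


def to_ifn(p_positive: float, reliability: float) -> IFN:
    """ASSUMED conversion (not from the source): scale the class probabilities
    by the classifier's reliability and leave the remainder as hesitancy."""
    return IFN(reliability * p_positive, reliability * (1.0 - p_positive))


def ifwa(ifns, weights) -> IFN:
    """Intuitionistic fuzzy weighted averaging over the base classifiers."""
    total = sum(weights)
    w = [x / total for x in weights]  # normalise the weights to sum to 1
    mu = 1.0 - prod((1.0 - a.mu) ** wi for a, wi in zip(ifns, w))
    nu = prod(a.nu ** wi for a, wi in zip(ifns, w))
    return IFN(mu, nu)


def classify(p_positives, reliabilities, weights):
    """Fuse the base-classifier outputs and decide by the sign of the score."""
    ifns = [to_ifn(p, r) for p, r in zip(p_positives, reliabilities)]
    fused = ifwa(ifns, weights)
    label = "positive" if fused.score > 0 else "negative"
    return label, fused
```

For example, `classify([0.9, 0.8, 0.3], [0.9, 0.8, 0.6], [0.4, 0.4, 0.2])` fuses three base classifiers, two of which lean positive. Unlike a plain weighted vote, the fused result keeps the non-membership degree and the hesitancy as separate quantities, which is the kind of information the background section says traditional ensembles discard.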

[0036] Each detail of the present invention is described further below.

[0037] Sentiment classification of ...

Abstract

The invention relates to the field of pattern recognition and discloses a method for sentiment classification of Chinese comment text based on ensemble learning, together with a system based on the method. The method comprises the following steps: (a) acquiring Chinese comment text from web pages and preprocessing it; (b) training a multi-classifier system in parallel using sequential learning; (c) classifying the comment text to be classified with each base classifier and converting each classification output into an intuitionistic fuzzy number; and (d) combining the weights of the base classifiers with the guide variables, fusing the sentiment tendencies obtained for the comment text to be classified, and making a classification decision. The method and the system have the following advantages: training and classification are very fast; the sequential learning strategy makes it easier to pick up newly emerging vocabulary and lowers the requirements on the corpus; and ensemble learning improves classification accuracy, so a system based on the method can better support management or purchasing decisions.
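The sequential learning advantage can be sketched with a small incremental classifier whose vocabulary grows as new comments arrive. This is an illustration under stated assumptions, not the method of the invention: the excerpt does not name the base classifiers, so a multinomial naive Bayes model with add-one smoothing is swapped in, and comments are assumed to be already segmented into tokens by the preprocessing step. The class name `OnlineNaiveBayes` and its methods are hypothetical.

```python
import math
from collections import Counter, defaultdict


class OnlineNaiveBayes:
    """Multinomial naive Bayes updated one labelled comment at a time (a sketch)."""

    def __init__(self, alpha: float = 1.0):
        self.alpha = alpha                        # additive smoothing constant
        self.doc_counts = Counter()               # label -> number of comments seen
        self.token_counts = defaultdict(Counter)  # label -> token -> count
        self.vocab = set()                        # grows as new words appear

    def partial_fit(self, tokens, label):
        """Absorb one labelled comment without retraining from scratch."""
        self.doc_counts[label] += 1
        self.token_counts[label].update(tokens)
        self.vocab.update(tokens)

    def predict_proba(self, tokens):
        """Return a dict mapping each seen label to its posterior probability."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        log_scores = {}
        for label, n_docs in self.doc_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + self.alpha * vocab_size
            score = math.log(n_docs / total_docs)
            for tok in tokens:
                if tok in self.vocab:  # tokens never seen in training carry no evidence
                    score += math.log((counts[tok] + self.alpha) / denom)
            log_scores[label] = score
        # Normalise in log space for numerical stability.
        top = max(log_scores.values())
        exps = {k: math.exp(v - top) for k, v in log_scores.items()}
        z = sum(exps.values())
        return {k: v / z for k, v in exps.items()}
```

A short usage example, with illustrative tokens: after `partial_fit(["质量", "好"], "positive")` and `partial_fit(["质量", "差"], "negative")`, the unseen word in `predict_proba(["给力"])` contributes nothing and both classes receive equal probability. After one further call `partial_fit(["给力"], "positive")`, the same query leans positive, without retraining on the earlier comments.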

Description

Technical field

[0001] The invention concerns research on sentiment classification of comment text and relates to the field of pattern recognition. In particular, it relates to a sentiment classification method for Chinese comment text based on ensemble learning and to a system based on the method.

Background

[0002] The popularity of the Internet and the emergence of various new network media not only bring people massive amounts of information, but also give them many venues for expressing their emotions, such as blogs, BBS forums, news comment sections, and other online comment platforms. Managing these emotion-laden online comments scientifically and efficiently is therefore particularly important for the security of individuals, enterprises, and society. However, these review texts differ significantly from ordinary texts: first, the review text has no fixed grammatical structure, is short in length, and even new wor...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 钱钢, 王海, 沈玲玲, 乔爱萍
Owner: 钱钢