Unlock instant, AI-driven research and patent intelligence for your innovation.

Short text sentiment fine classification method based on hybrid classifier

A hybrid classifier and classification method technology, which is applied in the field of short text sentiment classification based on hybrid classifiers, can solve problems such as wrong labeling of samples, low classification accuracy, and the impact of classifier training results, and reduce the introduction of noise , the effect of improving quality

Pending Publication Date: 2020-06-05
中国星网网络应用有限公司
View PDF9 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In practical applications, although semi-supervised learning can reduce the demand for the number of labeled training sets, due to the small number of labeled text training sets, it is impossible to train a single classifier with a high accuracy rate, which leads to the classification of classifiers. The classification accuracy of unlabeled samples is low, and the samples are marked with wrong emotional labels. These wrong labels will be added to the training set, which will cause noise pollution to the training set and affect the training results of the classifier.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text sentiment fine classification method based on hybrid classifier
  • Short text sentiment fine classification method based on hybrid classifier

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them.

[0031] refer to figure 1 , a short text sentiment segmentation method based on a hybrid classifier, which uses a long-short memory network classifier, a support vector machine classifier and a dictionary-based classification method to form a hybrid classifier, so as to use a small number of training samples to train the hybrid classifier, And through continuous iterative loops, find the classifier with the best classification effect for classification, including the following steps:

[0032] S1: preprocessing the text;

[0033] S2: Train the long-short memory network classifier and the support vector machine classifier respectively on the labeled sampl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a short text sentiment fine classification method based on a hybrid classifier. A support vector machine classifier, a long and short memory network classifier and a dictionary-based classification method are combined to form a hybrid classifier. Each classifier is trained by using a small number of training samples; then the emotion categories of unmarked samples are predicted by using the classifiers, the confidence coefficients of texts are calculated by using a weighting formula, the texts with higher confidence coefficients are added into a training set, the classifiers are trained by using a new training set, and iterating is performed to form the classifier with the best effect for classification. The method has the advantages of being good in classificationeffect and small in marking training set requirement.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a short text sentiment subdivision method based on a hybrid classifier. Background technique [0002] With the development of Internet technology, more and more information platforms have emerged, covering all aspects of daily life, such as social information, hot news, shopping, entertainment, etc. People publish their own information on various platforms according to their hobbies. However, there are also great differences in the attitudes and opinions expressed by each person towards various phenomena in life. This difference contains great commercial and social value. Therefore, sentiment analysis of this kind of text has very important significance and application value. [0003] In recent years, as the academic community has paid more and more attention to text sentiment analysis, research institutions at home and abroad have continuously invested manpower and ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06N3/04G06N3/08
CPCG06F16/35G06N3/049G06N3/08G06N3/045
Inventor 卢莉
Owner 中国星网网络应用有限公司