Emotion text classification method

A text classification and emotion technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as not considering user emotions

Active Publication Date: 2017-09-29
TIANJIN UNIV
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the main relatively accurate text classification algorithm is Data Expansion Sentiment Analysis (DESA), although this algorithm can improve the accuracy of sentiment analysis technology to a certain extent, but this method only analyzes the text in the extended corpus. events and opinions, without taking user sentiment into account

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Emotion text classification method
  • Emotion text classification method
  • Emotion text classification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention proposes a text classification algorithm based on emotion, comprising the following steps:

[0033] Step 1: Obtain a Chinese emotion dictionary. Use English tense words and relaxed words to mark the dictionary, convert them into Chinese through tools such as Baidu translation, and manually add some related words to the Chinese dictionary.

[0034] Step 2: Strength detection of tense words and relaxed words. According to the Chinese emotional dictionary, detect whether the original corpus text contains tense words and relaxed words, extract the largest stress value Stress and the largest relaxation value Relation in the original corpus text content, and make them processed as part of the feature set.

[0035] Step 3: Text sentiment classification. The support vector machine algorithm (Support Vector Machine, SVM) is used to classify the new feature vectors to obtain the emotional tendency value of the original corpus.

[0036] Step 4: Integrate c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of data mining, natural language processing and information retrieval, and provides an emotion text classification algorithm (Emotion Text Classification, ETC). According to the emotion text classification algorithm, emotion factors and a data expansion sentiment analysis algorithm are combined, the final classification emotion polarity value of an original corpus is obtained through an integrated model. According to the technical scheme, the emotion text classification method includes the steps that 1, a Chinese emotion dictionary is acquired; 2, the strength of stress words and relax words is detected; 3, text emotion classification is conducted; 4, integrated classification prediction is conducted. The method is mainly applied to the field of data mining, natural language processing and information retrieval.

Description

technical field [0001] The invention belongs to the fields of data mining, natural language processing and information retrieval, and relates to short text sentiment analysis technology, especially a text classification method based on sentiment analysis. Background technique [0002] Foreign researchers have contributed many authoritative data sets in the field of sentiment analysis, which have been widely used in various conferences and competitions; however, in the field of Chinese text sentiment analysis, labeled data that fully meet the research needs and are sufficiently authoritative At the same time, corpus expansion can remove part of the noise, alleviate the feature sparsity problem to a certain extent, increase the semantic correlation space of text content, and form texts with similar semantics and different words, which can effectively improve the performance of sentiment analysis technology. experimental effect. [0003] At present, the main relatively accurat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/355
Inventor 侯庆志王正凯喻梅王建荣徐天一成基元
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products