Emotional data classification method and system

A data classification and emotion technology, applied in the field of emotion data classification methods and systems, can solve the problems of not getting results, the accuracy of emotion classification needs to be improved, and it is difficult to define, etc., to improve classification accuracy, accurate emotional tendencies, and document - Emotional matrix precise effects

Active Publication Date: 2017-07-04
INST OF AUTOMATION CHINESE ACAD OF SCI +1
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, it is difficult to define a universally optimal emotional vocabulary to cover all words from different domains
Furthermore, most semi-automatic dictionary-based methods do not yield satisfactory results
The traditional and more advanced dictionary-based method is based on the constrained non-negative matrix tri-factorization (ConstrainedNon-negative Matrix Tri-factorization, referred to as CNMTF) sentiment classification method, which uses domain-independent emotional vocabulary as prior knowledge for emotional classification. However, the experiments show that the sentiment classification accuracy of CNMTF-based sentiment classification method still needs to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Emotional data classification method and system
  • Emotional data classification method and system
  • Emotional data classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] The emotion data classification method provided by the embodiment of the present invention can determine the corresponding emotion tendency of the documents in the test data set. The test data set may be a collection of emotional data generated by users on the Internet, for example, comment data, blog data, etc. existing on the Internet. Sentiment data classification methods can determine the corresponding emotional tendency of documents such as comments, such as determining whether they are positive or negative. Specifically, the data in the training data set is first trained. The training...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method and system for classifying emotional data, the method comprising: constructing a document-document graph and a word-word graph corresponding to a training data set, in the document-document graph, a node represents a word in the training data set document, the geometric information of the edge represents the correlation between the documents, and in the word-word graph, the node represents the word in the training data set, and the geometric information of the edge represents the correlation between the words; according to the document-document The geometric information of the graph and word-word graph constructs a graph-based regularization item in the objective function; optimizes the objective function and outputs a document-sentiment matrix; obtains documents in the test data set, according to the document-sentiment matrix The sentiment tendency corresponding to the documents in the test data set is obtained. By adopting the method and system, the accuracy of emotion classification can be improved.

Description

technical field [0001] The invention relates to natural language processing technology, in particular to an emotion data classification method and system. Background technique [0002] With the development of Web 2.0, more and more users generate emotional data in web pages, which usually exist in the form of comments and blog data in the network. Sentiment classification refers to automatically predicting the sentiment orientation of sentiment data generated by users, for example, predicting whether a comment is positive or negative. [0003] Recently, sentiment classification has gained widespread attention in natural language processing, and sentiment classification methods can be divided into supervised sentiment analysis and unsupervised sentiment analysis. Supervised sentiment analysis relies on human-annotated training data, and in some cases, the labeling work is time-consuming and expensive, which motivates unsupervised or semi-supervised sentiment analysis. [00...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/951G06F18/24
Inventor 周光有王巨宏蒋杰薛伟管刚赵军
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products