Multi-language emotional data processing and classifying method and system based on key sentences

A data processing and data classification technology, applied in the direction of electrical digital data processing, special data processing applications, text database clustering/classification, etc., to achieve the effect of less resource dependence, performance improvement, and high performance

Inactive Publication Date: 2014-08-20
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF3 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0022] In order to solve the above problems, the object of the present invention is to propose a language-independent multilingual emotion data c...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language emotional data processing and classifying method and system based on key sentences
  • Multi-language emotional data processing and classifying method and system based on key sentences
  • Multi-language emotional data processing and classifying method and system based on key sentences

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] A method for processing and classifying multilingual emotion data based on key sentences of the present invention, comprising:

[0059] Step 1. Automatically extract an emotion dictionary data package (two-tuple data such as "good positive class" and "poor negative class") from the unlabeled emotional corpus database. The polarity (positive or negative) of emotional words is determined by the K-nearest neighbor algorithm and voting rules. In the voting rules, the present invention also introduces a suspension mechanism to prevent overcorrection of polarity determination;

[0060] Step 2, use the extracted emotional dictionary data package to calculate the score of the emotional attribute, and then comprehensively consider the position attribute and keyword attribute, automatically extract several emotional key sentences for each text as the representative of each text;

[0061] Step 3, apply the extracted emotional key sentences directly to supervised and unsupervised ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-language emotional data processing and classifying method and system based on key sentences. The method includes the steps that first, an emotional dictionary data packet is automatically extracted from an unlabelled emotional data set, and the polarity of emotional words is finally judged through a K nearest neighbor algorithm and a voting rule; second, the extracted emotional dictionary data packet is used for calculating the score of the emotion attribute, then, the position attribute and the key word attribute are comprehensively considered, and a plurality of emotional key sentences are extracted for each text; third, the extracted emotional key sentences are directly applied to supervised emotional data classification and unsupervised emotional data classification. Therefore, the double-difficulty problem caused by language migration and emotional data analysis in the multi-language translation process can be solved, and emotional data analysis accuracy can be improved.

Description

technical field [0001] The invention relates to text emotion data analysis, in particular to a method and system for processing and classifying multilingual emotion data based on key sentences. Background technique [0002] With the continuous emergence of online communication platforms such as forums, blogs, reviews, and Weibo, people are becoming more and more accustomed to posting subjective comments online. These comments are used to express people's views and opinions on daily events, products, policies, etc. At the same time, with the acceleration of the globalization process, the information resources provided by the network are multilingual. Sentiment classification is a classification task that divides texts into positive and negative according to the emotional polarity expressed; multilingual sentiment classification refers to using the source language to classify other languages. Multilingual sentiment classification aims to study the opinions, opinions and attit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 程学旗林政张瑾谭松波徐学可
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products