SVM based micro-blog emotion classification method fusing various kinds of emotion resources

A technology of emotion classification and microblogging, which is applied in text database clustering/classification, unstructured text data retrieval, instruments, etc., can solve problems such as inability to accurately and comprehensively obtain netizens' emotional tendencies, and achieve the effect of improving accuracy

Pending Publication Date: 2017-03-15
NANJING UNIV OF SCI & TECH
View PDF3 Cites 42 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, dividing emotions into three types is too absolute and cannot accurately and comprehensively capture the emotional tendencies of netizens

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • SVM based micro-blog emotion classification method fusing various kinds of emotion resources
  • SVM based micro-blog emotion classification method fusing various kinds of emotion resources
  • SVM based micro-blog emotion classification method fusing various kinds of emotion resources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0065] combine figure 1 , the present invention is based on the microblog emotion 5-level classification method that SVM merges multiple emotion resources, comprises the following steps:

[0066] The first step is to build related dictionaries, improve the sentiment dictionary and degree adverb dictionary, and assign weights to all words in the degree adverbs. Partial degree adverbs and their weights are shown in Table 1.

[0067] The second step is to preprocess the corpus, and perform sentence segmentation, format processing, word segmentation and part-of-speech tagging on different corpus in advance;

[0068] (1) Since the microblog corpus contains useless information such as #话题#, URL, and @user, this information does not contain the user's point of view, and may also affect the effect of word segmentation and part-of-speech tagging in the next step. Therefore, before word segmentation, first filter out useless information such as #话话#, URL and @user in Weibo, and then p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an SVM based micro-blog emotion classification method fusing various kinds of emotion resources. The method includes the following steps: constructing relevant dictionaries including an emotion dictionary, a negation dictionary, and a degree adverb dictionary; performing pretreatment on different corpora, performing word segmentation and part-of-speech tagging on the corpora, and performing sentence structure analysis; comparing the segmented words and positive and negative dictionaries to acquire initial word polarity, comparing words ahead of emotion words and the word degree grade dictionary and the negation dictionary to acquire modifier weight, and multiplying the initial word polarity by the modifier weight to acquire emotion scores of each micro-blog; extracting features such as nouns, verbs, adjectives, positive and negative emotion words, degree adverb weights, emotion scores, privatives and specific symbols from part-of-speech features, emotion features, sentence pattern features, and semantic features; and inputting the extracted features into an Libsvm to perform model training so as to acquire a training model. The method can achieve emotion 5-grade classification of micro-blogs, and can accurately and roundly acquire emotion tendency of netizens.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, in particular to a microblog emotion classification method based on SVM and integrating multiple emotion resources. Background technique [0002] There are extremely rich subjective emotional information of netizens hidden in Weibo. By classifying microblogs and obtaining the emotional tendencies of the majority of netizens, we can quickly and accurately understand the demands of the majority of netizens and provide a reliable basis for network public opinion analysis. At present, many scholars have studied the sentiment classification of Weibo, mainly using the method based on the sentiment dictionary and the method of machine learning to divide the sentiment into three categories: positive, negative or positive, neutral and negative. The method based on the sentiment dictionary is to construct a sentiment dictionary, then calculate the sentiment tendency value through a spe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/35G06F40/284
Inventor 陈芬杨爽何源陈佩帆王鹏鹏
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products