Microblog sentiment analysis method based on large-scale corpus characteristic learning

A technology of sentiment analysis and feature learning, which is applied in the field of Internet information search, can solve problems such as low accuracy of sentiment analysis, complex network structure, and heavy workload, and achieve the effects of fast training speed, improved accuracy, and high accuracy

Active Publication Date: 2015-09-09
EAST CHINA NORMAL UNIV
View PDF3 Cites 45 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing microblog sentiment analysis methods require manual design of features, heavy workload, high cost, and complex network structure. They cannot combine grammatical, contextual, and emotional tendencies to effectively deal with negative relationships, and the accuracy of sentiment analysis is low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog sentiment analysis method based on large-scale corpus characteristic learning
  • Microblog sentiment analysis method based on large-scale corpus characteristic learning
  • Microblog sentiment analysis method based on large-scale corpus characteristic learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] See attached figure 1 , the present invention includes a neural network based on Word2Vec and a classifier based on supervised learning, utilizes a microblog text to train a neural network based on Word2Vec, and inputs the microblog text into the neural network that completes the training, the neural network will input the microblog, according to The grammatical context, the emotional tendency in the current context, and whether it is negated are mapped to the corresponding word vectors, so as to obtain a Weibo word vector matrix corresponding to the Weibo text, and the word vector matrix is ​​made on the Weibo word vector matrix Synthesis of microblog text to obtain the microblog feature vector corresponding to the microblog text, train the microblog feature vector to the classifier based on supervised learning to obtain the sentiment classifier, use the sentiment classifier to analyze the sentiment of the microblog text, can be compared Accurately Predict Sentiment Te...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a microblog sentiment analysis method based on large-scale corpus characteristic learning. The method is characterized in that microblog texts is used to train a neural network based on Word2Vec, the neural network completes training by inputting the microblog texts, the neural network maps the input microblog to a corresponding word vector according to grammar contents, emotional tendency of current contents, and whether words are denied or not, and a matrix formed by word vectors corresponding to the words in the microblog is obtained. Synthesis of word vectors is performed on the matrix, so as to further obtain a feature vector corresponding to the microblog. Through applying the feature vector to training and prediction of a sentiment classification device, a relatively accurate microblog sentiment analysis is obtained. Compared with the prior art, the method is low in cost and high in analysis accurate rate, and is especially suitable for large-scale corpus, and is fast in training speed. Combining with grammar contents and emotional tendency, the method effectively processes negation relations, and improves accurate rate of sentiment analysis.

Description

technical field [0001] The invention relates to the technical field of Internet information search, in particular to a microblog sentiment analysis method based on large-scale corpus feature learning. Background technique [0002] In today's information explosion, Weibo, as a social tool, has an increasing impact on people's lives. No matter what you do, see, or hear, you want to share it with others. Weibo provides such a platform, allowing users to freely share their thoughts, experiences, etc. with others. Weibo sentiment analysis refers to the extraction and analysis of opinions in Weibo. For example, for a Weibo that contains comments on a certain movie, Weibo sentiment analysis is to analyze the comments on this movie in this Weibo. Emotional orientation, that is, judging whether the point of view is positive or negative (classification problem). Since the microblog platform carries a large number of subjective thoughts of users, automatic sentiment analysis on micro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27G06Q50/00
CPCG06F16/951G06F40/30G06Q50/01
Inventor 杨静裴逸钧贺樑
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products