Deep belief network-based short text feature optimization and sentiment analysis method

A deep belief network and sentiment analysis technology, applied in the field of short text feature extraction and sentiment analysis based on deep belief network, can solve the problem that sentiment analysis methods cannot effectively use semantic information, and achieve the effect of enriching emotional semantic expression

Active Publication Date: 2017-09-22
BEIJING UNIV OF TECH
View PDF3 Cites 44 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] To sum up, the rule-based sentiment analysis method cannot effectively utilize the potential semantic information of the text, and the traditional machine learning feature extraction method is mainly based on the method of probability statistics, which has inherent defects. In order to make up for the shortcomings of these methods, the present i

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep belief network-based short text feature optimization and sentiment analysis method
  • Deep belief network-based short text feature optimization and sentiment analysis method
  • Deep belief network-based short text feature optimization and sentiment analysis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0070] The traditional method of extracting features is mainly based on the method of probability and statistics, which cannot fully discover the potential semantic information of the text, so that the feature training cannot be performed better to determine the emotional tendency of short texts; the embodiment of the present invention provides a new type of short text based on deep learning The feature extraction and sentiment analysis method includes the following specific steps:

[0071] All corpus collections are divided into training set and test set according to the ratio of 8:2, utilize training set to carry out model training to the method that the present invention extracts, utilize test set to test the pros and cons of the present invention method compared with traditional method.

[0072] The training corpus is marked with artificial emotional orientation, with positive emotional orientation marked as 1, neutral emotional orientation marked as 0, and negative emotion...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep belief network-based short text feature optimization and sentiment analysis method. The method comprises the following steps of: 1, obtaining a microblog short text corpus set, a thesaurus, semantic progressive associated words, microblog expression dictionary and a word segmentation model; 2, expanding and reconstructing short texts; 3, carrying out word segmentation and preprocessing on the short texts; 4, constructing a word similarity calculation model; 5, expanding feature vectors of the short texts; 6, self-adaptively extracting an expanded candidate feature set on the basis of a feature depth of a deep belief network; 7, carrying out classified training on the feature set obtained by the deep belief network by utilizing a machine learning classification algorithm so as to obtain a classification and prediction model; and 8, carrying out sentiment annotation on a test data set by utilizing the classification and prediction model. The method is capable of finding potential feature semantic information more effectively and improving the quality of sentiment feature extraction so as to improve the correctness of sentiment classification.

Description

technical field [0001] The invention belongs to the field of text information processing, and in particular relates to a short text feature extraction and sentiment analysis method based on a deep belief network. Background technique [0002] The main content of sentiment analysis is to discover the subjective opinions carried in the text, including the thoughts, hobbies, and emotional expressions of the information subject. It is a multidisciplinary task involving NLP (Natural Language Processing), IR (Information Retrieval), AI (Artificial Intelligence) and many other fields. [0003] The research on sentiment analysis of short texts is a new direction developed along with the emergence of new social tools at home and abroad. Compared with the analysis of opinions and sentiment tendencies of traditional texts, short texts are full of expressions due to their short content, sparse features, and random grammatical expressions. Elements and other non-standard expression word...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
CPCG06F40/279G06F40/30
Inventor 杜永萍陈守钦赵晓铮
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products