Sentiment analysis method based on Skip-gram model fusing part-of-speech and semantic information

A technology of sentiment analysis and semantic information, which is applied in the field of sentiment analysis of the Skip-gram model, can solve problems such as unreasonable use of part-of-speech information and little work of sentiment analysis in word vector training

Active Publication Date: 2018-11-02
HUAZHONG UNIV OF SCI & TECH
View PDF4 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] To sum up, most of the current research work on sentiment analysis focuses on proposing different deep neural network models for sentiment analysis. Model improvement mostly focuses on modifying the model structure to reduce model complexity. Other work mainly focuses on cross-language However, there are few sent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentiment analysis method based on Skip-gram model fusing part-of-speech and semantic information
  • Sentiment analysis method based on Skip-gram model fusing part-of-speech and semantic information
  • Sentiment analysis method based on Skip-gram model fusing part-of-speech and semantic information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0045] In the embodiment of the present invention, the corpus is a data set of Chinese product reviews of e-commerce such as Dianping and JD.com downloaded from the Internet. The comment data set is mainly customers' comments on merchants, and the data set has authenticity and objectivity.

[0046] figure 1 It is a schematic flowchart of a sentiment analysis method based on a Skip-gram model that integrates part-of-speech and semantic information provided by an embodiment of the present invention. Such as figure 1 As shown, the method includes:

[0047] Step 1. Review corpus preprocessing to obt...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sentiment analysis method based on a Skip-gram model fusing part-of-speech and semantic information. The method includes five steps of data preprocessing, part-of-speech information modeling, word vector representation, semantic information modeling and sentiment analysis. Data preprocessing includes filtering, word segmentation and part-of-speech labeling. Part-of-speechinformation modeling contains context-based modeling on part-of-speech information. A word vector representation module carries out vector training on the Skip-gram model fusing the part-of-speech information. The semantic information modeling module includes text representations fusing sentiment semantic information. Compared with the prior art, the method considers the part-of-speech information and the sentiment semantic information of words, fully uses the part-of-speech information of the words for helping in word vector training and the sentiment semantic prior-information for helping in learning of text vectors on the basis of modeling on the part-of-speech information of the words and fusion of the semantic information, enables the represented text vectors to more accord with linguistic features, and has good results for sentiment analysis.

Description

technical field [0001] The invention belongs to the field of natural language processing, and in particular relates to an emotion analysis method based on a Skip-gram model integrating part-of-speech and semantic information. Background technique [0002] With the development of e-commerce, sentiment analysis and mining of product review texts are of great value for researching product reputation and product recommendation. Review data has become an important data source for companies to improve product quality and service. [0003] With the great achievements of deep learning in the fields of audio, image, and video, the traditional machine learning model is fused with the neural network model in deep learning, and the traditional word vector is replaced by a low-dimensional word vector that can measure the semantic correlation between words. The one-hot vector used in the bag model has achieved superior performance compared to traditional machine learning methods in variou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/284
Inventor 李瑞轩文坤梅黄伟李玉华辜希武昝杰龚晶
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products