An Improved Method for Fine-grained Sentiment Analysis Based on Sentiment Word Embedding

A sentiment analysis, sentiment word technology, applied in semantic analysis, text database clustering/classification, unstructured text data retrieval, etc., can solve the problem of not considering sentiment information, and achieve the effect of good sentiment classification results

Active Publication Date: 2021-09-07
SHANGHAI MIDU INFORMATION TECH CO LTD
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these methods have certain limitations in sentiment analysis tasks: word embedding implements many NLP tasks by learning low-dimensional continuous value vector representations of words, but the word representation obtained by traditional word embedding methods only contains semantic information in the text corpus , without considering the emotional information in the text corpus

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Improved Method for Fine-grained Sentiment Analysis Based on Sentiment Word Embedding
  • An Improved Method for Fine-grained Sentiment Analysis Based on Sentiment Word Embedding
  • An Improved Method for Fine-grained Sentiment Analysis Based on Sentiment Word Embedding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0106] In this embodiment, the method for sentiment analysis based on embedding of sentiment words proposed in this patent is applied to the comment data of the online shopping platform.

[0107] Step 1: Acquisition, manual labeling and preprocessing of text datasets. Specifically, the following steps are adopted:

[0108] A. Collect online comment data for sentiment analysis tasks;

[0109] B. Artificially label the sentences in all online comment data. This patent divides the emotional labels into three categories: "positive, negative, and neutral". After labeling, each sentence corresponds to an emotional label;

[0110] C. Perform preprocessing on the text data set, including text segmentation, deletion of special symbols and stop words, where text segmentation refers to dividing sentences in the text data set into individual words. The word segmentation tool used is the Chinese Academy of Sciences word segmentation tool ICTCLA2018. Since the network text data contains ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an improved fine-grained sentiment analysis method based on sentiment word embedding, which includes: acquisition of text data sets, manual labeling and preprocessing. Compute the semantic word vector corresponding to each word in the text dataset. Get a collection of sentiment words. Compute a set of sentiment phrases for each word in the training dataset. Calculate the sentiment word vector corresponding to each word in the training dataset. Construct sentiment embeddings for each word in the training dataset. Train a classifier to get a fine-grained sentiment analysis model. The present invention utilizes prior emotion knowledge combined with word embedding model, and constructs emotional word embedding suitable for fine-grained emotion analysis for emotion analysis. The present invention can better identify emotional information at the word level, describe users' emotions more accurately and fine-grained, and can be used for fine-grained emotional analysis tasks, such as: analysis of user consumption habits, analysis of user comments on products, etc.

Description

technical field [0001] The invention belongs to the technical field of Chinese text sentiment analysis, and in particular relates to an improved fine-grained sentiment analysis method based on sentiment word embedding for Chinese short texts such as microblogs and network comments. Background technique [0002] With the rapid development of the Internet, millions of people use social networks every day, such as Weibo, Tieba and other online platforms to express their views on products, services, news, events, etc. Analyzing the opinions or opinions expressed by users is very important for marketing professionals and researchers. Sentiment analysis of Weibo texts has become increasingly important due to the popularity of Weibo in Chinese society. Since 2013, the China Computer Federation (CCF for short) has specially set up the Chinese Weibo sentiment classification evaluation task in the second Natural Language Processing and Chinese Computing Conference (NLPCC for short), ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/289G06F40/30G06F40/242
Inventor 李芳芳畅帅毛星亮施荣华石金晶胡超
Owner SHANGHAI MIDU INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products