A Method for Analyzing the Sentiment Polarity of Long Text News Public Opinion

A technology of emotional polarity and analysis method, which is applied in text database clustering/classification, semantic analysis, neural learning methods, etc., can solve the problem of low accuracy and achieve the effect of accurate emotional polarity analysis model

Active Publication Date: 2021-02-23
NANJING HOWSO TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide a method for analyzing the emotional polarity of long-text news public opinion, which encodes the long text through a transformer (Transformer), and then analyzes the long text through a bidirectional threshold recurrent unit network (Bi-GRU). The method of text news public opinion sentiment polarity can solve the problem of low accuracy of traditional methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Analyzing the Sentiment Polarity of Long Text News Public Opinion
  • A Method for Analyzing the Sentiment Polarity of Long Text News Public Opinion
  • A Method for Analyzing the Sentiment Polarity of Long Text News Public Opinion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0044] Embodiment: the analysis method of the emotional polarity of this long text news public opinion, comprises the following steps:

[0045] S1: Collect text data as training samples; the types of emotional polarity labels include positive, neutral, and negative;

[0046] S2: Perform data cleaning on the training sample data collected in step S1, and process special characters in the cleaned training sample data, including deleting URLs, deleting special punctuation marks, deleting continuous punctuation marks, deleting spaces, deleting Continuous line breaks, etc., to obtain the data set;

[0047] S3: Segment the data set, randomly divide the sample into a training set and a test set according to the ratio of 8:2, and ensure that the proportion of emotional polarity labels in the training set and the test set is consistent when dividing the data set;

[0048] S4: Build a deep learning network based on a representation model and load pre-trained parameters;

[0049] The s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for analyzing the emotional polarity of long-text news public opinion, comprising the following steps: S1 collects text data as training samples, S2 performs data cleaning on the data of the training samples collected in step S1, and cleans the Process the special characters in the data of the training sample to obtain the data set; S3 split the data set, divide the training sample into a training set and a test set according to the proportion; S4 build a deep learning network based on the representation model and load the pre-training parameters; S5 builds a long text emotional polarity analysis network model; S6 modifies the training sample data structure; S7 model training, adopts the method of stratified sampling and K-fold cross-validation, and guarantees that the proportion of samples in the sample data set of each fold is the same as that in stratified sampling. The proportion of the original data is the same, and the results of each fold model are saved in the model with the highest score in the verification set. The comprehensive K-fold model is tested on the test set, and the average probability is taken as the test result of the model.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a method for analyzing the emotional polarity of long-text news public opinion based on a Transformer structure. Background technique [0002] Judging the emotional polarity of news public opinion can be abstracted as a text classification problem in the field of natural language processing, that is, to judge the emotional polarity expressed by the news through the title and text. At present, the methods used for sentiment analysis are mainly divided into the following three types: [0003] 1. Method based on sentiment lexicon. The process of the traditional model method based on the emotional dictionary is to first construct the emotional dictionary, and use the dictionary to judge the emotional orientation and emotional strength of the words in the pre-analysis text, so as to realize the overall emotional classification of the text. Limitations...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/284G06F40/30G06N3/04G06N3/08
CPCG06F16/35G06F40/284G06F40/30G06N3/08G06N3/045
Inventor 唐大鹏郭柏龙陈大龙
Owner NANJING HOWSO TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products