Method for analyzing emotion polarity of long text news public opinions

A technology of emotional polarity and analysis method, which is applied in text database clustering/classification, semantic analysis, neural learning methods, etc., can solve the problem of low accuracy and achieve the effect of accurate emotional polarity analysis model

Active Publication Date: 2020-12-18
NANJING HOWSO TECH
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The technical problem to be solved by the present invention is to provide a method for analyzing the emotional polarity of long-text news public opinion, which encodes the long text through a transformer (Transformer), and then analyzes the long text through a bidirectional threshold recurrent unit network (Bi-GRU). The method of text news public opinion sentiment polarity can solve the problem of low accuracy of traditional methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for analyzing emotion polarity of long text news public opinions
  • Method for analyzing emotion polarity of long text news public opinions
  • Method for analyzing emotion polarity of long text news public opinions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0044] Embodiment: the analysis method of the emotional polarity of this long text news public opinion, comprises the following steps:

[0045] S1: Collect text data as training samples; the types of emotional polarity labels include positive, neutral, and negative;

[0046] S2: Perform data cleaning on the training sample data collected in step S1, and process special characters in the cleaned training sample data, including deleting URLs, deleting special punctuation marks, deleting continuous punctuation marks, deleting spaces, deleting Continuous line breaks, etc., to obtain the data set;

[0047] S3: Segment the data set, randomly divide the sample into a training set and a test set according to the ratio of 8:2, and ensure that the proportion of emotional polarity labels in the training set and the test set is consistent when dividing the data set;

[0048] S4: Build a deep learning network based on a representation model and load pre-trained parameters;

[0049] The s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for analyzing emotion polarity of long text news public opinions, which comprises the following steps: S1, collecting text data as a training sample, S2, performing data cleaning on the data of the training sample collected in the step S1, and processing special characters in the cleaned data of the training sample to obtain a data set; S3, segmenting the data set,and segmenting the training sample into a training set and a test set according to a proportion; S4, establishing a deep learning network based on the representation model and loading pre-training parameters; S5, building a long text sentiment polarity analysis network model; S6, modifying a training sample data structure; and S7, model training: adopting a stratified sampling and K-fold cross validation method to ensure that the sample proportion in the sample data set of each fold is consistent with the original data proportion during stratified sampling, storing the model result of each fold in the model with the highest score of the validation set, integrating the K-fold model to test the test set, and taking the average probability as the test result of the model.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a method for analyzing the emotional polarity of long-text news public opinion based on a Transformer structure. Background technique [0002] Judging the emotional polarity of news public opinion can be abstracted as a text classification problem in the field of natural language processing, that is, to judge the emotional polarity expressed by the news through the title and text. At present, the methods used for sentiment analysis are mainly divided into the following three types: [0003] 1. Method based on sentiment lexicon. The process of the traditional model method based on the emotional dictionary is to first construct the emotional dictionary, and use the dictionary to judge the emotional orientation and emotional strength of the words in the pre-analysis text, so as to realize the overall emotional classification of the text. Limitations...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/284G06F40/30G06N3/04G06N3/08
CPCG06F16/35G06F40/284G06F40/30G06N3/08G06N3/045
Inventor 唐大鹏郭柏龙陈大龙
Owner NANJING HOWSO TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products