Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text sentiment analysis method based on dynamic threshold and multiple classifiers

A multi-classifier and dynamic threshold technology, which is applied in the direction of instruments, character and pattern recognition, and special data processing applications, can solve problems such as the inability to obtain emotional types and the difficulty of obtaining real-time updated emotional lexicons, so as to ensure accuracy, The effect of reducing influence and improving accuracy

Inactive Publication Date: 2018-11-30
TIANJIN UNIV
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The typical unsupervised learning method is based on the emotional lexicon, but the network culture is changing with each passing day, and the word update speed is extremely fast, so it is difficult to obtain a suitable and real-time updated emotional lexicon
Another type of unsupervised learning is a method based on clustering ideas, but this method can only classify texts with different emotions, but cannot obtain specific emotion types

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text sentiment analysis method based on dynamic threshold and multiple classifiers
  • Text sentiment analysis method based on dynamic threshold and multiple classifiers
  • Text sentiment analysis method based on dynamic threshold and multiple classifiers

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The invention proposes a sentiment analysis method based on a dynamic threshold and multiple classifiers, and uses multiple classifiers to jointly decide text sentiment on the basis of continuously expanding data sentiment labels based on the dynamic threshold, so that the sentiment analysis is more accurate. figure 1 The sentiment analysis mechanism of the present invention is shown. figure 2 The dynamic threshold-based classifier training pipeline is shown. image 3 shows the weight voting policy process.

[0026] The specific implementation steps are as follows:

[0027] 1) Firstly, the crawler is used to collect the comment text data set. The data set includes a training set and a prediction set. The training set is divided into data L with emotional labeling (small proportion) and data U without emotional labeling (large proportion). According to different emotions, L is labeled with emotions, such as positive, negative, and neutral; or subjective and objective;...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a semi-supervised text sentiment analysis method based on dynamic threshold and multiple classifiers. The method herein includes: marking sentiment of a part of data with L, and data that are not subjected to sentiment marking as U; performing word pre-segmentation on the data; using word2vec tool to convert words into numeric vectors; using a sampling method with a release function to sample L repeatedly T times so as to obtain T sample sequence Lt and T non-selected sample sequences OOBt; selecting one sample sequence Lt and data with no sentiment marking, and marking them as Ut; using Lt to train Support Vector Machine (SVM) classifier Ct, predicting Ut with Ct, adding samples l having reliability reaching a threshold into Lt, and deleting from Ut; updating thethresholds; using OOBt to calculate reliability Pt of the classifier Ct to obtain T base classifiers and their corresponding reliabilities P; using T BCs to predict prediction text s.

Description

technical field [0001] The invention belongs to the technical field of text classification based on semi-supervised learning, and in particular relates to a text sentiment analysis method based on a dynamic threshold and multiple classifiers. Background technique [0002] With the rapid development of social networks, more and more people tend to express their opinions and opinions on online platforms, such as the currently popular Weibo, WeChat Moments, Douban, Zhihu and so on. In order to better understand and utilize these comments, sentiment analysis for social media user comments has become a current research hotspot. This type of sentiment analysis can be applied to different scenarios, which is helpful for various departments to make network decisions, financial forecasts, policy formulation and public opinion analysis, etc. For example, Douban users can analyze the emotional polarity contained in other users' reviews of a certain movie to determine whether to choose...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30G06K9/62
CPCG06F40/289G06F18/2411G06F18/214
Inventor 韩玥王颖金志刚
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products