Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text sentiment analysis method based on new word extension and complex sentence pattern extension

A technology of sentiment analysis and sentence patterns, applied in text database clustering/classification, unstructured text data retrieval, special data processing applications, etc., can solve problems such as ignoring context, complex sentence structure, ignoring different meanings, etc. , to achieve the effect of improving emotion recognition, accurate experimental results, and accurate recognition

Active Publication Date: 2020-06-02
CHONGQING UNIV OF POSTS & TELECOMM
View PDF21 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The current existing research on text sentiment analysis and sentiment classification only adds daily Internet terms, but ignores that specific words have different meanings in different contexts, and ignores specific contextual backgrounds
In addition, there are many short texts in the comment corpus of various social platforms, and the Chinese sentence structure is more casual, and the complex sentence structure also increases the difficulty of judging the emotional polarity of sentences.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text sentiment analysis method based on new word extension and complex sentence pattern extension
  • Text sentiment analysis method based on new word extension and complex sentence pattern extension
  • Text sentiment analysis method based on new word extension and complex sentence pattern extension

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.

[0071] The technical scheme that the present invention solves the problems of the technologies described above is:

[0072] A text sentiment analysis method based on new word expansion and complex sentence pattern expansion, which comprises the following steps:

[0073] S1: Construct a basic sentiment dictionary, using HowNet sentiment dictionary and National Taiwan University NTUSD Simplified Chinese sentiment dictionary to build a basic sentiment dictionary, and deduplicating the two dictionaries, a total of 3646 positive sentiment words and negative sentiment words were obtained 9530 words. There are 31 negative words.

[0074] S2: Data cleaning by the following steps

[0075] (1) Remove the html fo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text sentiment analysis method based on new word extension and complex sentence pattern extension. The method comprises the following steps: S1, firstly, constructing a basicdictionary according to an existing sentiment dictionary, and cleaning and screening the existing dictionary; s2, performing data cleaning on the imported Chinese corpus, and expanding sentiment words in a specific field according to a basic sentiment dictionary; s3, on the basis of the existing method, finding new words in the specific field and adding the new words into the basic dictionary bycombining word frequency, part-of-speech and similarity calculation; s4, analyzing the structure of the Chinese sentence pattern, summarizing and concluding a sentence pattern model, and judging the sentiment polarity of the sentence through different models; and S5, obtaining an algorithm selector suitable for the method, and integrating the dictionary and the sentence pattern model to obtain a sentence polarity result. Compared with a traditional emotion dictionary and machine learning method, the method focuses on short text sentence emotion recognition in the specific field, and the accuracy and recall rate are obviously improved.

Description

technical field [0001] The invention belongs to the field of text classification sentiment analysis, in particular to an analysis method for short text sentiment classification in a specific field. Background technique [0002] The convenience of interaction has made the Internet one of the main ways for people to express their opinions and communicate with each other more and more. Subjective texts generated on the Internet contain a lot of useful emotional information. More and more people are accustomed to expressing their positive, neutral or negative emotions, as well as their preferences for using products on these platforms. Therefore, reviews on various shopping websites, Weibo, forums and other platforms will become the basis for consumers to make purchase decisions. [0003] Due to the huge amount of network evaluation information, it is infeasible to rely on manual methods, the efficiency is very low, and it is difficult to find out really valuable information. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/36
CPCG06F16/35G06F16/374Y02D10/00
Inventor 刘洪涛孙桂
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products