Text data viewpoint summary mining method merging topic attributes and emotion information

A text data and topic technology, applied in the fields of sentiment analysis and text summarization, can solve the problems of integrating emotional information and not considering different emotions, and achieve the effect of accurate topic attributes

Active Publication Date: 2018-07-17
FUZHOU UNIV
View PDF6 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage is that most of the existing graph models consider using text sentences and topic features to construct graph structures, and describe the emotional information of opinion summaries through the emotional information of the entire text sentence, without integrating the emotional information of topic attributes in the graph structure, there is no Considering that the topic features of different emotions are two subjects with different meanings, resulting in sentences containing different emotional topic attributes being associated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text data viewpoint summary mining method merging topic attributes and emotion information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0022] A text data opinion summary mining method that integrates topic attributes and emotional information, which includes the following steps: Step S1: preprocess the text corpus of the topic, and clean up some irrelevant words; Step S2: input the topic corpus and background Corpus; Step S3: Use the log likelihood ratio method to extract the topic attributes of the topic corpus; Step S4: Add the topic attribute obtained in Step S3 to the emotional polarity, and the emotional polarity includes positive emotion and negative emotion, thus positive Topic attributes and negative topic attributes are used as emotional attribute features to vectorize sentences; step S5: use the topic attributes obtained in step 3 as evaluation objects, and use the multi-evaluation object-oriented dynamic word sequence sentiment analysis method to analyze the evaluati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text data viewpoint summary mining method merging topic attributes and emotion information. The method comprises the steps of preprocessing a text corpus set of a topic; inputting a topic corpus set and a background corpus set; extracting the topic attributes of the topic corpus set; adding emotional polarities to the obtained topic attributes, and vectorizing sentences; taking the obtained topic attributes as evaluation objects, obtaining emotional attribute features contained in the sentences, and conducting feature vectorization on one sentence by means of a topic attribute and emotion analysis method; utilizing an obtained topic attribute set and a text sentence feature vector set S to construct a three-layer graph structure, and clustering all the text sentences; selecting sentences from class clusters to form a viewpoint summary, and selecting the sentences with high scores to form a viewpoint summary. According to the text data viewpoint summary mining method, the extracted topic attributes are more accurate by adopting a topic attribute extraction method, and meanwhile the text data viewpoint summary mining method can be applied not only to the field of Chinese microblogs but also to the field of website news and product reviews.

Description

technical field [0001] The present invention relates to the fields of text summarization and sentiment analysis, and more specifically, relates to a method for generating short opinion summaries with rich user emotional information for massive topic text data of Chinese microblog corpus, and the opinion summaries can accurately cover the text discussed and can be applied to practical application scenarios such as news summaries and commodity review analysis. Background technique [0002] Currently, there are many techniques and methods available for research in the field of opinion summarization. Traditional view summarization models include graph models and ranking models. The representative methods of graph models include Textrank, PageRank, LexRank and other methods. They use sentences as nodes, and a certain relationship between sentences as the weight of edges, and iteratively update and calculate the scores of sentences through the random walk model, so as to realize ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/35G06F40/30
Inventor 廖祥文陈国龙赵楠杨定达
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products