A text data opinion summarization mining method that integrates topic attributes and sentiment information

A technology of text data and topics, applied in the field of sentiment analysis and text summarization, it can solve the problems of integrating emotional information without considering different emotions, etc.

Active Publication Date: 2022-03-08
FUZHOU UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage is that most of the existing graph models consider using text sentences and topic features to construct graph structures, and describe the emotional information of opinion summaries through the emotional information of the entire text sentence, without integrating the emotional information of topic attributes in the graph structure, there is no Considering that the topic features of different emotions are two subjects with different meanings, resulting in sentences containing different emotional topic attributes being associated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A text data opinion summarization mining method that integrates topic attributes and sentiment information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0021] A text data opinion summary mining method that integrates topic attributes and emotional information, which includes the following steps: Step S1: preprocess the text corpus of the topic, and clean up some irrelevant words; Step S2: input the topic corpus and background Corpus; Step S3: Use the log likelihood ratio method to extract the topic attributes of the topic corpus; Step S4: Add the topic attribute obtained in Step S3 to the emotional polarity, and the emotional polarity includes positive emotion and negative emotion, thus positive Topic attributes and negative topic attributes are used as emotional attribute features to vectorize sentences; step S5: use the topic attributes obtained in step S3 as evaluation objects, and use the multi-evaluation object-oriented dynamic word sequence sentiment analysis method to analyze the evaluat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a text data viewpoint summary mining method that combines topic attributes and emotional information, including: preprocessing the text corpus of the topic; inputting the topic corpus and the background corpus; extracting the topic attributes of the topic corpus; and obtaining Add emotional polarity to the topic attribute of the sentence and vectorize the sentence; use the obtained topic attribute as the evaluation object to obtain the emotional attribute characteristics contained in the sentence, and use the topic attribute and sentiment analysis method to perform feature vectorization on a sentence; use the obtained topic The attribute set and text sentence feature vector set S construct a three-layer graph structure to cluster all text sentences; select sentences from clusters to form opinion summaries, and select sentences with high scores to form opinion summaries. The invention makes the topic attributes extracted by the method of extracting topic attributes more accurate, and also makes it not only applicable to the field of Chinese microblog, but also applicable to the fields of website news and commodity reviews.

Description

technical field [0001] The present invention relates to the fields of text summarization and sentiment analysis, and more specifically, relates to a method for generating short opinion summaries with rich user emotional information for massive topic text data of Chinese microblog corpus, and the opinion summaries can accurately cover the text discussed and can be applied to practical application scenarios such as news summaries and commodity review analysis. Background technique [0002] Currently, there are many techniques and methods available for research in the field of opinion summarization. Traditional view summarization models include graph models and ranking models. The representative methods of graph models include Textrank, PageRank, LexRank and other methods. They use sentences as nodes, and a certain relationship between sentences as the weight of edges, and iteratively update and calculate the scores of sentences through the random walk model, so as to realize ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/30
CPCG06F16/35G06F40/30
Inventor 廖祥文陈国龙赵楠杨定达
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products