Unlock instant, AI-driven research and patent intelligence for your innovation.

A Semi-Supervised E-commerce Review Sentiment Analysis Method Based on Tripartite Graph and Cluster Analysis

A technology of cluster analysis and sentiment analysis, applied in the field of sentiment classification of e-commerce review documents

Active Publication Date: 2020-10-30
NANJING SILICON INTELLIGENCE TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, they still lack thinking about the problem of limited labeled data. Li et al. [Two-View Label Propagation to Semi-supervised Reader Emotion Classification] proposed a semi-supervised label propagation algorithm based on two views of news documents and review documents in 2016. Sentiment classification, taking into account the modeling of labeled data and data relationships

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Semi-Supervised E-commerce Review Sentiment Analysis Method Based on Tripartite Graph and Cluster Analysis
  • A Semi-Supervised E-commerce Review Sentiment Analysis Method Based on Tripartite Graph and Cluster Analysis
  • A Semi-Supervised E-commerce Review Sentiment Analysis Method Based on Tripartite Graph and Cluster Analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0066] The specific implementation of the present invention will be further described below in conjunction with accompanying drawing and embodiment, but the implementation and protection of the present invention are not limited to this, it should be pointed out that if there are any processes or parameters not specified in detail below, those skilled in the art It can be realized with reference to the prior art.

[0067] Carry out experimental demonstration below for the inventive method (concrete scheme can be seen in the content of the invention, repeats no more here), specifically includes:

[0068] 1. Experimental settings

[0069] Data set: This embodiment uses the hotel review data set in the Chinese emotion mining corpus ChnSentiCorp [Tan Songbo, Chnsenticorp [Eb / Ol], 2010-06-29, http: / / www.datatang.com / data / 14614.] ChnSentiCorp-Htl-del-4000 and laptop review dataset ChnSentiCorp-NB-del-4000 are used as experimental data, both datasets include 2,000 positive and 2,000 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semi-supervised e-commerce comment sentiment analysis method based on tripartite graphs and cluster analysis. The method comprises the following steps of calculating word similarity based on word vectors in combination with an emotion dictionary and part-of-speech information; introducing a word group mode to add context information, and eliminating the influence of a one-word multi-meaning phenomenon; establishment a word-document-word group tripartite graph taking the document as the center, calculating the similarity among the documents; based on the sample clustering hypothesis, mining cluster structure distribution in the corpus to obtain global information of the corpus; carrying out weighted fusion on the global information of the corpus and the similarityinformation in the three graphs to obtain a relation graph of the final sample; and executing a label propagation algorithm according to the relational graph, and propagating the labels with the labeled samples to the unlabeled samples to realize the emotion classification of the unlabeled samples. According to the method, the global information and the similarity information in the tripartite graph are subjected to weighted fusion, a high-quality sample relation graph model is obtained on the basis of combining the characteristics of the comment corpus, and a better emotion classification effect can be achieved.

Description

technical field [0001] The invention relates to the field of document classification of natural language processing technology, in particular to a semi-supervised method based on a tripartite graph and a cluster analysis method for sentiment classification technology of e-commerce review documents. Background technique [0002] With the rapid development of the Internet, a large number of user comments have been generated on online platforms such as e-commerce websites, and the emotional information contained in the comments can not only help other users make better purchasing decisions, but also facilitate merchants to track and manage Consumer Feedback Information. Therefore, how to automatically classify the sentiment of user review documents has gradually become a research topic that has attracted more and more attention in the field of natural language processing. [0003] Document sentiment classification methods can be mainly divided into unsupervised learning, super...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/289G06F16/35G06Q30/02
CPCG06Q30/0218G06F40/289
Inventor 卢昕薛云吴海明
Owner NANJING SILICON INTELLIGENCE TECH CO LTD