Commodity comment data labeling system and method based on hierarchical AP clustering

A technology of AP clustering and data labeling, applied in the field of commodity review data labeling system, can solve the problems of underutilization of valuable review information, inability to fully utilize evaluation orientation, irregularity and redundancy of review data, etc.

Active Publication Date: 2018-01-26
WUYI UNIV
View PDF5 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, with the advent of the era of big data, the amount of review data has increased rapidly, and the review data itself has characteristics such as irregularity and redundancy, making it difficult to obtain useful review information.
The number of comments on e-commerce websites is growing exponentially, and these comments have different styles, and there is no standard model for language structure to follow... Faced with a huge number of comments, it is difficult to mine user comments only by manual analysis and processing. time consuming and inefficient
[0005] In major e-commerce websites such as JD.com, Taobao, and Yihaodian, some user reviews have been categorized to provide a more intuitive and convenient experience for customer inquiries through label classification, but the disadvantage is that these labels Pre-set by merchants, most of them are positive labels, which cannot fully reflect customers' comments on products; in addition, labels are chosen by reviewers themselves, but it can be found through statistical data that only about 10% of reviewers are willing to put a label on the review. labels, and 90% of the review data has no label information, which makes it impossible to make full use of the evaluation orientation implied by each review
Overall, the comment labeling technology in practical applications is still in the artificial stage, and a large amount of valuable comment information has not been fully utilized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Commodity comment data labeling system and method based on hierarchical AP clustering
  • Commodity comment data labeling system and method based on hierarchical AP clustering
  • Commodity comment data labeling system and method based on hierarchical AP clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The idea, specific structure and technical effects of the present invention will be clearly and completely described below in conjunction with the embodiments and accompanying drawings, so as to fully understand the purpose, scheme and effect of the present invention. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The same reference numbers are used throughout the drawings to indicate the same or similar parts.

[0030] figure 1 Shown is a system structure diagram of a tagging system disclosed by the present invention. According to an embodiment of the present invention, a product review data labeling system based on hierarchical AP clustering includes a data capture module, a word vector training module, a feature information extraction module, and a feature information labeling module. The data capture module obtains the comment data of the target pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a commodity comment data labeling system based on hierarchical AP clustering. The system includes a data capturing module, a word vector training module, a feature information extraction module and a feature information labeling module. The data capturing module stores corpus information and comment data. The word vector training module obtains a training corpus set. The feature information extraction module obtains a feature information set corresponding to the comment data. The feature information labeling module obtains the comment data labelled result after clustering. The beneficial effects of the invention are as follows: the commodity comment data labeling system and method based on hierarchical AP clustering are provided, the purpose of automatically labelingcomment data is achieved, the value orientation of the feature information can be mined and presented in the form of labels to merchants and customers, support is provided for subsequent data analysis, and companies and consumers are further provided with a convenient, scientific and intuitive tool for obtaining useful comment information.

Description

technical field [0001] The invention relates to the fields of computer data processing and data mining, in particular to a system and method for labeling commodity review data based on hierarchical AP clustering. Background technique [0002] In recent years, the vigorous development of e-commerce has led more and more people to choose the consumption mode of online shopping, and the transformation of consumption patterns has also brought new opportunities and challenges to enterprises and merchants. In the virtual environment of online shopping, a large number of research and survey results show that online reviews are the most important factor for consumers to make purchase decisions. At the same time, online reviews as feedback data can also help companies improve products, understand user needs, and enhance competition. strength and reputation. According to Nielsen’s survey data in the first half of 2014, 70%-80% of consumers will check product reviews as important refe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27G06K9/62G06Q30/02
Inventor 彭敏晶张朕轩唐晨馨李运蒙
Owner WUYI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products