False comment identification method based on rolling type cooperative training

A technology of collaborative training and recognition methods, applied in the field of false review recognition, can solve problems such as time-consuming and laborious, low recognition accuracy, and lack of manual annotation in standard data sets

Pending Publication Date: 2020-09-15
NORTHEAST DIANLI UNIVERSITY
View PDF8 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] (1) The classification method under the fully supervised framework is mainly used. The fully supervised learning method requires a large amount of labeled corpus as a training set. The lack of standard data sets and the time-consuming and laborious problems of manual labeling have brought great challenges to the research under the fully supervised framework. Earth limitations
[0006] (2) Scholars try to use unsupervised learning methods. Unsupervised learning methods solve the problem of missing labeled datasets, but the recognition accuracy is generally low.
[0007] (3) Semi-supervised learning balances the main problems of fully supervised learning and unsupervised learning. However, in the current research on identifying false reviews with semi-supervised methods, simple feature construction is only based on shallow review features. model, ignoring the impact of differences in the combination of different features and classification models on the results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • False comment identification method based on rolling type cooperative training
  • False comment identification method based on rolling type cooperative training
  • False comment identification method based on rolling type cooperative training

Examples

Experimental program
Comparison scheme
Effect test

experiment example

[0129] (1) Data source:

[0130] The present invention obtains the original experimental data set from the yelp review website. The original experimental data set includes multiple fields such as user ID, total number of comments, comment content, comment level, comment time, etc., and a total of 5854 comment texts. And with the help of the false comment filtering system of the yelp review website, the false comments are marked, and the experimental data set is shown in Table 1.

[0131] Table 2 Experimental data set

[0132] type of data Comments User number real comment 5076 4231 fake review 778 743 total number of comments 5854 4974

[0133] (2) Experimental platform

[0134] The algorithm used in this application adopts Win64-bit server operating environment; processor Intel(R) Core(TM) i5-5200UCPU @2.20GHz 2.20GHz; running memory 8G; Python3.7.0 version; gensim3.8.0 version; scikit-learn0.20.1 version; text paragraph vector traini...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a false comment identification method based on rolling type cooperative training, and relates to the technical field of false comment identification. The method comprises the steps: obtaining a comment text, and determining the content features of the comment text according to the comment text; obtaining reviewer information corresponding to the comment text, and determining reviewer behavior characteristics according to the reviewer information; performing false comment identification according to the comment text content characteristics and the reviewer behavior characteristics; outputting false comment recognition results. According to the false comment recognition method, unlabeled samples are effectively utilized to assist model learning, multiple features suchas emotion and text representation are fused at the same time, the recognition performance of the model is improved through multi-feature fusion collaborative training, and the accuracy is improved by 3.5% compared with a traditional false comment recognition method.

Description

technical field [0001] The invention relates to the technical field of false comment identification, in particular to a method for identifying false comments based on rolling collaborative training. Background technique [0002] For online shopping, due to the inconsistency between the online product information and the products received by consumers offline, consumers will read a large number of reviews of target products to assist in judgment. Therefore, commodity reviews affect consumers' purchasing behavior, and also affect the interests of merchants. Positive reviews attract more potential users, while negative reviews drive away potential users. In order to obtain higher profits, unscrupulous merchants usually hire professional writers to write false positive reviews for their own products to attract potential users; to write false negative reviews for competitors to suppress competitors. For example, “Some merchants use the method of “rebate for good reviews” to obt...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9532G06F16/35G06K9/62
CPCG06F16/9532G06F16/35G06F18/241G06F18/25G06F18/214
Inventor 王敬东阚海涛孟凡奇李佳
Owner NORTHEAST DIANLI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products