Microblog rumor detection algorithm based on gradient boosting trees and a rumor detection feature set

A microblog rumor detection technology based on gradient boosting trees and a detection feature set, applied in computing and special data processing applications. It addresses the insufficient detection accuracy of existing algorithms and achieves the effect of improving detection accuracy.

Status: Inactive; Publication Date: 2018-11-06
UNIV OF ELECTRONICS SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

[0003] The detection accuracy of existing microblog rumor detection algorithms is not high enough, especially in the early stage after a rumor is released. This is an important shortcoming of the existing algorithms.

Examples

Embodiment Construction

[0024] The invention disclosed here comprises two parts: a rumor detection algorithm based on gradient boosting trees, and a feature set for rumor detection.

[0025] The overall flow chart of the microblog rumor detection algorithm based on gradient boosting trees is shown in Figure 1. The specific implementation of the present invention is described in detail below with reference to the accompanying drawings.

[0026] 1. Data processing

[0027] This part corresponds to S1 in Figure 1; see Figure 2 for the detailed flow chart.

[0028] S1.1: Extract features

[0029] Feature extraction is performed on the original data, and the values of the features in the rumor detection feature set are extracted. The feature set is shown in Table 1.

[0030] S1.2: Set label

[0031] For a sample x_i (1≤i≤N), if it is a rumor, set its label y_i to 1; otherwise, s...
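Steps S1.1 and S1.2 can be sketched roughly as follows. This is only an illustration under stated assumptions: the record layout (`features`, `is_rumor`), the feature names, and the function `build_training_set` are hypothetical, and because the source text is truncated before it states the non-rumor label, the value 0 used here is an assumption exposed as a parameter.

```python
from typing import Any, Dict, List, Sequence, Tuple


def build_training_set(
    raw_samples: Sequence[Dict[str, Any]],
    feature_names: Sequence[str],
    non_rumor_label: int = 0,  # ASSUMPTION: the source is truncated before this value
) -> Tuple[List[List[float]], List[int]]:
    """Turn raw records into feature vectors x_i and labels y_i.

    Each record is assumed (hypothetically) to hold a dict of already
    computed feature values and a boolean rumor flag.
    """
    X: List[List[float]] = []
    y: List[int] = []
    for sample in raw_samples:
        # S1.1: read the value of every feature in the feature set, in a fixed order.
        X.append([float(sample["features"][name]) for name in feature_names])
        # S1.2: label rumors with 1, as stated in the text; everything else
        # gets the assumed non-rumor label.
        y.append(1 if sample["is_rumor"] else non_rumor_label)
    return X, y


# Hypothetical feature names; the real 23-feature set is given in Table 1.
FEATURES = ["repost_count", "has_url"]
samples = [
    {"features": {"repost_count": 120, "has_url": 0}, "is_rumor": True},
    {"features": {"repost_count": 3, "has_url": 1}, "is_rumor": False},
]
X, y = build_training_set(samples, FEATURES)
```

Keeping the feature order fixed by `feature_names` means the same list can be reused when extracting features from a microblog to be predicted.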

Abstract

The invention discloses a microblog rumor detection algorithm based on gradient boosting trees and a rumor detection feature set. The provided rumor detection feature set contains 23 features. In the provided algorithm, training samples are first constructed from the features in the feature set and used to train a microblog rumor detection model. The training sample set is then trained on repeatedly to obtain multiple regression tree models; each regression tree gives a predicted value, and the predicted values of the regression trees are combined to obtain the final microblog rumor detection model. During rumor detection, the features of a microblog to be predicted are extracted according to the feature set, the detection model computes a predicted value for that microblog, and the predicted value is used to judge whether the microblog is a rumor or a non-rumor. Compared with existing microblog rumor detection algorithms, the provided algorithm and feature set achieve higher rumor detection precision; in particular, in the early period after a rumor is released, detection precision is significantly higher than that of existing algorithms.
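The train-then-combine flow described in the abstract can be illustrated with a minimal gradient boosting sketch. It is not the patent's exact procedure: the squared-error loss, depth-1 regression trees (stumps), the learning rate, the 0.5 decision threshold, the 0/1 label encoding, and the two toy features are all assumptions made for the example.

```python
from typing import List, Tuple

Stump = Tuple[int, float, float, float]  # (feature index, threshold, left value, right value)
Model = Tuple[float, float, List[Stump]]  # (base prediction, learning rate, stumps)


def fit_stump(X: List[List[float]], r: List[float]) -> Stump:
    """Fit a depth-1 regression tree to residuals r by minimizing squared error."""
    mean_r = sum(r) / len(r)
    best: Stump = (0, float("inf"), mean_r, mean_r)  # fallback: constant prediction
    best_err = float("inf")
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0  # candidate split between two adjacent values
            left = [ri for row, ri in zip(X, r) if row[j] <= t]
            right = [ri for row, ri in zip(X, r) if row[j] > t]
            lv, rv = sum(left) / len(left), sum(right) / len(right)
            err = sum((ri - lv) ** 2 for ri in left) + sum((ri - rv) ** 2 for ri in right)
            if err < best_err:
                best_err, best = err, (j, t, lv, rv)
    return best


def stump_predict(stump: Stump, x: List[float]) -> float:
    j, t, lv, rv = stump
    return lv if x[j] <= t else rv


def train_gbt(X: List[List[float]], y: List[int],
              n_trees: int = 20, learning_rate: float = 0.5) -> Model:
    """Train regression trees one after another, each on the current residuals."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps: List[Stump] = []
    for _ in range(n_trees):
        # For squared-error loss the negative gradient is simply the residual.
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(X, residuals)
        stumps.append(stump)
        pred = [pi + learning_rate * stump_predict(stump, xi) for pi, xi in zip(pred, X)]
    return base, learning_rate, stumps


def predict_value(model: Model, x: List[float]) -> float:
    """Combine the predicted values of all regression trees."""
    base, lr, stumps = model
    return base + sum(lr * stump_predict(s, x) for s in stumps)


def is_rumor(model: Model, x: List[float], threshold: float = 0.5) -> bool:
    return predict_value(model, x) >= threshold


# Toy data with two hypothetical features; label 1 = rumor, 0 = non-rumor (assumed).
X_train = [[0.9, 3.0], [0.8, 5.0], [0.1, 40.0], [0.2, 35.0]]
y_train = [1, 1, 0, 0]
model = train_gbt(X_train, y_train)
print(is_rumor(model, [0.85, 4.0]), is_rumor(model, [0.15, 38.0]))
```

Each tree only corrects what the previous trees got wrong, so the final predicted value is the base prediction plus the scaled sum of every tree's output; a production system would more likely rely on an established gradient boosting library than on hand-written stumps.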

Description

Technical field

[0001] The invention relates to the technical field of microblog rumor detection, in particular to a microblog rumor detection algorithm based on gradient boosting trees and a rumor detection feature set.

Background

[0002] The diversity of information, freedom of speech, and explosive speed of dissemination on Weibo encourage the generation and spread of rumors, making Weibo an ideal place for false news to circulate. Rumor detection algorithms have emerged in response, with the aim of detecting rumors and stopping their spread in time.

[0003] The detection accuracy of existing microblog rumor detection algorithms is not high enough, especially in the early stage after a rumor is released. This is an important shortcoming of existing microblog rumor detection algorithms.

Contents of the invention

[0004] Aiming at the shortcomings of existing microblog rumor detection algorithms, the present invention pr...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06Q50/00
CPC: G06Q50/01
Inventor: 杨波, 熊枭
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA