An LSTM-based microblog rumor detection and resource library construction method

A construction method and resource library technology, applied in network data retrieval, semantic tool creation, other database retrieval, etc., can solve problems such as ignoring comment text and comment emotional tendency, incomplete construction of Weibo rumor resource library, etc., to achieve good detection The effect of the result

Pending Publication Date: 2019-05-28
NANJING UNIV OF SCI & TECH
View PDF1 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide an LSTM-based microblog rumor detection and resource construction method, aiming to solve the defects of the traditional method of microblog rumor resource library construction is not comprehensive and rumor detection ignores comment text and comment emotional tendency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An LSTM-based microblog rumor detection and resource library construction method
  • An LSTM-based microblog rumor detection and resource library construction method
  • An LSTM-based microblog rumor detection and resource library construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] According to a preferred embodiment of the present invention, the establishment of a microblog rumor resource library is realized based on Selenium technology, and the method based on LSTM is used to detect rumor microblogs. It is mainly divided into six stages, which will be described in further detail below.

[0043] In order to achieve the above object, the technical scheme adopted in the present invention is as follows:

[0044] 1) The crawling method of rumor microblog data, specifically:

[0045] A) Log in to Sina Weibo through crawler technology, enter the false information reporting section of the Weibo community management center, select a specific Weibo that has been reported as false information, enter the original text interface of the Weibo, and crawl The original text information of the Weibo, including the original text content, release time, etc.

[0046] B) Simulate clicking on the profile picture of the microblog user to enter the user interface to c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an LSTM-based microblog rumor detection and resource library construction method. The method comprises the steps of 1, crawling rumor data by using a crawler technology; Step 2, crawling non-rumor data by utilizing the crawler technology; Step 3, performing integrated storage on the crawled data, and constructing a resource library; Step 4, carrying out data labeling on themicroblogs and comments thereof; Step 5, preprocessing the acquired data, including word segmentation, stop word removal and feature extraction; And 6, constructing an LSTM-based model, and sending the context sequence and the target text into the model to obtain a classification result. According to the method provided by the invention, various data including microblog comments and microblog user information are covered, the data types are more diversified, and the influence of microblog heat on data acquisition can be better reduced. Compared with an existing technology for classifying target texts, the LSTM model based on sequence labeling can make full use of context information and the emotional tendency of comments, so that a better microblog rumor detection result is obtained.

Description

technical field [0001] The invention relates to the fields of web crawler and natural language processing, in particular, a method for constructing a resource library using web crawler technology and implementing rumor classification using deep learning technology. Background technique [0002] With the rise of platforms such as Twitter and Sina Weibo, information in today's society is being generated and disseminated at an unprecedented speed. But at the same time, the rumors caused by the widespread dissemination of Weibo information have become more and more obvious, which has attracted the attention of the government and relevant research institutions. Rumor information in social media such as Weibo is affecting the dissemination of normal network information and the development of social relationships, and preventing and controlling the spread of rumors has become an important part of supervision. In this situation, research on efficient and feasible microblog rumor de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/951G06F16/35G06F16/36G06F17/27
Inventor 夏睿周尧
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products