Weighted classification method for disaster information importance of blog articles based on deep learning and XGBoost algorithm

A technology of deep learning and classification method, applied in the direction of text database clustering/classification, digital data information retrieval, calculation, etc., can solve the problem of slow cycle of incident rescue and achieve the effect of accurate classification performance

Active Publication Date: 2020-04-28
BEIJING UNIV OF TECH
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When natural disasters occur, such as natural disasters such as fires, earthquakes, mountain torrents, tsunamis, mudslides, etc., or emergencies such as shootings and robberies, the cycle of event rescue will be very slow if it is only broadcast through the media

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Weighted classification method for disaster information importance of blog articles based on deep learning and XGBoost algorithm
  • Weighted classification method for disaster information importance of blog articles based on deep learning and XGBoost algorithm
  • Weighted classification method for disaster information importance of blog articles based on deep learning and XGBoost algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention is described in detail below in conjunction with accompanying drawing and embodiment Step (1) receives social media text data and preprocessing

[0042] The training data set used in the present invention is a labeled data set about disaster information officially provided by TREC, which is Twitter text data, but the present invention is not limited to tweets, and can also be applied to other classifications, such as WeChat, Weibo text data on social platforms. The data set contains 30,000 pieces of text data that have been labeled with information categories and warning categories. The information categories and warning categories are the official category tables provided by TREC. There are 25 information categories, including request categories (need help / information, Request for search and rescue), call-to-action (transfer personnel / volunteers, etc.), report (news / weather, etc.), others (discussion / emotion, etc.), each piece of data can contain ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a weighted classification method for disaster information importance of blog articles based on deep learning and XGBoost algorithm. The method mainly comprises the following five steps of (1) receiving and preprocessing social media text data; (2) constructing a disaster word pre-training word vector table for converting social media text data into word vectors; (3) converting the social media text data preprocessed in the step (1) into a word vector with the dimension of d through the updated word vector table in the step (2), and performing information category classification and importance category classification on the social media text data by utilizing an XGBoost algorithm; (4) weighting the information classification result and the alarm classification resultof the social media text data to determine the importance of the contained text information. Based on social media text data, importance weighting functions are added, through dividing information categories and alarm categories of social media text data, the association degree between the information category and the alarm is established, classification is carried out by combining deep learningand an XGBoost algorithm, and the association between the information category and the alarm category is weighted to obtain a final social media text data information importance index. Experimental results show that the information importance classification effect of the social media text data is obviously improved.

Description

technical field [0001] The invention relates to a weighted classification method for disaster information importance of blog posts based on deep learning and XGBoost algorithm, and belongs to the technical field of Internet information classification. Background technique [0002] Social media has become an integral part of human life, such as Twitter, Weibo, etc. When natural disasters occur, such as natural disasters such as fires, earthquakes, flash floods, tsunamis, mudslides, etc., or emergencies such as shootings and robberies, if they are only broadcast through the media, the cycle of event rescue will be very slow. On February 28, 2019, China Internet Network Information Center (CNNIC) released the 43rd "Statistical Report on Internet Development in China" in Beijing. As of December 2018, the number of Internet users in my country reached 829 million, and the penetration rate reached 59.6%. At the end of the year, it increased by 3.8 percentage points, with 56.53 mil...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/9536G06F16/35
CPCG06F16/9536G06F16/35
Inventor 王鹤松杨震
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products