Unlock instant, AI-driven research and patent intelligence for your innovation.

A Storm-based big data preprocessing method and system for railway geological disaster monitoring

A technology of geological disasters and big data, applied in transmission systems, measuring devices, ICT adaptation, etc., can solve the problems of inability to apply complex railway geological disaster monitoring big data processing, low efficiency of serial data processing, low accuracy of monitoring and early warning, etc. Achieve the effect of improving data processing efficiency, satisfying real-time performance and high precision

Active Publication Date: 2022-05-17
CHINA RAILWAY ERYUAN ENG GRP CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to overcome the traditional ETL data processing method existing in the prior art, which can only accurately analyze a single monitoring data, the serial data processing efficiency is low, the data cleaning quality is low, resulting in low monitoring and early warning accuracy, which is not applicable Due to the defects of complex railway geological disaster monitoring big data processing, a storm-based railway geological disaster monitoring big data preprocessing method and system are provided. This invention combines the characteristics of distributed stream computing processing technology, and proposes an open source stream computing framework. The big data preprocessing method for railway geological disaster monitoring uses the storm flow computing framework and distributed parallel computing technology to carry out real-time ETL preprocessing for massive big data of geological disaster monitoring along the railway, and improves the traditional big data ETL method for geological disaster monitoring in terms of data processing efficiency, Limitations in the quality of data cleaning and the like; the method can not only realize ETL rapid preprocessing of railway engineering geological disaster monitoring big data, but also provide high-quality basic data for monitoring and early warning analysis; the method of the present invention is also applicable to other railway engineering fields ETL preprocessing for big data analysis of multiple engineering monitoring points

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Storm-based big data preprocessing method and system for railway geological disaster monitoring
  • A Storm-based big data preprocessing method and system for railway geological disaster monitoring
  • A Storm-based big data preprocessing method and system for railway geological disaster monitoring

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0046] figure 1 Shows the storm-based railway geological disaster monitoring big data preprocessing computing framework of an exemplary embodiment of the present invention, such as figure 1 As shown, the present invention proposes a storm-based railway geological disaster monitoring big data preprocessing method for railway geological disaster monitoring big data preprocessing. The method is based on the open source big data platform Storm flow computing framework, and runs the ETL method in the Storm computing framework Realize real-time parallel preprocessing of massive monitoring big data. Such as figure 1 As shown, the geological disaster monitoring big data ETL preprocessing system provided by the present invention first extracts sensor monitoring flow data in parallel from multiple data sources of the railway geological disaster monitoring platform, and then performs distributed parallel flow data cleaning and distribution of multi-task scheduling Parallel streaming da...

Embodiment 2

[0066] In a further embodiment of the present invention, we design and select the processing algorithm of the ETL tool, and the noise data and missing data in the monitoring data can be effectively screened and filtered through the data cleaning algorithm provided by the present invention, and the monitoring data Perform data repair to provide high-quality data sets for subsequent data analysis and mining; specifically, the data cleaning includes: detection and processing of abnormal data points, detection and processing of periodic noise data points, detection and processing of missing data points deal with:

[0067] 1. Abnormal outlier data filtering and correction processing

[0068] When the monitoring point sensor along the railway is disturbed by external events, it will generate some abnormal data, that is, some isolated point data. Abnormal isolated points affect the accuracy of data analysis or generate false alarms. Therefore, the present invention needs to screen o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a storm-based method and system for preprocessing big data of railway geological disaster monitoring, including: providing a parallel sub-calculation module for each sensor data of each monitoring point based on the open-source stream computing storm framework, and each sub-computation The module realizes ETL processing of railway geological disaster monitoring big data, realizes real-time parallel preprocessing of multiple sensor data at different monitoring points, effectively improves data processing efficiency and data cleaning quality; effectively solves the problem of low efficiency of serial processing in traditional methods , can not meet the processing requirements of monitoring big data, can meet the real-time, timeliness, high precision and other requirements of railway geological disaster monitoring data preprocessing, and is suitable for complex railway geological disaster monitoring big data real-time analysis and early warning application scenarios; at the same time The storm-based ELT processing method provided by the present invention is also applicable to other complex projects with multiple project monitoring points and various monitoring data.

Description

technical field [0001] The invention relates to the application field of equipment monitoring and analysis, in particular to a storm-based method and system for preprocessing big data of railway geological disaster monitoring. Background technique [0002] In the application of big data analysis, the data quality of data preprocessing is a key factor related to the credibility of data analysis and mining. Similarly, the high efficiency and real-time performance of data preprocessing is also very important for many applications. In the application of geological disaster monitoring and early warning along the railway, a large amount of monitoring data collected by sensors reaches the monitoring processing platform in real time in a fast manner. How to realize fast data extraction, data conversion, and data cleaning on the monitoring processing platform faces many challenges. Quality and fast data preprocessing is necessary. The big data of railway geological disaster monitor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G08B21/10G01D21/02G06K9/62H04L67/10
CPCG08B21/10G01D21/02H04L67/10G06F18/23Y02A90/10
Inventor 王珣陆鑫袁焦伏坤金劭南杨科刘勇潘兆马邹文露余博杨俊超杨学锋徐鑫杨森姚书琴裴起帆
Owner CHINA RAILWAY ERYUAN ENG GRP CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More