Data outlier identification method and system for automatic environment monitoring network

An automatic monitoring and data identification technology, which is applied in transmission systems, electrical digital data processing, special data processing applications, etc., can solve the problems of lack of statistical algorithms, insufficient comprehensiveness and reliability, difficult inspection and identification, etc., to improve work efficiency, The effect of preventing data loss failure and reducing the error rate

Active Publication Date: 2018-06-05
广东省环境监测中心
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method can effectively prevent outliers from being included in data statistics, but because there is no vertical tracking of the causes of outliers (such as instrument quality control and external environmental factors, etc.) when tracing the source of quality control work, platform data managers cannot track them. Judging the true state of data quality
In order to solve this problem, data management personnel need to query the work log of the base station and ask the base station maintenance personnel. The traceability of quality control work is inefficient and it is easy to make mistakes due to inquiries
[0006] (2) Missing or inconsistent sources of identification
[0008] However, there is still no relevant technology to unify the four types of data identification. As a result, data reviewers at different levels often only consider data identification from one source, which is not comprehensive and reliable enough.
[0009] (3) Lack of statistical algorithms in the real-time data platform for automatic environmental monitoring
However, most of the environmental automatic monitoring data do not conform to these distributions, and it is difficult to directly use these research results to test and identify
Therefore, the real-time data platform for automatic environmental monitoring lacks statistical algorithms that can check and audit the reliability and accuracy of statistical data
[0011] In addition, the current outlier identification technology directly deletes the outliers of the monitoring data, and it is difficult to accommodate the method of statistical theory to identify outliers
Because statistical theory can calculate the abnormal value in the monitoring data, but it does not mean that the abnormal value is absolutely wrong in the real environment. If the data calculated as abnormal is directly deleted, it will not respect the objective reality, and it is not scientific and accurate enough.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data outlier identification method and system for automatic environment monitoring network
  • Data outlier identification method and system for automatic environment monitoring network
  • Data outlier identification method and system for automatic environment monitoring network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0123] This embodiment introduces the data structure of the data identifier in the present invention.

[0124] In consideration of the integrity and relativity of the data review work of the environmental monitoring network, the present invention re-unifies the definition of the environmental monitoring network through the four sources of automatic identification of the base station system, manual review of the base station, expert identification of the real-time data platform, and manual review of the platform. A data identification system to improve data review efficiency and reduce error rates.

[0125] The data identification of the present invention includes:

[0126] (1) Identification of source type

[0127] In order to overcome the defect that the traditional data identification system does not distinguish the source of the data identification, the invention defines the source type identification, which can well distinguish the source of each data identification, faci...

no. 2 example

[0177] The environment automatic monitoring network data identification system of the present invention is divided into a base station layer, a base station-platform data identification synchronous transmission layer, and a platform layer according to the flow of data identification generation.

[0178] The base station layer includes a base station data identification knowledge base, a base station database, a data identification update module, and a base station synchronization database. The base station database saves all the original data and data identification records of the base station monitoring system. The base station data identification knowledge base provides the knowledge storage, extraction, retrieval and application services of the environmental automatic monitoring network data identification; the data identification update module can call the knowledge of the data identification knowledge base, diagnose the data of the base station monitoring system and assign...

Embodiment 3

[0182] This embodiment describes the outlier identification technology at the base station layer.

[0183] The base station environmental monitoring system is the source of all data in the monitoring network, and it is also the first line of defense for data review. It integrates the technical content of abnormal value identification at the base station layer, including automatic identification of base station system abnormal values ​​and manual identification of base station, which is the most important part of data review. , the implementation process of the base station layer abnormal value identification in the present invention is:

[0184] 1. The base station monitoring system recognizes an abnormal data signal, which can come from the automatic perception of the system or the manual input of the base station guards.

[0185] 2. The base station monitoring system converts the monitoring data into data signals that can be recognized by the computer.

[0186] 3. The data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a system of identifying data abnormal values of an environment automatic monitoring network. The method comprises the following steps of: carrying out unified definition on a data identification system of the environment automatic monitoring network according to four data sources including automatic identification of a base station system, manual examination and verification of the base station, specialist identification of a real-time data platform and manual examination and verification of the platform so as to obtain data structures of a source type identifier, a mode identifier, a mode tracking data identifier, a deduction identifier and a deleting identifier; and identifying abnormal values of a base station layer and a platform layer according to the data structures of the source type identifier, the mode identifier, the mode tracking data identifier, the deduction identifier and the deleting identifier, and adopting an improved data identifier synchronization algorithm to implement synchronization of the abnormal value of the base station layer and the abnormal value of the platform layer. The mode tracking data is added and the deduction identifier is identified, and the abnormal value identification algorithm of the real-time data platform is provided, so that the method and the system have the advantages of high traceability efficiency, high precision, completeness, reliability, science and accuracy, and can be widely applied to the field of environment monitoring.

Description

technical field [0001] The invention relates to the field of environmental monitoring, in particular to a data abnormal value identification method and system for an automatic environmental monitoring network. Background technique [0002] The environmental automatic monitoring network is an important data source for the scientific characterization of environmental quality. Its purpose is to obtain high-quality environmental monitoring data and infer the existing quality characteristics of the entire environment. In addition to the elements of general environmental monitoring, the automatic environmental monitoring network also has the characteristics of long-term real-time uninterrupted monitoring of base stations, synchronous docking of data between base stations and real-time data platforms, and automatic analysis and statistics of massive data by real-time data platforms. According to the unique technical characteristics of the automatic environmental monitoring network,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04L29/08G06F17/30
Inventor 黎如昊肖文向运荣张苒
Owner 广东省环境监测中心
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products