Multi-source heterogeneous ecological environment big data processing method and system based on data lake

An ecological environment and data technology, applied in the multi-source heterogeneous big data processing method and its system field, can solve the problems of high storage cost, lack of unified data specification, and low total amount of ecological environment data openness, so as to reduce the storage cost. Cost, the effect of improving access and analysis efficiency

Pending Publication Date: 2020-07-28
INST OF URBAN ENVIRONMENT CHINESE ACAD OF SCI
View PDF8 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The main difficulties are: 1) Data interoperability: the source of ecological environment big data covers almost all government functional departments, these departments are not connected to each other, and data often exists in the form of "data islands"
2) The problem of data standardization: data not only exists in a single structured form, but more data is presented in semi-structured and unstructured forms, lacking a unified data specification, and there are a large number of heterogeneous data
3) Data storage cost and operational performance issues: The storage of ecological environment big data in databases or data warehouses often brings high storage costs, and at the same time seriously restricts the speed of data processing
4) The problem of data openness: the total amount of ecological environment data openness is low, most of which are static data, and are concentrated in cities with developed economies, government informatization foundations, and well-developed IT industries

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source heterogeneous ecological environment big data processing method and system based on data lake
  • Multi-source heterogeneous ecological environment big data processing method and system based on data lake
  • Multi-source heterogeneous ecological environment big data processing method and system based on data lake

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings of the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0041] Such as figure 1 As shown, it is a schematic diagram of the architecture of the multi-source heterogeneous ecological environment big data integration system based on the data lake in the present invention.

[0042] The present invention aims to provide a multi-source heterogeneous ecological environment big data integration system based on a data lake, including an ecological environment data collection layer L1, an ecological environmen...

Embodiment 2

[0075] Such as figure 2 As shown, it is a flow chart of a preferred embodiment of a data lake-based ecological environment big data management method in the present invention.

[0076] In this embodiment, the realization of the multi-source heterogeneous ecological environment big data management method based on the data lake includes steps S1-S5. In the following, the multi-source heterogeneous ecological environment big data management method based on the data lake will be combined with specific ecological environment data. The implementation is described in detail:

[0077] Step S1, the ecological environment data collection module L1 event-driven fully managed automatic collection of various structural ecological environment raw data from each data source, the data includes ecological environment data and metadata, and is stored in the original data pool in the data lake; the original The environmental data in the data pool is the original ecological environment data col...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a multi-source heterogeneous ecological environment big data processing system and method based on a data lake. The system comprises an ecological environment data acquisition layer, an ecological environment data cleaning layer, an ecological environment data storage layer, an ecological environment data processing layer and an ecological environment data management layer.The ecological environment data acquisition layer is used for acquiring original data of the ecological environment; the ecological environment data cleaning layer is used for preprocessing and standardizing the data acquired by the ecological environment data acquisition layer; the ecological environment data storage layer is used for carrying out classified and layered storage on the data transmitted by the ecological environment data cleaning layer; the ecological environment data processing layer is used for integrally processing the stream batch ecological environment data; and the ecological environment data management layer is used for monitoring the ecological environment data acquisition, cleaning, storage and processing processes. The access and analysis efficiency of the environmental data can be improved, and the storage cost is greatly reduced.

Description

technical field [0001] The invention relates to the field of computer big data processing, in particular to a data lake-based multi-source heterogeneous big data processing method and system thereof. Background technique [0002] In recent years, with the rapid development of technologies such as the Internet of Things, remote sensing, cloud computing, and mobile smart devices, ecological environment data has shown a blowout growth. On the whole, ecological environment big data can be divided into four categories: 1) basic support data: basic geography, remote sensing images, climate and meteorological data; 2) natural ecological data: farmland ecosystem, forest ecosystem, grassland ecosystem, 3) Environmental monitoring data: data on water environment, atmospheric environment, soil environment, noise environment, nuclear radiation environment, etc.; 4) Humanities and social data: economic development, infrastructure, Data on energy consumption, public participation, online...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/21G06F16/215G06F16/2457G06F16/25G06F16/28G06N7/00G06N20/10G06Q50/26
CPCG06F16/214G06F16/215G06F16/24573G06F16/254G06F16/285G06Q50/26G06N20/10G06N7/01
Inventor 李楠汪鹏陈伟强
Owner INST OF URBAN ENVIRONMENT CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products