
Big data acquisition method

A data collection technology for big data, applied in electronic digital data processing, structured data retrieval, special data processing applications, etc. It improves quality and efficiency and keeps the processing logic clear and simple.

Pending Publication Date: 2021-01-26
ZHUHAI XINDEHUI INFORMATION TECH

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a data collection method for big data. It addresses two shortcomings of the data collection methods currently on the market: they cannot perform unified data processing, and they do not lend themselves to arranging public processes such as data reconciliation and data quality monitoring. The method processes data through a unified real-time streaming engine, which keeps the processing logic clear and simple and makes it easier to arrange public processes such as data reconciliation and data quality monitoring in a unified way.



Examples


example 1

[0084] Example 1: Drag and drop a "data quality detection" component and configure it to fetch a limited amount of source data (for example, 1000 records) for analysis. The component checks whether the selected fields of the table are primary keys, whether they are empty, and so on; the detection rules can also be extended. Finally, it generates a profiling report to evaluate the quality of the data source.
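The profiling step in Example 1 could look roughly like the following. This is a minimal sketch and not the patented component: the function name `profile_sample`, the report keys, and the rule that a primary-key candidate is a field that is never empty and never repeated within the sample are all assumptions made for illustration.

```python
from itertools import islice
from typing import Any, Dict, Iterable, List


def profile_sample(
    rows: Iterable[Dict[str, Any]],
    fields: List[str],
    limit: int = 1000,
) -> Dict[str, Dict[str, Any]]:
    """Profile at most `limit` rows and report per-field statistics.

    Hypothetical helper: assumes every field value is a hashable scalar.
    """
    # Only a bounded sample is read, mirroring "limited source data".
    sample = list(islice(rows, limit))
    report: Dict[str, Dict[str, Any]] = {}
    for field in fields:
        values = [row.get(field) for row in sample]
        # Treat both None and the empty string as "empty".
        non_empty = [v for v in values if v is not None and v != ""]
        empty_count = len(values) - len(non_empty)
        distinct_count = len(set(non_empty))
        report[field] = {
            "rows_sampled": len(sample),
            "empty_count": empty_count,
            "distinct_count": distinct_count,
            # Assumed rule: usable as a primary key only if never empty
            # and never repeated within the sample.
            "primary_key_candidate": (
                bool(sample)
                and empty_count == 0
                and distinct_count == len(non_empty)
            ),
        }
    return report
```

Because the check runs on a sample only, a positive `primary_key_candidate` result is evidence rather than proof of uniqueness in the full table.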

example 2

[0085] Example 2: Drag and drop a "data quality detection" component to detect and analyze the data pushed to Kafka. Configure the corresponding rules (for example, load the data standards of a specific industry), detect whether the data conforms to those standards, and generate the corresponding data quality report.
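As an illustration of the rule-based check in Example 2, the sketch below validates a batch of already-received messages against configurable rules. It makes several assumptions: messages arrive as JSON-encoded bytes, the Kafka consumer itself is out of scope and replaced by a plain iterable, and the names `check_messages` and `EXAMPLE_RULES`, along with the two sample rules, are hypothetical stand-ins for an industry standard.

```python
import json
from typing import Any, Callable, Dict, Iterable

# A rule takes one decoded record and returns True if the record conforms.
Rule = Callable[[Dict[str, Any]], bool]

# Hypothetical rules standing in for an industry data standard.
EXAMPLE_RULES: Dict[str, Rule] = {
    "currency_is_three_uppercase_letters": lambda r: (
        isinstance(r.get("currency"), str)
        and len(r["currency"]) == 3
        and r["currency"].isalpha()
        and r["currency"].isupper()
    ),
    "amount_is_non_negative": lambda r: (
        isinstance(r.get("amount"), (int, float)) and r["amount"] >= 0
    ),
}


def check_messages(
    messages: Iterable[bytes], rules: Dict[str, Rule]
) -> Dict[str, Any]:
    """Apply every rule to every message and count the violations."""
    report: Dict[str, Any] = {
        "total": 0,
        "malformed": 0,
        "failures": {name: 0 for name in rules},
    }
    for raw in messages:
        report["total"] += 1
        try:
            record = json.loads(raw)
        except ValueError:  # covers invalid JSON and invalid UTF-8
            report["malformed"] += 1
            continue
        if not isinstance(record, dict):
            # Assumption: only JSON objects are valid records.
            report["malformed"] += 1
            continue
        for name, rule in rules.items():
            if not rule(record):
                report["failures"][name] += 1
    return report
```

Keeping the rules in a dictionary means a different standard can be loaded by swapping the mapping, without touching the checking loop.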

[0086] In the present invention, the pre-acquisition processor shields the differences between data source types, so that data can be processed through a unified real-time stream engine with clear and simple processing logic. At the same time, public processes such as data reconciliation and data quality detection can be conveniently arranged in a unified way, which improves the quality and efficiency of the entire system.
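One way to picture how a pre-acquisition layer can hide source-type differences is an adapter per source that emits a common record shape, followed by a single processing path. The sketch below assumes two illustrative source types (CSV text and JSON lines); the names `from_csv`, `from_json_lines`, and `unified_pipeline`, the record layout, and the key-lowercasing step are all hypothetical and not taken from the patent.

```python
import csv
import io
import json
from itertools import chain
from typing import Any, Dict, Iterable, Iterator

# Assumed common shape: {"source": <label>, "payload": <dict of fields>}.
Record = Dict[str, Any]


def from_csv(text: str, source: str) -> Iterator[Record]:
    """Access component for static, pulled data held as CSV text."""
    for row in csv.DictReader(io.StringIO(text)):
        yield {"source": source, "payload": dict(row)}


def from_json_lines(lines: Iterable[str], source: str) -> Iterator[Record]:
    """Access component for pushed data, one JSON object per line."""
    for line in lines:
        if line.strip():  # skip blank lines
            yield {"source": source, "payload": json.loads(line)}


def unified_pipeline(*sources: Iterable[Record]) -> Iterator[Record]:
    """Single processing path shared by every source.

    The only processing done here is lower-casing field names; real
    logic would replace this step.
    """
    for record in chain(*sources):
        payload = {str(k).lower(): v for k, v in record["payload"].items()}
        yield {"source": record["source"], "payload": payload}
```

Supporting another source type then means writing one more adapter that yields the same record shape; the downstream pipeline stays unchanged.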



Abstract

The invention discloses a big data acquisition method comprising the following steps: S1, uniformly processing different types of data based on a visual process arrangement platform; S11, constructing a design-state platform; S12, customizing different access components for different types of data; S13, presetting the access components; S2, uniformly arranging the public processes of data reconciliation and data quality detection based on the visual process arrangement platform; S21, arranging a data reconciliation component and a data reconciliation process at a node capable of actively acquiring source data and destination data; and S22, arranging a data quality detection component and process at a node capable of actively pulling data. According to the invention, a pre-acquisition processor shields the differences between data source types, so the data can be processed by a unified real-time stream engine and the processing logic is relatively clear and simple; meanwhile, public processes such as data reconciliation and data quality detection are conveniently and uniformly arranged, and the quality and efficiency of the whole system are improved.
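The abstract does not say how reconciliation in step S21 is computed. A common approach, shown here purely as an assumed sketch, is to compare source and destination records by key and by content digest. The names `reconcile` and `_digest`, the use of SHA-256 over canonical JSON, and the report fields are illustrative choices, and the sketch assumes keys are unique and mutually comparable.

```python
import hashlib
import json
from typing import Any, Dict, Iterable


def _digest(record: Dict[str, Any]) -> str:
    """Stable content hash: canonical JSON with sorted keys, then SHA-256."""
    canonical = json.dumps(record, sort_keys=True, default=str)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def reconcile(
    source: Iterable[Dict[str, Any]],
    destination: Iterable[Dict[str, Any]],
    key: str,
) -> Dict[str, Any]:
    """Compare two record sets by key and by content digest.

    Assumes `key` is present in every record and unique on each side.
    """
    src = {r[key]: _digest(r) for r in source}
    dst = {r[key]: _digest(r) for r in destination}
    return {
        "source_keys": len(src),
        "destination_keys": len(dst),
        # Present at the source but never arrived.
        "missing_in_destination": sorted(src.keys() - dst.keys()),
        # Present at the destination without a source counterpart.
        "unexpected_in_destination": sorted(dst.keys() - src.keys()),
        # Same key on both sides but different content.
        "mismatched": sorted(
            k for k in src.keys() & dst.keys() if src[k] != dst[k]
        ),
    }
```

Both sides must be readable on demand for this comparison to work, which is consistent with S21 placing the reconciliation component at a node that can actively acquire source data and destination data.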

Description

Technical field

[0001] The present invention relates to the technical field of data collection, and more specifically to a data collection method for big data.

Background art

[0002] Data collection is an important part of a big data governance platform. Such a platform faces a wide variety of data sources in diverse forms. When collecting data, it must monitor and receive high-speed real-time data streams and also actively pull massive volumes of static data; it must process regular structured data and also handle large amounts of semi-structured and unstructured data.

[0003] The data collection methods currently on the market have the following deficiencies when accessing data: 1) The collection and processing flows are organized separately for different scenarios, so unified data processing cannot be performed and the processing logic is relatively complicated; 2) It is not conducive to data reconciliatio...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/835, G06F16/838, G06F16/27
CPC: G06F16/8358, G06F16/838, G06F16/27
Inventor: 龚波苏学武水军杨刚苏文辉
Owner: ZHUHAI XINDEHUI INFORMATION TECH