A distributed heterogeneous data cleaning system based on visual management
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- 众创网(武汉)科技有限公司
- Publication Date
- 2022-05-24
Smart Images

Figure 1
Abstract
Description
technical field
[0001] The present invention relates to the technical field of data processing, and more particularly, to a distributed heterogeneous data cleaning system based on visual management. Background technique
[0002] Data cleaning is the core of government intensification, data warehouse and data mining. It is the basis of government data migration. The complexity of heterogeneous data leads to slow data cleaning and error-prone. Due to the wide range of data sources in ETL technology, These data sources may be stored on different hardware or different operating systems, so there will inevitably be some "dirty data" in these data sources. The purpose of data cleaning is to find and eliminate those data that do not meet the specifications, which is important for ensuring data. The high quality of the data warehouse has a very important impact on the correctness of the data warehouse and subsequent data mining and decision analysis.
[0003] The heterogeneity of d...