A distributed data cleaning system and method based on data analysis
A distributed data cleaning technology based on data analysis, applied in the fields of electronic digital data processing, digital data information retrieval, and special data processing applications. It addresses problems such as unpredictable cleaning results, damage to data integrity, and loss of data attributes, and achieves improved controllability and precision, high flexibility and practicality, and improved speed and accuracy.
Examples
Embodiment 1
[0085] As shown in Figures 1-2, the processing unit 2 includes a collection module 201, a processing module 202, a metadata classification module 203, a cleaning module 204 and an output module 205. The collection module 201 is used to collect the user model together with the metadata elements and the source data elements of the multivariate heterogeneous database 1;
[0086] The processing module 202 is configured to screen out initial metadata elements according to the correlation between the metadata elements collected by the collection module 201 and the user model;
[0087] The metadata classification module 203 screens, from the metadata elements collected by the collection module 201, the metadata elements that have a common relationship with the initial metadata elements, and extracts, from the source data elements collected by the collection module 201, the source data elements corresponding to those metadata elements, the source da...
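The screening flow of modules 202 and 203 described above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions and not the implementation of the embodiment: the text does not define how correlation or a "common relationship" is computed, so the sketch assumes that each metadata element carries a set of tags, that correlation is the Jaccard similarity between those tags and the user model, and that two elements have a common relationship when they share at least one tag. The names MetadataElement, correlation, screen_initial, screen_related, extract_source and the threshold value are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetadataElement:
    """Hypothetical metadata element: a field name plus descriptive tags."""
    name: str
    tags: frozenset


def correlation(element, user_model):
    """Assumed measure: Jaccard similarity between element tags and the user model."""
    union = element.tags | user_model
    return len(element.tags & user_model) / len(union) if union else 0.0


def screen_initial(elements, user_model, threshold=0.5):
    """Processing module 202 (sketch): keep elements correlated with the user model."""
    return [e for e in elements if correlation(e, user_model) >= threshold]


def screen_related(elements, initial):
    """Metadata classification module 203 (sketch): elements that share at least
    one tag with any initial element (the assumed 'common relationship')."""
    initial_names = {e.name for e in initial}
    initial_tags = frozenset().union(*(e.tags for e in initial))
    return [
        e for e in elements
        if e.name not in initial_names and e.tags & initial_tags
    ]


def extract_source(source_data, related):
    """Pick the source data elements that correspond to the related metadata."""
    names = {e.name for e in related}
    return {name: values for name, values in source_data.items() if name in names}
```

In this sketch the source data elements are modeled as a dictionary keyed by metadata element name, so the correspondence between metadata and source data is a simple key lookup. A real multivariate heterogeneous database would need a mapping layer per data source.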


