A Hybrid Data Cleaning Method Based on Multiple Data Versions
A mixed data, multi-data technology, applied in the direction of electrical digital data processing, digital data information retrieval, special data processing applications, etc., can solve the problems of long running time, inapplicability, dependence, etc., to reduce the detection range and speed up the running time , High cleaning efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0039] Now in conjunction with accompanying drawing and concrete implementation technical scheme of the present invention is described further:
[0040] Such as figure 1 Shown, the specific implementation process and working principle of the present invention are as follows:
[0041] Step (1): Input the integrity constraint (IC) in the framework and the data set with dirty data into the framework; the dirty data set and integrity constraint are described in Table 1 below:
[0042] Table 1 shows a hospital information dataset record, which contains 4 attributes, namely hospital name (HN), city (CT), state (ST), contact information (PN), and gray shading marks in Table 1 for wrong data. Given three integrity constraints:
[0043]
[0044]
[0045]
[0046] where D represents the data set, t 1 ,t 2 Represents two different tuples, the functional dependency (Functional Dependency, referred to as FD) rule r1 Indicates that a city can only belong to one state, the deni...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


