An automatic data cleaning method based on DeepDive
An automatic data cleaning technology, applied in the field of data processing
Embodiment Construction
[0018] The present invention will now be further described with reference to the embodiments and the accompanying drawings:
[0019] The present invention proposes an automatic data cleaning method based on DeepDive. The automatic data cleaning flow chart is shown in Figure 1. The technical solution adopted to solve the technical problem includes the following:
[0020] 1. Data preprocessing
[0021] Set a threshold on the size of the data to be cleaned. Calculate the size of the data to be cleaned, that is, the number of tuples it contains. If the size exceeds the threshold, randomly sample the original data to obtain the sampled data; otherwise keep the original data.
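The preprocessing step can be illustrated with a minimal Python sketch. The function name `preprocess`, the `threshold` and `seed` parameters, and the choice to sample down to exactly `threshold` tuples are assumptions made for illustration; the text specifies only that oversized data is randomly sampled and does not state the sample size.

```python
import random


def preprocess(tuples, threshold, seed=None):
    """Return the data unchanged, or a random sample if it is too large.

    `tuples` is a list of records. `threshold` is the maximum number of
    tuples to keep (assumption: the sample size equals the threshold).
    `seed` is a hypothetical parameter that makes the sample reproducible.
    """
    size = len(tuples)  # size of the data = number of tuples
    if size > threshold:
        rng = random.Random(seed)
        # Sample without replacement so that no tuple is duplicated.
        return rng.sample(tuples, threshold)
    # Small enough: keep the original data.
    return list(tuples)
```

For example, `preprocess(data, 10000)` would leave a 5,000-tuple data set untouched and reduce a 1,000,000-tuple data set to 10,000 randomly chosen tuples.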
[0022] 2. Data model learning
[0023] In the absence of ready-made data cleaning patterns or rules, a data model is learned from the data obtained after preprocessing, in order to discover the implicit, non-absolute or relatively weak dependencies in the data, which are represented in the form of a Bayesian network. In the learning phase of the ...
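To make the notion of a non-absolute dependency concrete, the following minimal Python sketch scores how strongly one attribute determines another. It is not the learning procedure of the method, which is not reproduced in the text above; the function name `dependency_strength`, the scoring rule, and the example attributes `zip` and `city` are all assumptions chosen for illustration.

```python
from collections import Counter, defaultdict


def dependency_strength(rows, lhs, rhs):
    """Score the soft dependency lhs -> rhs on a list of dict records.

    The score is the fraction of rows whose `rhs` value equals the most
    frequent `rhs` value observed together with the same `lhs` value.
    A score of 1.0 means an exact functional dependency; a score slightly
    below 1.0 indicates a dependency that holds for most tuples only.
    """
    if not rows:
        return 0.0
    # For every lhs value, count how often each rhs value occurs with it.
    groups = defaultdict(Counter)
    for row in rows:
        groups[row[lhs]][row[rhs]] += 1
    # Rows that agree with the majority rhs value of their lhs group.
    agreeing = sum(c.most_common(1)[0][1] for c in groups.values())
    return agreeing / len(rows)


rows = [
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "NYC"},  # deviates from the majority value
    {"zip": "94105", "city": "San Francisco"},
]
print(dependency_strength(rows, "zip", "city"))  # 0.8
```

In this toy data, `zip` determines `city` for four of the five tuples, so the dependency is strong but not absolute, and the deviating tuple is a natural candidate for cleaning. Such pairwise scores could serve as candidate edges when building a Bayesian network, though that use is likewise an assumption here.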