Intelligent data blood relationship tracing method and device based on clustering analysis
A cluster analysis and data technology, applied in the field of big data, can solve problems such as inability to complete, data performance impact, and inability to process data lineage, etc., to achieve the effect of improving accuracy and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0021] According to one or more embodiments, such as figure 1 As shown, a method for intelligently tracing the origin of data based on cluster analysis includes the steps:
[0022] Step 1: Read the table structure and data, and form the data characteristics of each field by means of data engineering. The specific method is as follows:
[0023] Step 1.1: Parse the data characteristics of the original data into structured sample data, including field type, field length, field content mode, etc.
[0024] Step 1.2: Combine the existing features in the sample data to form high-dimensional features;
[0025] Step 1.3: Analyze high-dimensional features, form new dimensions and rank the influence of new dimensions;
[0026] Step 1.4: Reduce the sample data according to the new dimension, and use the smallest number of dimensions on the premise that the distortion rate of the sample data is lower than the set value;
[0027] Step 1.5: Normalize the sample data of the new dimension.
[0028] Step ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap