Data filling method and device based on clustering algorithm and computer equipment
A technology of clustering algorithm and filling method, applied in the field of big data, which can solve problems such as errors and missing data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0039] see figure 1 , in a data filling method based on a clustering algorithm of the present embodiment, the following steps may be included:
[0040] Step 01, determine the attributes of the missing data.
[0041] In the process of data collection or transmission, due to human error or mechanical reasons, null values may be caused, resulting in missing data. In this embodiment, the positioning of missing data can be realized by using a null value positioning method.
[0042] In the embodiment of the present invention, after the missing data is located, the attribute of the missing data may be determined according to the data content. For example, if a boy's love for basketball is missing, then the love for basketball is determined as an attribute of the missing data. For another example, if a user has missing data on the probability of renewal of the purchased target insurance after expiration, then the probability of renewal of the target insurance after expiration is ...
Embodiment 2
[0096] see Figure 5 The data filling method based on the clustering algorithm of the present embodiment is based on the first embodiment, including the following steps:
[0097] Step 501, determining attributes of missing data.
[0098] Step 502, performing binary group integration on the data according to the attributes of the missing data.
[0099] Step 503, clustering the data after the binary group integration to form clusters.
[0100] In the embodiment of the present invention, in order to realize the filling of the missing data, the data with the same attribute as the missing data can be clustered according to the data after the binary group integration and the reference data as a benchmark. For example, based on boys as the benchmark, clustering the degree of love for basketball can form multiple clusters. The formed clusters are all boys’ love for basketball, but the degree of love is different. For example, five clusters are formed , respectively: Like it very mu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com