Difference privacy protection K-means cluster method based on profile coefficient
A technology of k-means clustering and silhouette coefficient, which is applied in the field of information security, can solve the problems of big data clustering analysis information leakage, etc., and achieve the effects of good clustering result availability, good algorithm stability, and increased execution time
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0026] A K-means clustering method for differential privacy protection based on silhouette coefficients in the present invention, the process is as follows figure 1 As shown, the specific steps are as follows:
[0027] Step 1. Divide the data set into M pieces of data of the same size and perform the Map task and Reduce task respectively. Assume that the data set is D, the total number of records in the data set is N, and the record is recorded as a i , where, 1≤i≤N, the dimension of the record is d, the number of clusters is K, and the kth central point is denoted as u k , 1≤k≤K, privacy budget ε, the random noise of the kth cluster in the tth iteration is t is the number of iterations;
[0028] Step 2. Perform normalization processing on all the data in the data set D, and distribute all points to [0,1] during normalization processing ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com