Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Differential privacy k-means clustering method based on cluster similarity and transformation invariance

A technology of differential privacy and clustering method, which is applied in the field of privacy protection and can solve problems such as poor availability of results

Inactive Publication Date: 2021-02-12
ZHENGZHOU UNIV
View PDF1 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present invention provides a differentially private k-means clustering method based on cluster similarity and transformation invariance, aiming to solve the problem of poor availability of results after adding Laplace noise to existing differentially private clustering methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Differential privacy k-means clustering method based on cluster similarity and transformation invariance
  • Differential privacy k-means clustering method based on cluster similarity and transformation invariance
  • Differential privacy k-means clustering method based on cluster similarity and transformation invariance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. It should be understood that these examples are only used to illustrate the present invention and not to limit the scope of the present invention. After reading the present invention, modifications to various equivalent forms of the present invention by those skilled in the art fall within the scope defined by the appended claims of the present application.

[0035] The present invention is a differentially private k-means clustering method based on cluster similarity and transformation invariance, and its overall framework is as follows figure 1 As shown, the specific steps are as follows:

[0036] Step 1. Preprocess the data set and normalize all samples to [0,1] d In the space, d represents the dimension of the sam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a differential privacy k-means clustering method based on cluster similarity and transformation invariance. The differential privacy k-means clustering method comprises the following steps: preprocessing a data set; reversely sorting the probabilities P that the sample is selected as the next initial cluster center, and selecting K initial cluster centers by using a roulette method; calculating the product of the Euclidean distance from each sample to K cluster centers and the similarity to each cluster set, and taking the product as a similarity measurement index; dividing clusters according to similarity measurement indexes; calculating a new cluster center u'j, and performing privacy protection on the new cluster center by using a differential privacy Laplace noise mechanism; correcting the new cluster center disturbed by the Laplace noise mechanism according to the transformation invariance of the differential privacy; and repeating the steps until a convergence condition is met or the maximum number N of iterations is reached. According to the method, the problem that the result availability of an existing differential privacy clustering method is poorafter Laplace noise disturbance is added is solved.

Description

technical field [0001] The invention belongs to the technical field of privacy protection, and in particular relates to a differential privacy k-means clustering method based on cluster similarity and transformation invariance. Background technique [0002] As a typical unsupervised data mining method, K-means clustering can mine unknown knowledge and potential value from massive data. However, while mining useful information, personal privacy information in the data may be leaked, causing immeasurable threats and losses to users. Therefore, in the process of k-means clustering analysis, it is necessary to protect the personal privacy information in the data and ensure the availability of the final clustering results. [0003] As attackers have more and more background knowledge, traditional privacy protection technologies, such as k-anonymity, t-closeness, etc., have become difficult to guarantee data security. As a new privacy protection model with strict mathematical pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23213G06F18/22
Inventor 叶阳东徐富国胡世哲
Owner ZHENGZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products