Unlock instant, AI-driven research and patent intelligence for your innovation.

Privacy protection clustering method based on differential privacy

A technology of privacy protection and clustering method, which is applied in the field of privacy protection clustering based on differential privacy, can solve the problems of insufficient clustering quality and randomness of adding data point noise, and achieve the maintenance of overall distribution, improvement of differential privacy protection, The effect of improving safety

Pending Publication Date: 2021-11-26
STATE GRID JIANGSU ELECTRIC POWER CO LTD MARKETING SERVICE CENT +1
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The deficiency of the prior art document 1 is that, after adding noise to the data object divided into each center point, the method of re-updating the center of the cluster is used to iteratively divide, resulting in the randomness of adding noise to the data point, and it is difficult to obtain the data from the data set. The impact of noise on data availability is considered from the perspective of overall distribution, resulting in insufficient clustering quality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Privacy protection clustering method based on differential privacy
  • Privacy protection clustering method based on differential privacy
  • Privacy protection clustering method based on differential privacy

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] The application will be further described below in conjunction with the accompanying drawings. The following examples are only used to illustrate the technical solutions of the present invention more clearly, but not to limit the protection scope of the present application.

[0069] Such as figure 1 As shown, the present invention provides a privacy protection clustering method based on differential privacy, comprising the following steps:

[0070] Step 1. The data owner obtains the original data set D, calculates the Euclidean distance value between any two data points in the original data set D, and constructs a distance matrix with the Euclidean distance value, where the original data set D is the relational table data, and each data points have the same attribute pattern, and each attribute is a numeric value. Specifically include:

[0071] In step 1.1, the data owner obtains the original data set D, expressed by the following formula,

[0072] D={x 1 ,...,x n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A privacy protection clustering method based on differential privacy comprises the steps of: (1) enabling a data owner to calculate the Euclidean distance between data points in an original data set and construct a distance matrix; (2) adding differential noise to the distance matrix to form a noise-added distance matrix, and sharing the noise-added distance matrix to an untrusted data mining party to avoid data privacy leakage in a clustering analysis process; (3) enabling the data mining party to select k non-outliers with good global distribution from the noise adding distance matrix as initial center points of clustering; (4) calculating a sequence of q nearest central points of each data point, and distributing the data points to an expected interval of the q nearest central points to form a current round of clustering division; and (5) selecting the distance between the points in each cluster and the minimum data point as a new center point. The process is repeated until each cluster center point is not changed any more. According to the method provided by the invention, the clustering mining precision can be effectively improved while differential privacy protection of sensitive data is realized.

Description

technical field [0001] The present invention relates to the technical field of privacy protection, in particular to a privacy protection clustering method based on differential privacy. Background technique [0002] Data mining can discover potential regular patterns hidden in data and provide support for auxiliary decision-making. Clustering is the basic operation of data mining, and the access to individual data during the clustering process has the risk of leaking data privacy. For example, a pharmaceutical company hopes to cluster users' purchase records to obtain drug audiences, but access to sensitive drug purchase records such as skin diseases during clustering will violate patients' privacy. How to implement clustering mining under the premise of taking data privacy into account has become an urgent problem to be solved. Privacy-preserving clustering is an effective method to solve this problem, which has attracted continuous attention of researchers in recent years...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/60G06F21/64G06K9/62
CPCG06F21/602G06F21/64G06F18/23213
Inventor 单超邹云峰范环宇祝宇楠徐超
Owner STATE GRID JIANGSU ELECTRIC POWER CO LTD MARKETING SERVICE CENT