Differential privacy protection-oriented k-means clustering method adopting

A differential privacy and clustering method technology, applied in the field of information security, can solve problems such as reducing the availability of clustering results, avoid the blindness of k value and initial point sensitivity, improve usability, and reduce the number of iterations.

Active Publication Date: 2018-07-13
DONGGUAN MENGDA INDAL INVESTMENT
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, traditional differential privacy-protected k-means algorithms (such as differential privacy k-means algorithm, differential privacy k-means++ algorithm, etc.) blindness, which reduces the availability of clustering results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Differential privacy protection-oriented k-means clustering method adopting
  • Differential privacy protection-oriented k-means clustering method adopting
  • Differential privacy protection-oriented k-means clustering method adopting

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The implementation of the technical solution of the present invention will be described in further detail below in conjunction with the accompanying drawings. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. After reading the present invention, those skilled in the art will understand the present invention Modifications in various equivalent forms fall within the scope defined by the appended claims of the present application.

[0037] A differential privacy-oriented k-means clustering method of the present invention uses the result of the k-means++ algorithm as an input value, and then performs a series of non-local "jumps" alternately with the traditional k-means algorithm to improve the initial The selection of the center point, and using the differential privacy protection Laplace mechanism, increases the appropriate random noise that satisfies a specific di...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a differential privacy protection-oriented k-means clustering method. The K-means clustering method comprises the following steps: performing data preprocessing; ensuring thatC indicates a clustered centered point set, and C indicates a sum of error square of a given data set and a cluster center C; judging the volume of C; performing cyclic execution until retry is greater than a maximum value retrymatx of given retry times, and then returning to the best central point Cbest; traversing each point of the data set X, classifying the points to the nearest central point;setting added random noises; renewedly calculating the sum of the data points of each cluster and the quantity of the points, and adding the noises and finally updating the quality center of the cluster; and repeatedly carrying out the steps until the sum of error square is converged or iteration times reach the upper limit. According to the differential privacy protection-oriented k-means clustering method disclosed by the invention, the appropriate random noises which are specially distributed are added in an iteration process of a k-means clustering algorithm, so that a clustering result is distorted to a certain extent, the aim of privacy protection is fulfilled, and meanwhile, the availability of data is ensured.

Description

technical field [0001] The invention relates to a privacy protection and clustering method, in particular to a differential privacy protection-oriented k-means clustering method, which belongs to the technical field of information security. Background technique [0002] With the rapid development of cloud computing and big data, data mining technology has made great progress in some in-depth research and applications. As one of the important methods of data mining, clustering algorithm can mine implicit and unknown knowledge and rules, and has important potential value in business decision-making of a large amount of relevant data. But at the same time, a large amount of information disclosure of sensitive information brings immeasurable threats and losses to users. Therefore, how to protect data privacy in the process of cluster analysis has become a hot issue in the field of data mining and data privacy protection. With the proposal and development of privacy protection ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F21/62
CPCG06F21/6245G06F18/23213
Inventor 杨庚胡闯白云璐王璇唐海霞
Owner DONGGUAN MENGDA INDAL INVESTMENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products