Heteroscedasticity difference privacy preservation method for medical data based on OPTICS clustering

A medical data and differential privacy technology, which is applied in digital data protection, other database clustering/classification, patient-specific data, etc., can solve the problems of data availability decline, low data availability, weak protection, etc., and achieve time complexity reduction, Effects of improved data availability and increased security

Inactive Publication Date: 2019-02-26
SHANDONG UNIV OF SCI & TECH +1
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to achieve data privacy protection, the current common technologies can be divided into three categories: data encryption, restriction of release and data distortion. Although the above methods are very effective, they also have their shortcomings: data encryption makes data availability extremely low, which violates The main premise of the restriction release does not define a strict attack model. When the attacker has sufficient knowledge background, the protection given by this technology is very weak; the typical technology corresponding to data distortion is differential privacy protection technology, which is also the most effective and The most commonly used technique, which defines a powerful attack model, but because it is based on distortion, it will still lead to a significant decrease in data availability when giving data strong protection (increasing noise)
[0006] The OPTICS algorithm is a typical clustering algorithm in artificial intelligence. It can cluster according to the shape of the dense area covered by the data set, find clusters of any shape, and it is not sensitive to the input of parameters, which better reflects the query The density relationship of the data in the data set enables more effective analysis of the distribution and association of medical data, but its processing of low-frequency data needs to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described below.

[0027] A heteroscedastic differential privacy protection method for medical data based on OPTICS clustering, including:

[0028] a) Process the medical data, delete the data identifier in the medical data, represent the discrete attributes in the medical data with fixed integers, and obtain the medical data D after digitizing all the medical data;

[0029] b) The user defines the privacy parameter K of the K-anonymity mechanism, generalizes the medical data D based on the quasi-identifier, and obtains the generalized medical data set D';

[0030] c) The generalized medical data set D' is used as the clustering data of the OPTICS algorithm, the user defines the neighborhood radius r and the minimum number M in the OPTICS algorithm, and establishes the seed queue L1, result queue L2 and pointers in the OPTICS algorithm S, initialize the seed queue L1 and the result queue L2 as an empty queue, and define the pointer...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A heteroscedasticity difference privacy preservation method for medical data based on OPTICS clustering is proposed. The time complexity of OPTICS clustering algorithm is reduced by introducing singlelinked list update and pointer S, and the combination of K-anonymity and differential privacy preservation enhances its security. In order to ensure the availability of data, In this process, heteroscedastic noise is adopted to improve data availability. During the process, we assume that the attacker can obtain the probability of obtaining the privacy information successfully under the maximum knowledge background, and set the upper bound of privacy parameters so as to ensure that the relationship between data availability and privacy security is effectively balanced within the scope of privacy protection.

Description

technical field [0001] The invention relates to the technical field of medical data protection, in particular to an OPTICS clustering-based privacy protection method for medical data heteroscedasticity difference. Background technique [0002] In recent years, the combination of medical data and artificial intelligence has made intelligent medical treatment unprecedentedly hot. The correct use of medical data has brought great value. Coupled with the combination of artificial intelligence, it has led the medical industry to a new way, new vision, and new situation. , however, due to the value of medical data, the number and methods of attacks on medical data have increased significantly, and the leakage of medical data privacy has become more serious. How to ensure the availability of medical data while ensuring its data Privacy security has become a major focus. [0003] Data Privacy Protection: [0004] In order to achieve data privacy protection, the current common tech...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62G16H10/60G06K9/62G06F16/906
CPCG06F21/6245G06F21/6254G16H10/60G06F18/23
Inventor 王英龙孙宗锟舒明雷崔焕庆赵慧奇成曦平永杰燕婷
Owner SHANDONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products