Individual differential privacy protection method for high-dimensional data publishing in distributed environment

A distributed environment, high-dimensional data technology, applied in the field of personalized differential privacy protection for high-dimensional data release, can solve the problem of data loss of utility, reduce data changes, improve utility, and facilitate correlation analysis

Active Publication Date: 2018-04-03
GUANGXI NORMAL UNIV
View PDF6 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing differential privacy protection technology cannot deal with the release of high-dimensional data in a distributed environment very well, especially when the published d

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Individual differential privacy protection method for high-dimensional data publishing in distributed environment
  • Individual differential privacy protection method for high-dimensional data publishing in distributed environment
  • Individual differential privacy protection method for high-dimensional data publishing in distributed environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in combination with specific examples and with reference to the accompanying drawings.

[0034] The present invention considers the overlapping and overlapping of data attributes in a distributed environment, and realizes personalized allocation of privacy budgets according to the different sensitivity levels of attributes and the correlation between them, so as to solve the problems existing in the prior art. The present invention is based on the semi-credible assumption that each local database performs noise addition according to the allocated privacy budget so that it satisfies ε i -differential privacy(ε i is the allocated privacy budget), and then sent to the data manager for aggregation, after data aggregation satisfies the total ε-differential privacy (where k participants), while better maintaining th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an individual differential privacy protection method for high-dimensional data publishing in a distributed environment. According to the method, the correlation among properties is quantified through mutual information, and the mutual information of corresponding property pairs is calculated by use of a mutual information formula; an approximate k-degree Bayesian network isconstructed according to the mutual information, and the Bayesian network can well reflect dependency among the properties; privacy budgets are allocated individually according to the quantity of sensitive properties and the quantity of non-sensitive properties meeting conditions; all participants perform noise addition processing on data according to the allocated privacy budgets, and a random response mechanism is adopted to perform noise addition; and the data obtained after noise addition is sent to a manager, the manager gathers the data and synthesizes the data into an integrated dataset, and then the dataset is published to the outside. Through the method, when the data is published, a privacy requirement is guaranteed, a large amount of processing data is reduced, therefore, change of the data is lowered, the utility of the data is improved, and the method is beneficial for a data analyzer to perform relevant analysis.

Description

technical field [0001] The invention relates to the technical field of network data security, in particular to a personalized differential privacy protection method for publishing high-dimensional data in a distributed environment. Background technique [0002] In recent years, with the rapid development of the Internet and information technology, the release of privacy-protected data has received a lot of attention, and people have a deeper understanding of the concept of protecting personal privacy and realizing data sharing. Many specific data owners need to publish their original data (such as medical data of a hospital, user data of a social networking site, etc.) for other institutions to conduct research and analysis or for other purposes. The original data to be released may contain a large amount of sensitive personal privacy information (such as salary, illness, personal savings, etc.), and direct release may lead to the disclosure of private information, so the da...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/62G06F17/30
CPCG06F16/21G06F21/6245
Inventor 李先贤赵华兴王利娥刘鹏于东然
Owner GUANGXI NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products