Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

High-dimensional mass data release privacy protection method

A privacy protection, massive data technology, applied in the field of information technology security, it can solve the problems of wasting privacy budget, unable to meet actual requirements, and publishing space growth, etc. Effect

Pending Publication Date: 2022-05-27
LIAONING UNIVERSITY OF TECHNOLOGY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, part of the privacy budget is wasted when constructing the noise covariance matrix, and the processing time of this method cannot meet the actual requirements when dealing with data with large attribute dimensions
[0010] At present, the research on data release mainly focuses on one-dimensional or low-dimensional data. However, these data release methods are not suitable for the release of high-dimensional data, and cannot solve the With the increase of domains, the publishing space formed grows exponentially, encountering the problem of "dimension disaster", introducing large noise, resulting in low availability of published data
Therefore, in the release of high-dimensional data, it is urgent to design a release method that can solve the problem of low data availability caused by the disaster of dimensionality and satisfy data privacy and security while providing data researchers with a large amount of effective information.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-dimensional mass data release privacy protection method
  • High-dimensional mass data release privacy protection method
  • High-dimensional mass data release privacy protection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The present invention will be further described in detail below, so that those skilled in the art can implement according to the description.

[0062] The invention provides a privacy protection method for high-dimensional mass data release. First, through the attribute importance threshold, the attributes in the original data are screened, and the useless attributes and attributes with many missing values ​​in the original data are eliminated, and then use the main The component analysis method reduces the dimensionality of the data. During the dimensionality reduction, Laplace noise is added to the generated projection matrix to make the data satisfy differential privacy. Then, on the premise of satisfying differential privacy, the sensitive preferences of data attributes are classified and combined with the optimal Matching theory to allocate the privacy budget, add noise of different sizes to the attributes of different sensitive preferences in the data set, implemen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a high-dimensional mass data release privacy protection method, which comprises the following steps of: reducing dimensions of data by introducing an attribute importance optimization PCA (Principal Component Analysis) algorithm so as to reduce time and space; a differential privacy protection strategy meeting individuation is designed in the dimension reduction process, and different protection degrees are designed for different sensitive attributes; a mutual information evaluation mechanism is introduced into a PCA algorithm for the first time, data generated by selecting different principal component numbers in the dimension reduction process is evaluated, and the optimal principal component number is determined.

Description

technical field [0001] The present invention relates to the technical field of information technology security, and more particularly, the present invention relates to a privacy protection method for publishing high-dimensional mass data. Background technique [0002] At present, many data collection agencies need to release the collected raw data (such as medical data, financial data, etc.) to facilitate data analysis and mining, and to generate more effective decision support from the released data. However, a large amount of personal sensitive information is involved in the released raw data, and the direct release of data will lead to serious leakage of personal privacy. Therefore, data publishers need to process private data through special protection technology before releasing the data. [0003] In the prior art, there have been a few research results on high-dimensional data publishing methods, but these methods all have some problems: [0004] The PriView algorith...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62G06K9/62
CPCG06F21/6245G06F18/2135
Inventor 褚治广彭栋栋徐忠全张兴张巍
Owner LIAONING UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products