Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Differential Privacy High-Dimensional Data Publishing Protection Method Based on Principal Component Analysis Optimization

A principal component analysis, differential privacy technology, applied in digital data protection, electronic digital data processing, instruments, etc., can solve the problems of exponential growth of publishing space, dimensional disaster, inappropriate high-dimensional data publishing, etc. Disaster problems, effects of optimizing time and space

Active Publication Date: 2021-11-23
辽宁优智物联有限公司
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these data release methods are not suitable for the release of high-dimensional data, and cannot solve the problem of "dimension disaster". The problem
Therefore, how to provide data researchers with a large amount of effective information while using differential privacy technology to ensure the privacy and security of original high-dimensional data has become extremely challenging.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Differential Privacy High-Dimensional Data Publishing Protection Method Based on Principal Component Analysis Optimization
  • A Differential Privacy High-Dimensional Data Publishing Protection Method Based on Principal Component Analysis Optimization
  • A Differential Privacy High-Dimensional Data Publishing Protection Method Based on Principal Component Analysis Optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The present invention will be further described in detail below in conjunction with the accompanying drawings, so that those skilled in the art can implement it with reference to the description.

[0062] like figure 2 As shown, the method for publishing and protecting differentially private high-dimensional data based on principal component analysis optimization of the present invention proposes a PCAO_PPDP method for publishing high-dimensional data with differential privacy based on principal component analysis optimization, which utilizes principal component analysis based on information entropy The method reduces the dimensionality of the data, and uses the personalized Laplace mechanism to ensure that PCAO_PPDP meets the requirements of differential privacy. Theoretical analysis shows that the proposed PCAO_PPDP algorithm satisfies ε-differential privacy; the experimental results show that compared with the existing research work, the data utility of the data set g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for publishing and protecting differentially private high-dimensional data optimized based on principal component analysis, comprising the following steps: step 1, calculating the information entropy of original data attributes, determining the attribute importance threshold, and screening the attributes in the original data ; Step 2, utilize principal component analysis method to carry out dimension reduction to described screening data, determine optimal k value, thereby determine the best release data; Wherein, in the process of dimension reduction, the projection matrix that produces carries out personalized noise addition Obtaining the added data, and making the added data satisfy differential privacy; and in the dimensionality reduction process, performing multiple selections of the number of principal components k, and calculating the relationship between the original data and the The mutual information of the noisy data determines the optimal k value. The present invention provides a differential privacy high-dimensional data release protection method based on principal component analysis optimization, which ensures that data privacy information is not leaked, and at the same time, the released data is better close to the original data.

Description

technical field [0001] The invention relates to the technical field of privacy protection data release, in particular to a differential privacy high-dimensional data release protection method based on principal component analysis optimization. Background technique [0002] At present, many data collection organizations need to release the collected raw data (such as medical data, financial data, etc.) to facilitate data analysis and mining, and to generate more effective decision support from the released data, such as figure 1 Shown is a schematic diagram of data distribution. However, the released original data involves a large amount of personal sensitive information, and directly releasing the data will lead to a serious disclosure of personal privacy. Therefore, data publishers need to process private data through special protection technology before releasing the data. [0003] At this stage, the main privacy-preserving data release technologies are roughly divided i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F21/62G06K9/62
CPCG06F21/6245G06F18/2135
Inventor 史伟李万杰张兴张青云
Owner 辽宁优智物联有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products