Data release privacy protection algorithm and system based on big data

A data release privacy protection technology, applied in digital data protection, electronic digital data processing, computing, and related fields, that addresses problems such as privacy leakage through probabilistic attacks and similarity attacks.

Inactive Publication Date: 2020-11-20
汪秀英
0 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

Traditional methods for generating equivalence classes in an anonymization model do not take sensitive attribute values into account, so an equivalence class is very likely to contain similar values of the same sensitive attribute. This makes probabilistic attacks and similarity attacks easy and leads to privacy leaks. In addition, constraining sensitive attribute values causes greater information loss.
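The weakness described above can be illustrated with a small check. This is a minimal sketch, not the patent's method: it groups records into equivalence classes by their quasi-identifiers and flags any class whose sensitive attribute takes fewer than a chosen number of distinct values. The column names (`age`, `zip`, `disease`), the helper names, and the threshold `min_distinct` are all assumptions made for illustration.

```python
from collections import defaultdict


def group_by_quasi_identifiers(rows, quasi_ids):
    """Group records into equivalence classes keyed by quasi-identifier values."""
    classes = defaultdict(list)
    for row in rows:
        key = tuple(row[q] for q in quasi_ids)
        classes[key].append(row)
    return dict(classes)


def weak_classes(rows, quasi_ids, sensitive, min_distinct):
    """Return keys of classes with fewer than min_distinct sensitive values."""
    classes = group_by_quasi_identifiers(rows, quasi_ids)
    return sorted(
        key
        for key, members in classes.items()
        if len({m[sensitive] for m in members}) < min_distinct
    )


# Hypothetical, already generalized table (illustrative data only).
rows = [
    {"age": "30-39", "zip": "130**", "disease": "flu"},
    {"age": "30-39", "zip": "130**", "disease": "flu"},
    {"age": "40-49", "zip": "148**", "disease": "flu"},
    {"age": "40-49", "zip": "148**", "disease": "cancer"},
]

flagged = weak_classes(rows, ["age", "zip"], "disease", min_distinct=2)
print(flagged)
```

Every record in the flagged class shares one sensitive value, so an attacker who can place a person in that class learns the value even though the class itself has two members.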

Examples

Embodiment Construction

[0094] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0095] The data table to be released is obtained, the statistical histogram of the data to be released is converted into a k-ary interval tree, and the k-ary interval tree is perturbed by adding noise to obtain a noise-added interval tree. At the same time, an ensemble classification function based on feature selection is constructed and solved to obtain the classification result of the data to be released. According to that classification result, the data to be released is clustered, and a clustering-based personalized anonymization algorithm is used to protect data privacy and release the private data. Referring to Figure 1, it is a schematic diagram of a big data-based data publishing privacy protection algorithm provided by an embodiment of the pre...
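The histogram-to-tree step can be sketched as follows. This is an illustration under stated assumptions, not the procedure of the embodiment: the source does not say which noise distribution or budget split is used, so the sketch assumes Laplace noise with the privacy budget `epsilon` shared evenly across tree levels, a common choice for hierarchical histograms. The function names, the example histogram, and the fan-out `k=3` are hypothetical.

```python
import random


def build_interval_tree(counts, k):
    """Return tree levels, leaves first; each parent sums up to k adjacent children."""
    if k < 2:
        raise ValueError("k must be at least 2")
    levels = [list(counts)]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append([sum(prev[i:i + k]) for i in range(0, len(prev), k)])
    return levels


def laplace(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def perturb(levels, epsilon, rng):
    """Add Laplace noise to every node of the tree.

    Adding or removing one record changes exactly one node per level,
    so the noise scale is (number of levels) / epsilon (an assumption).
    """
    scale = len(levels) / epsilon
    return [[count + laplace(scale, rng) for count in level] for level in levels]


# Hypothetical histogram with nine bins.
histogram = [12, 7, 3, 9, 14, 5, 8, 2, 6]
tree = build_interval_tree(histogram, k=3)
noisy = perturb(tree, epsilon=1.0, rng=random.Random(0))

print(tree[1], tree[-1])
print([round(x, 1) for x in noisy[0]])
```

Each internal node stores the count of an interval covering up to `k` child intervals, so a range query can be answered from a few noisy nodes rather than from many noisy leaves.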

Abstract

The invention relates to the technical field of data privacy protection and discloses a big data-based data release privacy protection algorithm. The algorithm comprises the steps of: obtaining a data table to be released, and obtaining a statistical histogram of the data to be released according to the attribute value division result of that data; converting the statistical histogram into a k-ary interval tree, perturbing the k-ary interval tree by adding noise to obtain a noise-added interval tree, and approximately converting the noise-added interval tree into an average histogram; constructing an ensemble classification function based on feature selection; solving the ensemble classification function with the L-BFGS optimization algorithm to obtain a classification result for the data to be released; and clustering the data to be released according to that classification result, protecting data privacy with a clustering-based personalized anonymization algorithm, and releasing the private data. The invention further provides a big data-based data release privacy protection system. The invention thereby realizes the protection of private data.
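The final clustering and anonymization step can be pictured with a deliberately simplified sketch. It is not the personalized algorithm of the invention, whose details are not given here: it assumes a single numeric quasi-identifier, forms clusters by sorting and cutting into groups of at least `k` records, and generalizes each value to its cluster's range. The function name, the field names, and the sample records are hypothetical.

```python
def cluster_anonymize(records, qi, k):
    """Generalize one numeric quasi-identifier using clusters of at least k records.

    Records are sorted by the quasi-identifier, cut into consecutive clusters,
    and each value is replaced by the (min, max) range of its cluster.
    """
    if k < 1 or len(records) < k:
        raise ValueError("need k >= 1 and at least k records")
    ordered = sorted(records, key=lambda r: r[qi])
    n_clusters = len(ordered) // k
    released = []
    for c in range(n_clusters):
        start = c * k
        # The last cluster absorbs the remainder so no cluster is smaller than k.
        end = (c + 1) * k if c < n_clusters - 1 else len(ordered)
        members = ordered[start:end]
        lo, hi = members[0][qi], members[-1][qi]
        released.extend({**m, qi: (lo, hi)} for m in members)
    return released


# Hypothetical records (illustrative data only).
records = [
    {"age": a, "disease": d}
    for a, d in [(23, "flu"), (51, "asthma"), (27, "cancer"), (49, "flu"), (25, "asthma")]
]
released = cluster_anonymize(records, "age", k=2)
for row in released:
    print(row)
```

Sorting before cutting keeps each cluster's range narrow, which limits information loss; a personalized variant would additionally let each record or sensitive value demand its own protection level.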

Description

Technical field

[0001] The present invention relates to the technical field of data protection, in particular to a big data-based data release privacy protection algorithm and system.

Background technique

[0002] With the popularization of the Internet and the rise of the mobile Internet, the commercial value of big data has been applied to all aspects of society, which has had a profound impact on the development of human society. At the same time, it has made the collection, analysis, and mining of data more convenient and accurate. However, sharing, mining, and knowledge discovery on data for research purposes are also accompanied by the leakage of sensitive private information. How to protect private data has become a hot topic in current research.

[0003] At present, the privacy protection technology in data release can be divided into restricted release technology, data encryption technology and data distortion techn...

Application Information

IPC(8): G06F21/62, G06K9/62
CPC: G06F21/6254, G06F18/23, G06F18/24
Inventor: 汪秀英
Owner: 汪秀英