Micro-aggregation anonymization method based on sorting

A sorting-based micro-aggregation anonymization technique, applied in the fields of instruments, electrical digital data processing, and digital data protection. It addresses the inability of prior methods to process multidimensional data sets and their high information loss rate, and achieves a low information loss rate and improved execution efficiency.

Active Publication Date: 2018-05-18
HOHAI UNIV

AI Technical Summary

Problems solved by technology

[0006] Purpose of the invention: To address the prior art's inability to process multidimensional data sets and its somewhat high information loss rate, the present invention combine...

Method used

Examples

Embodiment Construction

[0038] The present invention provides a sorting-based micro-aggregation anonymization method. It uses mean-value sorting to improve the k-partition process so that the information loss rate is minimized, and it improves the execution efficiency of the algorithm. Once sorting is introduced, multidimensional data sets can also be processed and privacy protection is somewhat strengthened.
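The visible text does not define how the information loss rate is computed. As a rough illustration only, the sketch below uses the measure commonly found in the micro-aggregation literature: the within-class sum of squared errors (SSE) divided by the total sum of squares (SST), expressed as a percentage. The class name `InformationLossSketch` and the method `lossPercent` are hypothetical, and whether the patent uses this exact measure is an assumption.

```java
final class InformationLossSketch {

    /**
     * Returns 100 * SSE / SST, where SSE is the squared distance between each
     * original record and its anonymized replacement, and SST is the squared
     * distance between each original record and the global mean record.
     * ASSUMPTION: this is the usual literature measure, not confirmed by the source.
     */
    static double lossPercent(double[][] original, double[][] anonymized) {
        int n = original.length;
        if (n == 0 || anonymized.length != n) {
            throw new IllegalArgumentException("both data sets must have the same, non-zero size");
        }
        int d = original[0].length;

        // Global mean of every attribute over the original data.
        double[] mean = new double[d];
        for (double[] row : original) {
            for (int a = 0; a < d; a++) {
                mean[a] += row[a] / n;
            }
        }

        double sse = 0.0;
        double sst = 0.0;
        for (int i = 0; i < n; i++) {
            for (int a = 0; a < d; a++) {
                double err = original[i][a] - anonymized[i][a];
                double dev = original[i][a] - mean[a];
                sse += err * err;
                sst += dev * dev;
            }
        }
        if (sst == 0.0) {
            throw new IllegalArgumentException("all records are identical; the ratio is undefined");
        }
        return 100.0 * sse / sst;
    }
}
```

A lower value means the anonymized records stay closer to the originals; replacing every record with the global mean would give 100.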

[0039] This embodiment is simulated in the Java programming language on the MyEclipse 10 platform. The PC configuration is: CPU i7, 8 GB DDR memory, 1 TB SATA disk, Windows 8 operating system.

[0040] The experiments use three data sets that serve as research benchmarks for evaluating micro-aggregation methods: DS1 "Tarragona", DS2 "Census" and DS3 "EIA". DS1 contains 834 instances and 13 numerical attributes, DS2 contains 1080 instances and 13 numerical attributes, and DS3 contains 4092 instances and 11 numerical...

Abstract

The invention provides a sorting-based micro-aggregation anonymization method. The method comprises the following steps: (1) a sorting operation: based on the Q1 quasi-identifier, the data set is divided into a number of categories so that the k-partition of the data set is based on the Q1 quasi-identifier; (2) a sorting-based partition operation: starting from the first extreme record and the last extreme record of the data set produced by the sorting operation, equivalence classes are formed independently and systematically, with the number of records in each equivalence class kept at k; and (3) an aggregation operation: the center point of the two extreme records is taken as the centroid of each equivalence class, and all sensitive attribute values are replaced with the mean value of the equivalence class to form an anonymous equivalence class. By means of mean-value sorting, the method improves the k-partition process so that its information loss rate is minimized, and it improves the execution efficiency of the algorithm. In addition, once sorting is introduced, multidimensional data sets can be processed and privacy protection is improved.
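The three steps of the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the sort key is taken to be the mean of each record's numerical attributes, classes of k records are taken alternately from the front and the back of the sorted order, the final k to 2k-1 leftover records form one class, and every record is replaced by the attribute-wise mean of its class (the abstract instead derives the centroid from the two extreme records). The class name `SortedMicroAggregationSketch` and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

final class SortedMicroAggregationSketch {

    /** ASSUMED sort key: the mean of one record's numerical attributes. */
    static double rowMean(double[] row) {
        return Arrays.stream(row).average().orElse(0.0);
    }

    /** Returns an anonymized copy in which each row is the mean of its class. */
    static double[][] anonymize(double[][] data, int k) {
        int n = data.length;
        if (k < 1 || n < k) {
            throw new IllegalArgumentException("need k >= 1 and at least k records");
        }

        // Step 1 (sorting): order record indices by the assumed key.
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) {
            order[i] = i;
        }
        Arrays.sort(order, Comparator.comparingDouble(i -> rowMean(data[i])));

        // Step 2 (partition): take k records alternately from both extremes.
        List<List<Integer>> classes = new ArrayList<>();
        int lo = 0;
        int hi = n - 1;
        boolean takeFront = true;
        while (hi - lo + 1 >= 2 * k) {
            List<Integer> cls = new ArrayList<>();
            for (int j = 0; j < k; j++) {
                cls.add(takeFront ? order[lo++] : order[hi--]);
            }
            classes.add(cls);
            takeFront = !takeFront;
        }
        // ASSUMPTION: the remaining k..2k-1 records form the last class.
        List<Integer> last = new ArrayList<>();
        while (lo <= hi) {
            last.add(order[lo++]);
        }
        classes.add(last);

        // Step 3 (aggregation): replace each record by its class mean.
        double[][] out = new double[n][];
        for (List<Integer> cls : classes) {
            int d = data[cls.get(0)].length;
            double[] centroid = new double[d];
            for (int idx : cls) {
                for (int a = 0; a < d; a++) {
                    centroid[a] += data[idx][a] / cls.size();
                }
            }
            for (int idx : cls) {
                out[idx] = centroid.clone();
            }
        }
        return out;
    }
}
```

Calling `SortedMicroAggregationSketch.anonymize(data, k)` leaves `data` untouched and returns rows in the original record order, so every returned row is shared by at least k records.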

Description

Technical field

[0001] The invention relates to a sorting-based micro-aggregation anonymization method, which is a data privacy protection method in the field of information security.

Background art

[0002] In the current information age, releasing data benefits the field of data analysis. Examples include the release of election ballot information, census information, and medical and health information. By mining such information, one can assess election trends, population growth trends, and the health of the population. However, released data more or less contains sensitive information, such as personal private details. To avoid the threat of data privacy breaches, data should be anonymized before release.

[0003] In the current anonymization field, the most widely used anonymity technique is k-anonymity, whose main idea is to make each record indistinguishable from at least k-1 other records. The general implementation pr...
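The k-anonymity condition stated in [0003], that each record is indistinguishable from at least k-1 other records, can be checked on a released table with a short sketch like the one below. The class name `KAnonymityCheckSketch`, the method `isKAnonymous`, and the choice to pass quasi-identifier columns as indices are hypothetical, introduced only for illustration.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

final class KAnonymityCheckSketch {

    /**
     * Returns true when every combination of quasi-identifier values that
     * occurs in the table is shared by at least k rows.
     */
    static boolean isKAnonymous(double[][] rows, int[] quasiIdentifierColumns, int k) {
        // Count how many rows share each quasi-identifier combination.
        Map<String, Integer> counts = new HashMap<>();
        for (double[] row : rows) {
            double[] key = new double[quasiIdentifierColumns.length];
            for (int j = 0; j < key.length; j++) {
                key[j] = row[quasiIdentifierColumns[j]];
            }
            counts.merge(Arrays.toString(key), 1, Integer::sum);
        }
        // A single group smaller than k breaks k-anonymity.
        for (int count : counts.values()) {
            if (count < k) {
                return false;
            }
        }
        return true;
    }
}
```

Only the listed columns are compared, so other attributes may still differ between rows of the same group.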

Claims


Application Information

IPC(8): G06F21/62
CPC: G06F21/6254
Inventor: 许国艳, 宋健, 李敏佳, 平萍, 张网娟, 朱帅
Owner: HOHAI UNIV