System for anonymizing set type data by partially deleting certain items

An anonymization technology, applied in the computer field, that addresses problems such as information distortion and preserves the use value of the data.

Inactive Publication Date: 2013-01-09
SHANGHAI JIAO TONG UNIV
5 Cites, 16 Cited by

AI Technical Summary

Problems solved by technology

However, the global deletion method relies on a large number of brute-force deletion operations, which causes serious information distortion. The global generalization method not only changes the appearance of the data itself, but also depends on a generalization hierarchy that data users do not recognize.

Embodiment Construction

[0034] The embodiments of the present invention are described in detail below in conjunction with the accompanying drawings. This embodiment is implemented on the premise of the technical solution of the invention, and detailed implementation methods and specific operating procedures are provided, but the protection scope of the present invention is not limited to the following embodiment.

[0035] The task of this embodiment is to anonymize a simplified set type data set consisting of five records: record one (a), record two (a, b), record three (a, d, c), record four (b, c), and record five (d). Items a, c, and d are privacy items, and only item b is a non-privacy item. The confidence of every sensitive association rule in the anonymized result of the data set is required to be not higher than 0.5.
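The confidence requirement in [0035] can be checked directly on the five example records. The following is a minimal sketch, not the patent's own procedure. It assumes that a "sensitive association rule" is any rule X → s whose consequent s is a single privacy item and whose antecedent X is a non-empty set of other items from the same record; the function names `support`, `confidence`, and `risky_rules` are hypothetical.

```python
from itertools import combinations

# The five records and the privacy items from paragraph [0035].
records = [{"a"}, {"a", "b"}, {"a", "d", "c"}, {"b", "c"}, {"d"}]
sensitive = {"a", "c", "d"}
THRESHOLD = 0.5  # required upper bound on confidence

def support(itemset, data):
    """Number of records that contain every item of `itemset`."""
    return sum(1 for r in data if itemset <= r)

def confidence(antecedent, consequent, data):
    """support(antecedent + consequent) / support(antecedent)."""
    denom = support(antecedent, data)
    return support(antecedent | consequent, data) / denom if denom else 0.0

def risky_rules(data, sensitive, threshold):
    """List rules X -> s (s a privacy item) whose confidence exceeds threshold.

    Assumption: only non-empty antecedents drawn from an actual record
    are enumerated; the patent text does not spell out this detail.
    """
    found = set()
    for r in data:
        for size in range(1, len(r)):
            for ante in combinations(sorted(r), size):
                ante = frozenset(ante)
                for s in (r - ante) & sensitive:
                    c = confidence(ante, {s}, data)
                    if c > threshold:
                        found.add((tuple(sorted(ante)), s, c))
    return sorted(found)

violations = risky_rules(records, sensitive, THRESHOLD)
for ante, s, c in violations:
    print(f"{set(ante)} -> {s}: confidence {c:.2f}")
```

Under these assumptions, the only rules above the 0.5 bound come from record three, where any two of its items determine the third with confidence 1.0. Rules such as b → a sit exactly at 0.5 and therefore satisfy the requirement.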

[0036] As shown in Figure 1, this embodiment includes six modules: a data set preprocessing module, a divide-and-conquer module that accelerates anonymization, a risk-...

Abstract

The invention provides a system for anonymizing set type data by partially deleting certain items. The system preprocesses a dataset and then eliminates the dangerous sensitive strong association rules in the dataset through multiple rounds of iteration, while deleting as few items as possible. Each iteration comprises the following steps: screening sensitive strong association rules from the dataset; and partially deleting certain items of those rules from the dataset, so that the dangerous sensitive strong association rules either become secure sensitive weak association rules or disappear from the dataset. The iteration exits only when no dangerous sensitive strong association rule remains in the dataset. The system also applies a divide-and-conquer concept to accelerate anonymization, so that the process can be executed concurrently by a plurality of threads. This greatly improves efficiency, provided that the number of deleted items does not increase sharply.
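The iteration described in the abstract (screen for a dangerous rule, partially delete, repeat until none remains) can be sketched as follows. This is an illustrative greedy loop under stated assumptions, not the patented system: it treats a rule X → s as dangerous when s is a privacy item and the confidence exceeds the threshold, it always deletes the consequent s from the first supporting record, and it makes no attempt at the minimal-deletion or multi-threaded divide-and-conquer behaviour the abstract claims. The names `find_risky_rule` and `anonymize` are hypothetical.

```python
from itertools import combinations

def support(itemset, data):
    """Number of records that contain every item of `itemset`."""
    return sum(1 for r in data if itemset <= r)

def find_risky_rule(data, sensitive, threshold):
    """Return one (antecedent, item) rule above the threshold, or None."""
    for r in data:
        for size in range(1, len(r)):
            for ante in combinations(sorted(r), size):
                ante = frozenset(ante)
                for s in sorted((r - ante) & sensitive):
                    # support(ante) >= 1 because ante is drawn from r itself
                    if support(ante | {s}, data) / support(ante, data) > threshold:
                        return ante, s
    return None

def anonymize(data, sensitive, threshold):
    """Delete one item occurrence per round until no risky rule remains.

    Greedy assumption: remove the sensitive consequent from the first
    record that supports the rule. Each round removes exactly one item,
    so the loop terminates.
    """
    data = [set(r) for r in data]  # work on a copy
    deleted = 0
    while True:
        rule = find_risky_rule(data, sensitive, threshold)
        if rule is None:
            return data, deleted
        ante, s = rule
        for r in data:
            if ante | {s} <= r:
                r.remove(s)
                deleted += 1
                break

# Example data and threshold taken from the embodiment (paragraph [0035]).
records = [{"a"}, {"a", "b"}, {"a", "d", "c"}, {"b", "c"}, {"d"}]
sensitive = {"a", "c", "d"}
result, deleted = anonymize(records, sensitive, 0.5)
print(result, "items deleted:", deleted)
```

Unlike global deletion, the loop removes an item only from individual records, so other occurrences of that item stay in the data set. A heuristic that picks which item and which record to delete, as the abstract implies, would replace the greedy choice made here.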

Description

Technical field

[0001] The invention relates to a system framework in the field of computer technology, in particular to a system for anonymizing collective data by partially deleting certain items.

Background technique

[0002] With the rapid development and popularization of computer technology, massive digital information is quietly multiplying. Whether it is government organizations, social institutions, corporate groups, or individuals, they are inadvertently producing and collecting a wealth of data information. At the same time, the plethora of digital information has also brought new opportunities and challenges to data analysts and related researchers. Scientists and engineers use digital information to carry out various statistical analysis, knowledge mining and other activities to form a summary understanding and rules, guide future related activities and decisions, and make relevant predictions, ultimately accelerating technological progress and improving people...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 朱其立, 许信辉, 贾枭, 潘超
Owner: SHANGHAI JIAO TONG UNIV