Minimum information loss control method in business data anonymity release

A technology of information loss and data concentration, applied in digital data protection, data processing applications, business, etc., can solve the problem of large information loss in anonymous processing methods, and achieve the effect of reducing information loss

Inactive Publication Date: 2015-07-08
北京睿航至臻科技有限公司
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] What the present invention aims to solve is the problem of large information loss in the existing anonymous processing method based on the ρ-uncertainty privacy protection model, and provides a method for controlling the minimal information loss in the anonymous publishing of business data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Minimum information loss control method in business data anonymity release
  • Minimum information loss control method in business data anonymity release
  • Minimum information loss control method in business data anonymity release

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] A minimal information loss control method in the anonymous publishing of business data, such as figure 1 shown, including the following steps:

[0030] Step 1: Define the privacy constraint condition ρ, generalize all non-sensitive items in the data set to be released to the highest level according to the generalization hierarchy tree, and do nothing for sensitive items.

[0031] Step 2: Check whether the generalized data set meets the privacy conditions, if not, partly delete sensitive items to meet the requirements of privacy protection. Calculate the probability that each association rule in the data set appears in the data, that is, the confidence ρ'. If ρ’≤ρ, the current data set will be retained; if ρ’>ρ, the current data set will be partially deleted to form a data set that meets the privacy conditions.

[0032] Partial deletion method: traverse the entire data set, assuming that the background knowledge of the attacker is X, and the support degree SUP(X) of X ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a minimum information loss control method in business data anonymity release. The method comprises the steps that firstly, nonsensitive items in a dataset are generalized to the highest level knot; secondly, privacy checking is performed on the generalized dataset, if the confidence coefficient is larger than a privacy constraint condition p, the current dataset is partially deleted and processed, when the current dataset is partially deleted and processed, privacy checking is constantly called and stopped until the current dataset is in accordance with a data privacy protection anonymity condition, the current dataset which is in accordance with the privacy protection anonymity requirements is transferred to a minimum information control mechanism, and whether grouping and refining continue to be performed or not is determined through the front and back information loss comparison. If the information loss after grouping and refining processing is small, grouping and refining processing continues to be performed, and meanwhile the above process is performed on multiple sub-branch groups formed by one group. If the information loss increases on the contrary after grouping and refining, then the current branch group is directly released.

Description

technical field [0001] The invention relates to the technical field of e-commerce, in particular to a method for controlling minimal information loss in anonymous publishing of business data. Background technique [0002] With the rapid development of e-commerce, a large amount of data is generated in the Internet, for example: transaction data generated by querying and browsing commodities on various shopping websites, online or offline shopping and product reviews. These data contain rich resources, such as group shopping habits and commodity market research, etc. In order to attract more consumers, increase cross-selling, and improve consumer loyalty, these data are analyzed reasonably by publishing data sets and induction, to customize more accurate personalized services for consumers, and use these data to mine more accurate potential models, and conduct business forecasts to help business decision makers better adjust market strategies, avoid risks, and make correct de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/60G06Q30/00
Inventor 李先贤陈刘华刘鹏王利娥辛如意
Owner 北京睿航至臻科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products