Detrended analysis differential privacy protection-based histogram data release method

A de-trend analysis, differential privacy technology, applied in the fields of digital data protection, instrument, character and pattern recognition, etc., can solve the problem that no scholars give good

Active Publication Date: 2018-08-24
NORTHWEST UNIV
View PDF5 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At the same time, no scholars have given a good treatment method for the processing of outliers after judging them. How to sol

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Detrended analysis differential privacy protection-based histogram data release method
  • Detrended analysis differential privacy protection-based histogram data release method
  • Detrended analysis differential privacy protection-based histogram data release method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0086] The present invention is further described below in conjunction with accompanying drawing.

[0087] Based on the above invention, the main implementation steps are as follows:

[0088]1) Detrended histogram orderly processing

[0089] 2) Outlier equilibrium and constrained adaptive histogram clustering

[0090] 3) Add noise to the clustered data

[0091] 4) Release the histogram after differential privacy protection

[0092] Detrended histogram processing

[0093] The ordered histogram sequence has a very important impact on the clustering effect of the histogram buckets, that is, similar bucket count histograms are clustered and published to help reduce reconstruction errors. The main advantage of using detrending analysis is that on the one hand, it reduces the amount of data calculation compared with the discrete analysis of calculating the difference, and on the other hand, it saves adjustment time during the orderly adjustment process. This paper draws on the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a detrended analysis differential privacy protection-based histogram data release method. A method for judging a signal sequence trend is introduced in judgment of histogram anomaly distribution; a large amount of outliers cause relatively great fluctuation of data distribution, so that the stability is reduced; and from the perspective, a histogram bucket count distribution condition is regarded as continuous digital signals to perform data outlier processing. Meanwhile, for a clustering objective function causing the large amount of the outliers in a conventional method, outlier balance constraints and similar penalty constraints are added for balancing the influence of similar bucket and outlier bucket data on clustering, so that the occurrence of the outliers isreduced; and outlier data micro-clustering is performed based on outlier similarity for outlier data.

Description

technical field [0001] The invention belongs to the technical field of computer information security, and in particular relates to a histogram data publishing method for detrending analysis and differential privacy protection. Background technique [0002] The data release method based on the histogram is the most commonly used data release method at present, because it shows the data distribution form vividly, and its statistical results can provide a theoretical basis for the realization of the counting query. The histogram mainly splits the data table into multiple disjoint subsets based on one or more attributes with different attribute values, forming several independent buckets, and uses statistical values ​​to identify each subset (or bucket) Divide the meaning, where the width of each bucket represents a query range, thus realizing the range count query. [0003] In the process of publishing the histogram, in order to meet the differential privacy and improve the us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/60G06K9/62
CPCG06F21/60G06F18/23
Inventor 高岭杨旭东罗昭毛勇孙骞王帆
Owner NORTHWEST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products