A Histogram Data Publishing Method for Detrend Analysis and Differential Privacy Protection

A detrending analysis and differential privacy technology, applied in digital data protection, instruments, computing, etc. It addresses the problem that no good treatment method has been given for outliers after they are identified.

Active Publication Date: 2021-04-13
NORTHWEST UNIV

AI Technical Summary

Problems solved by technology

No good method has yet been proposed for handling outliers once they have been identified. How to cluster outliers is also a key issue for the utility of differentially private histogram data publishing.


Examples


Embodiment Construction

[0086] The present invention is further described below in conjunction with the accompanying drawings.

[0087] Based on the above invention, the main implementation steps are as follows:

[0088] 1) Detrended ordering of the histogram

[0089] 2) Outlier equilibrium and constrained adaptive histogram clustering

[0090] 3) Adding noise to the clustered data

[0091] 4) Releasing the histogram after differential privacy protection
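The four steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented method: a plain sort stands in for the detrended ordering, fixed-size groups stand in for the constrained adaptive clustering, and the function name `publish_histogram` and its parameters are hypothetical. The sketch also ignores the privacy cost of choosing groups from the true counts, which a real mechanism would have to account for.

```python
import numpy as np

def publish_histogram(counts, group_size, epsilon, rng):
    """Order buckets, group them, add Laplace noise to group means, release."""
    counts = np.asarray(counts, dtype=float)
    # Step 1 (assumption): a stable sort stands in for detrended ordering.
    order = np.argsort(counts, kind="stable")
    released = np.empty_like(counts)
    for start in range(0, len(counts), group_size):
        # Step 2 (assumption): fixed-size groups stand in for clustering.
        idx = order[start:start + group_size]
        mean = counts[idx].mean()
        # Step 3: one record changes one bucket by 1, so a group mean
        # changes by at most 1/len(idx); scale the Laplace noise to match.
        noise = rng.laplace(loc=0.0, scale=1.0 / (epsilon * len(idx)))
        # Step 4: every bucket in the group is released as the noisy mean.
        released[idx] = mean + noise
    return released

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    print(publish_histogram([10, 2, 11, 3], group_size=2, epsilon=1.0, rng=rng))
```

Averaging over a group shrinks the noise added to each bucket, at the cost of a reconstruction error that grows when the grouped buckets have dissimilar counts.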

[0092] Detrended histogram processing

[0093] The ordering of the histogram sequence has a very important impact on how well the histogram buckets cluster: clustering and publishing buckets with similar counts together helps reduce reconstruction error. Detrending analysis has two main advantages. First, it requires less computation than a discrete analysis based on computing differences. Second, it saves adjustment time during the ordering process. This paper draws on the ...
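To make the idea of treating bucket counts as a signal concrete, here is a small sketch. The source text is truncated before it gives the actual procedure, so everything specific here is an assumption: the least-squares linear trend, the threshold of `k` residual standard deviations, and the name `detrend_outliers` are illustrative choices only.

```python
import numpy as np

def detrend_outliers(counts, k=2.0):
    """Remove a linear trend from bucket counts and flag large residuals."""
    counts = np.asarray(counts, dtype=float)
    x = np.arange(len(counts))
    # Assumption: the trend is a least-squares straight line.
    slope, intercept = np.polyfit(x, counts, deg=1)
    residual = counts - (slope * x + intercept)
    # Assumption: a bucket is an outlier if its residual exceeds
    # k standard deviations of all residuals.
    flags = np.abs(residual) > k * residual.std()
    return residual, flags

if __name__ == "__main__":
    residual, flags = detrend_outliers([1, 2, 3, 4, 50, 6, 7, 8])
    print(np.flatnonzero(flags))  # indices of buckets flagged as outliers
```

In this toy input only the bucket with count 50 departs strongly from the rising trend, so it is the one that gets flagged.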


Abstract

A histogram data publishing method for detrending analysis and differential privacy protection. The method applies trend detection for signal sequences to the detection of abnormal distributions in histograms. A large number of outliers causes relatively large fluctuations in the data distribution and reduces its smoothness; from this point of view, the histogram bucket count distribution is treated as a continuous digital signal in order to identify outliers. In addition, because the clustering objective function of traditional methods produces a large number of outliers, an outlier equilibrium constraint and a similarity penalty constraint are added to balance the influence of similar buckets and outlier buckets on clustering and to reduce the occurrence of outliers. Outlier data are then micro-clustered based on outlier similarity.

Description

Technical field

[0001] The invention belongs to the technical field of computer information security, and in particular relates to a histogram data publishing method for detrending analysis and differential privacy protection.

Background technique

[0002] Histogram-based data release is currently the most commonly used data release method, because it presents the shape of the data distribution clearly and its statistical results provide a theoretical basis for answering count queries. A histogram splits the data table into multiple disjoint subsets according to the values of one or more attributes, forming several independent buckets, and uses a statistical value to summarize each subset (or bucket). The width of each bucket represents a query range, which makes range count queries possible.

[0003] In the process of publishing the histogram, in order to meet the differential privacy and improve the us...
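The range count query described in paragraph [0002] can be illustrated with a short sketch. The function names, the half-open bucket convention, and the sample data are assumptions made for this example and do not come from the patent text.

```python
def build_histogram(values, edges):
    """Count values into half-open buckets [edges[i], edges[i + 1])."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            if edges[i] <= v < edges[i + 1]:
                counts[i] += 1
                break
    return counts

def range_count(counts, first_bucket, last_bucket):
    """Answer a range count query by summing consecutive bucket counts."""
    return sum(counts[first_bucket:last_bucket + 1])

if __name__ == "__main__":
    ages = [23, 35, 37, 41, 58, 62]          # hypothetical attribute values
    edges = [20, 30, 40, 50, 60, 70]         # five buckets of width 10
    counts = build_histogram(ages, edges)
    print(counts)                            # [1, 2, 1, 1, 1]
    print(range_count(counts, 1, 2))         # ages in [30, 50): 3
```

Because a query is answered from bucket counts alone, any noise added to those counts carries over directly into the query answer.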


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F21/60, G06K9/62
CPC: G06F21/60, G06F18/23
Inventor: 高岭, 杨旭东, 罗昭, 毛勇, 孙骞, 王帆
Owner: NORTHWEST UNIV