Method for protecting privacy under condition of MapReduce data processing frameworks

A privacy protection and data processing technique, applied in the field of big data, that addresses the inapplicability of traditional privacy protection methods to the MapReduce framework

Inactive Publication Date: 2015-04-01
LANGCHAO ELECTRONIC INFORMATION IND CO LTD
3 Cites, 13 Cited by

AI Technical Summary

Problems solved by technology

Existing methods are essentially traditional data privacy protection techniques. They suit small-scale relational databases and file systems, and are not suitable for the current MapReduce computing framework.

Examples

Embodiment Construction

[0015] The implementation of the present invention is described in detail below with reference to the accompanying drawings and examples, so that the way technical means are applied to solve the technical problem and achieve the technical effect can be fully understood and reproduced. Provided there is no conflict, the embodiments of the present invention and the features of those embodiments may be combined with one another, and all such combinations fall within the protection scope of the present invention.

[0016] The present invention uses a typical case in the big data analysis process to illustrate the implementation.

[0017] The two methods are briefly described below.

[0018] 1. Mandatory range check:

[0019] In fact, there are many application scenarios where the output range of the Mapper is predictable. For example, for Douban's movie rating data, the output value of the Mapper must be within the range of (1,10). The data provider can also predefine an output range MaxR...
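
The range check described above can be illustrated with a minimal Python sketch. The names `MaxRange`, `contains`, and `split_by_range` are hypothetical and chosen only for illustration, and the bounds are treated as inclusive, which is an assumption (the text writes the range as (1,10) without stating whether the endpoints are allowed).

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class MaxRange:
    """Predefined output range for a Mapper (hypothetical helper type)."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        # Assumption: both endpoints count as valid outputs.
        return self.low <= value <= self.high


def split_by_range(outputs: Iterable[float],
                   max_range: MaxRange) -> Tuple[List[float], List[float]]:
    """Separate Mapper outputs into in-range and out-of-range values."""
    in_range: List[float] = []
    out_of_range: List[float] = []
    for value in outputs:
        (in_range if max_range.contains(value) else out_of_range).append(value)
    return in_range, out_of_range


# Movie ratings are expected to lie between 1 and 10.
RATING_RANGE = MaxRange(low=1, high=10)
valid, suspicious = split_by_range([7.5, 9, 42, -3, 1], RATING_RANGE)
```

In this sketch an output such as 42 cannot be a genuine rating, so it is flagged rather than passed on unchanged.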

Abstract

The invention provides a method for protecting privacy under the MapReduce data processing framework and belongs to the field of big data. The method proceeds as follows: (1) the user defines the maximum output range MaxRange of the Mapper function according to the particulars of the application environment; (2) the result computed by the Mapper function is checked against MaxRange; (3) if the result is within MaxRange, Laplacian noise is added to the Mapper output according to the differential privacy formula; if it is not, a number randomly selected from MaxRange is used as that Mapper's output instead. The method reduces the number of Mapper functions that must execute differential privacy protection, which shortens the running time of the algorithm. It also avoids the excessive noise caused by Mapper output values that are too high or too low, satisfies the requirements of differential privacy, and improves query precision.
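
The three steps in the abstract can be sketched in Python as follows. This is an illustrative sketch, not the patented implementation: the function names, the `epsilon` privacy parameter, and the choice of noise scale `(high - low) / epsilon` are assumptions, since the source refers only to "differential privacy protection formulas" without giving them. The Laplace sample is drawn as the difference of two exponential variates, a standard construction.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def protect_mapper_output(value: float, low: float, high: float,
                          epsilon: float, rng: random.Random) -> float:
    """Apply the range check, then either add noise or substitute a value.

    low/high play the role of MaxRange; epsilon is an assumed privacy
    parameter that the source does not name.
    """
    if low <= value <= high:
        # Assumption: sensitivity is taken as the width of MaxRange.
        scale = (high - low) / epsilon
        return value + laplace_noise(scale, rng)
    # Out-of-range result: replace it with a random number from MaxRange.
    return rng.uniform(low, high)


rng = random.Random(0)
noisy_rating = protect_mapper_output(7.0, 1.0, 10.0, epsilon=1.0, rng=rng)
replaced = protect_mapper_output(42.0, 1.0, 10.0, epsilon=1.0, rng=rng)
```

Under these assumptions, `noisy_rating` is 7.0 plus zero-mean noise, while `replaced` always lies inside the declared range, so an out-of-range Mapper output never reaches the result as-is.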

Description

Technical Field

[0001] The invention relates to the field of big data, and in particular to a method for protecting privacy under the MapReduce data processing framework. The method adds Laplacian noise, following the differential privacy protection strategy, to protect the privacy of Mapper output results, and uses a mandatory range check to eliminate malicious code.

Background Art

[0002] Today, the development of social informatization and networking has led to explosive growth of data. According to statistics on China's Internet data volume alone, Baidu receives more than 1 billion visits a day and maintains an index of more than 100 billion web pages; there are more than 500 million daily active social users, and 4.5 billion pictures are shared; transaction volume has exceeded 20 billion, with a daily transaction peak of 100 million. At the same time, various industries such as scientific computing, medical and health care, finance, a...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/62
CPC: G06F21/6245
Inventor: 苏志远, 辛国茂, 亓开元, 刘伟, 曹连超, 金洪殿
Owner: LANGCHAO ELECTRONIC INFORMATION IND CO LTD