Supercharge Your Innovation With Domain-Expert AI Agents!

Privacy protection method based on Hadoop platform under MapReduce environment

A privacy protection and environmental technology, applied in the field of privacy protection, can solve data leakage, malicious analysis and other problems

Active Publication Date: 2020-10-02
NANJING UNIV OF POSTS & TELECOMM +1
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problems of data leakage and malicious analysis in the process of computing and processing mass data in a distributed environment, the present invention proposes a privacy protection method based on the Hadoop platform MapReduce environment, which ensures data privacy and security. Under the premise, it has a high classification accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Privacy protection method based on Hadoop platform under MapReduce environment
  • Privacy protection method based on Hadoop platform under MapReduce environment
  • Privacy protection method based on Hadoop platform under MapReduce environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0034] Hadoop is currently one of the excellent platforms that can complete parallel processing work, and it can shorten the time for massive data analysis in a parallel manner. When a node in the Hadoop cluster suddenly goes down, the platform can implement task scheduling and redistribution, and has better fault tolerance. Relying on its parallel working method, the processing speed of Hadoop platform is very fast. At the same time, because of its scalability, it can handle data information on the order of PB (PetaByte) (1PB=1024TB). To sum up, the Hadoop platform can be easily built in a short time, and the platform is simple to operate and easy to use. Because of its relatively low cost, users do not need to worry about the price when using the platform, which relieves users from worries. Users will not worry about being too r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method aims at solving the problems that in the operation processing process of mass data in a distributed environment, data leakage and malicious analysis are likely to happen. The invention provides a privacy protection method based on a Hadoop platform under a MapReduce environment. According to the method, a data set is allocated to each node by using a MapReduce technology in a Hadoop cluster in combination with a random record extraction mode, and Map sub-tasks are started to perform data integration processing. Reduce sub-tasks complete attribute selection and updating by using an exponential mechanism, and finally random noise is added to leaf nodes, so that a classification result meets differential privacy. According to the method, the data availability is guaranteed, and meanwhile the good classification accuracy is achieved.

Description

technical field [0001] The invention belongs to the field of big data processing based on a cloud computing platform, and in particular relates to a privacy protection method based on a Hadoop platform MapReduce environment. Background technique [0002] Today's society is an information society, and human data and information are everywhere. In the context of the development of the era of big data, technologies such as cloud computing have become an important method for processing big data. The design concept of MapReduce makes this technology have significant advantages in parallel computing and data storage. When using MapReduce technology to complete the calculation and processing of large data sets, it needs to go through the analysis and integration of the two main parts of the Map stage and the Reduce stage. The distributed processing framework guarantees the fault-tolerant performance of Map and Reduce, and the calculation and processing of data can be completed on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62G06F16/22G06F16/27G06F16/25G06F16/13G06F16/182G06K9/62
CPCG06F21/6245G06F16/2282G06F16/27G06F16/25G06F16/137G06F16/182G06F18/24323G06F18/214
Inventor 李鹏王璇璇徐鹤王汝传樊卫北朱枫程海涛蓝东婉李友涛张结魁
Owner NANJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More