Supercharge Your Innovation With Domain-Expert AI Agents!

A privacy protection method based on hadoop platform mapreduce environment

A privacy protection and environmental technology, applied in the field of privacy protection, can solve problems such as data leakage and malicious analysis

Active Publication Date: 2021-10-08
NANJING UNIV OF POSTS & TELECOMM +1
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problems of data leakage and malicious analysis in the process of computing and processing mass data in a distributed environment, the present invention proposes a privacy protection method based on the Hadoop platform MapReduce environment, which ensures data privacy and security. Under the premise, it has a high classification accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A privacy protection method based on hadoop platform mapreduce environment
  • A privacy protection method based on hadoop platform mapreduce environment
  • A privacy protection method based on hadoop platform mapreduce environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The technical solution of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0034] Hadoop is currently one of the excellent platforms that can complete parallel processing work, and it can shorten the time for massive data analysis in a parallel manner. When a node in the Hadoop cluster suddenly goes down, the platform can implement task scheduling and redistribution, and has better fault tolerance. Relying on its parallel working method, the processing speed of Hadoop platform is very fast. At the same time, because of its scalability, it can handle data information on the order of PB (PetaByte) (1PB=1024TB). To sum up, the Hadoop platform can be easily built in a short time, and the platform is simple to operate and easy to use. Because of its relatively low cost, users do not need to worry about the price when using the platform, which relieves users from worries. Users will not worry about being too r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Aiming at the problems of data leakage and malicious analysis easily encountered in the process of computing and processing massive data in a distributed environment, the present invention proposes a privacy protection method based on the Hadoop platform MapReduce environment, which uses MapReduce in the Hadoop cluster Technology, combined with the method of randomly extracting records, distributes the data set to each node, and starts the Map sub-task to integrate and process the data. The Reduce sub-task uses the exponential mechanism to complete the selection and update of attributes, and finally adds random noise to the leaf nodes, so that the classification results meet differential privacy. This method also has better classification accuracy while ensuring data availability.

Description

technical field [0001] The invention belongs to the field of big data processing based on a cloud computing platform, and in particular relates to a privacy protection method based on a Hadoop platform MapReduce environment. Background technique [0002] Today's society is an information society, and human data and information are everywhere. In the context of the development of the era of big data, technologies such as cloud computing have become an important method for processing big data. The design concept of MapReduce makes this technology have significant advantages in parallel computing and data storage. When using MapReduce technology to complete the calculation and processing of large data sets, it needs to go through the analysis and integration of the two main parts of the Map stage and the Reduce stage. The distributed processing framework guarantees the fault-tolerant performance of Map and Reduce, and the calculation and processing of data can be completed on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F21/62G06F16/22G06F16/27G06F16/25G06F16/13G06F16/182G06K9/62
CPCG06F21/6245G06F16/2282G06F16/27G06F16/25G06F16/137G06F16/182G06F18/24323G06F18/214
Inventor 李鹏王璇璇徐鹤王汝传樊卫北朱枫程海涛蓝东婉李友涛张结魁
Owner NANJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More