Government affair big data anonymity system and method based on Spark
A technology of big data and government affairs, applied in the field of data management, it can solve the problems of increasing inaccuracy, not considering the situation of global data association, desensitizing global data speculation, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0024] Hadoop is an open source distributed computing platform under the Apache Foundation. It uses the distributed file system HDFS and the MapReduce algorithm as the core, and provides users with a distributed infrastructure with transparent details of the underlying system. The Hadoop platform includes the following two core components: 1) HDFS: a distributed file system that stores massive amounts of data. It is a scalable, fault-tolerant, high-performance distributed file system, asynchronous replication, one write multiple reads, mainly responsible for storage; 2) MapReduce: parallel processing framework to achieve task decomposition and scheduling. Contains map (mapping) and reduce (reduction) process, responsible for computing on HDFS. Hadoop has the following characteristics: 1) High scalability: it can reliably store and process gigabytes of data, theoretically unlimited; 2) Low cost: learn from Google, it can distribute and process data through a server group compos...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
