A data prefetching method based on mapreduce
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- UNIV OF ELECTRONICS SCI & TECH OF CHINA
- Publication Date
- 2018-02-09
- Estimated Expiration
- Not applicable · inactive patent
Smart Images

Figure 1
Abstract
Description
technical field
[0001] The invention belongs to the technical field of computers, and in particular relates to a data prefetching method related to MapReduce. Background technique
[0002] As one of the core components of Hadoop, MapReduce is mainly used for distributed computing. On the Hadoop platform, the distributed computing framework of MapReduce is built on the distributed file system HDFS (Hadoop Distributed FileSystem), that is to say, the data input and data output required by the MapReduce framework are based on HDFS. When MapReduce performs data processing, it divides a large job into small computing tasks. These small tasks are divided into Map tasks and Reduce tasks. Map tasks obtain data from HDFS as input, and different Map tasks are independent of each other. The data input of the Reduce task comes from the output of the Map, and finally the processed data is stored on HDFS.
[0003] When the HDFS distributed file system stores data, the data is divided in...