The invention discloses a method for balancing data blocks while jobs are running on an HDFS (Hadoop Distributed File System). The method comprises the following steps: first, pre-processing the local task list of each node by dividing it into entirely local tasks and non-entirely local tasks, which provides the basis for deciding when to start HDFS data block balancing; second, estimating the operation rate of each node and predicting its task requests; third, designing and carrying out the task assignment process for each node once the preceding steps are complete; fourth, selecting suitable nodes and moving data blocks between them, so that the distribution of the data blocks matches the predicted sequence of node task requests; and finally, balancing the data blocks. By predicting node task requests in advance, the method identifies non-local map task executions that are likely to occur and moves the appropriate data blocks between the corresponding nodes, so that a node can be given a local map task when it sends its actual task request. The completion efficiency of the Map phase is thereby improved.
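
The steps above can be illustrated with a minimal Python sketch. It is not the claimed implementation, and every name in it (split_local_tasks, predict_requests, plan_moves) is hypothetical. It assumes that a task is "entirely local" to a node when that node holds the only replica of the task's data block, that each node runs one task at a time and requests the next one every 1/rate time units, and that a block to be moved is chosen by a simple heuristic (the pending block whose fastest holder is slowest). None of these details is stated in the abstract.

```python
import heapq


def split_local_tasks(replicas, nodes):
    """Divide each node's local task list into (entirely local, non-entirely local).

    replicas maps a task name to the set of nodes holding its data block.
    Assumption: a task is entirely local to a node if that node is the only holder.
    """
    result = {n: ([], []) for n in nodes}
    for task, holders in sorted(replicas.items()):
        bucket = 0 if len(holders) == 1 else 1
        for n in sorted(holders):
            result[n][bucket].append(task)
    return result


def predict_requests(rates, count):
    """Predict the order in which nodes will send their next `count` task requests.

    rates maps a node to its estimated operation rate (tasks per time unit).
    Assumption: every node requests at time 0 and then once every 1/rate.
    """
    heap = [(0.0, n) for n in sorted(rates)]
    heapq.heapify(heap)
    sequence = []
    while len(sequence) < count:
        time, node = heapq.heappop(heap)
        sequence.append(node)
        heapq.heappush(heap, (time + 1.0 / rates[node], node))
    return sequence


def plan_moves(replicas, rates):
    """Simulate assignment over the predicted requests and list the block moves
    (task, source node, destination node) that would keep every task local."""
    pending = {t: set(h) for t, h in replicas.items()}
    moves = []
    for node in predict_requests(rates, len(replicas)):
        local = [t for t in pending if node in pending[t]]
        if local:
            # Serve tasks with the fewest replicas first (entirely local ones lead).
            task = min(local, key=lambda t: (len(pending[t]), t))
        else:
            # Predicted non-local execution: pick the pending block whose fastest
            # holder is slowest, and plan to move it to the requesting node.
            task = min(pending, key=lambda t: (max(rates[n] for n in pending[t]), t))
            moves.append((task, min(pending[task]), node))
        del pending[task]
    return moves
```

For example, with rates {"A": 2.0, "B": 1.0} and blocks t1 held only by A and t2, t3, t4 held only by B, the predicted request sequence is A, B, A, A, and plan_moves returns [("t3", "B", "A"), ("t4", "B", "A")]: the faster node A would otherwise run two non-local map tasks, so those two blocks are moved to it ahead of its requests.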