The invention discloses a method for real-time updating of remote files based on content-defined chunking. The invention aims to provide an incremental file update method with low network transmission overhead for a distributed storage system, so that the distributed storage system can support random writes to a file at low cost. According to the technical scheme, the method comprises the following steps: storing the original file Fold in chunks; calculating a digest for each data chunk with a hash algorithm; calculating a fingerprint for the beginning and for the end of each data chunk with the Rabin fingerprinting algorithm; chunking the new file Fnew by content-defined chunking with reference to the chunking information of the original file Fold; calculating a digest for each data chunk of Fnew with the hash algorithm; comparing the digests of the Fold data chunks with the digests of the Fnew data chunks to find the changed data chunks; and deleting the obsolete data chunks from Fold and adding the updated data chunks to Fold. With this method, the distributed storage system can support random writes to a file at low cost.
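
The chunk, digest, compare, and update steps can be illustrated with a minimal Python sketch. It is not the claimed implementation: a simple polynomial rolling hash stands in for Rabin fingerprinting, SHA-256 is assumed as the digest algorithm, and the per-chunk beginning and end fingerprints used to align Fnew with the chunking information of Fold are not modeled. All names and parameters (WINDOW, MASK, MIN_CHUNK, chunk, diff, apply_update) are hypothetical.

```python
import hashlib

# Illustrative parameters (assumptions, not taken from the method).
WINDOW = 16            # bytes covered by the rolling hash
MIN_CHUNK = 16         # minimum chunk length; equals WINDOW so every
                       # boundary test sees a full window
MASK = (1 << 6) - 1    # cut where the low 6 bits of the hash are zero
BASE = 257
MOD = (1 << 31) - 1


def chunk(data: bytes) -> list:
    """Split data at content-defined boundaries (rolling-hash stand-in)."""
    chunks, start, h = [], 0, 0
    top = pow(BASE, WINDOW - 1, MOD)  # weight of the byte leaving the window
    for i, b in enumerate(data):
        if i - start >= WINDOW:
            # Slide the window: drop the oldest byte before adding the new one.
            h = (h - data[i - WINDOW] * top) % MOD
        h = (h * BASE + b) % MOD
        if i - start + 1 >= MIN_CHUNK and (h & MASK) == 0:
            chunks.append(data[start:i + 1])
            start, h = i + 1, 0
    if start < len(data):
        chunks.append(data[start:])  # trailing chunk without a boundary
    return chunks


def digest(c: bytes) -> str:
    """Digest of one data chunk (SHA-256 is an assumption)."""
    return hashlib.sha256(c).hexdigest()


def diff(old: bytes, new: bytes):
    """Compare chunk digests of Fold and Fnew and list the changes."""
    old_ids = {digest(c) for c in chunk(old)}
    new_chunks = [(digest(c), c) for c in chunk(new)]
    new_ids = {d for d, _ in new_chunks}
    to_delete = old_ids - new_ids                      # chunks no longer used
    to_add = {d: c for d, c in new_chunks if d not in old_ids}
    recipe = [d for d, _ in new_chunks]                # chunk order of Fnew
    return to_delete, to_add, recipe


def apply_update(store: dict, to_delete, to_add, recipe) -> bytes:
    """Apply the change set to a digest-keyed chunk store, rebuild Fnew."""
    for d in to_delete:
        store.pop(d, None)
    store.update(to_add)
    return b"".join(store[d] for d in recipe)
```

In this sketch only the chunks in to_add and the digest list in recipe would have to cross the network. Chunks of Fnew whose digests already exist in the store are reused in place.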