The invention discloses a block-based Web record linkage system comprising a Web crawler, a sample database, a Web record database, a block attribute analysis module, a blocking module, a block balancing module, a pairwise matching module, a match determination module, and a record linkage result set. In the corresponding block-based Web record linkage method, data from multiple data sources are quickly partitioned into blocks using the MapReduce model, and records are compared within the blocks, which greatly improves record matching efficiency. On this basis, the block sizes are balanced, which further improves record matching efficiency. The recall of record linkage is also improved by blocking the data set from multiple angles with multiple blocking functions.
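The blocking, balancing, and pairwise candidate generation described above can be sketched in plain Python as follows. This is a minimal single-process illustration, not the disclosed implementation: the record fields (`id`, `title`, `year`), the two blocking functions, the `max_size` threshold, and the balancing strategy (splitting an oversized block into chunks and comparing both within and across chunks) are all assumptions made for the example, and the map and reduce phases are simulated with ordinary functions rather than a real MapReduce framework.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical blocking functions: each maps a record to one block key.
def key_by_title_prefix(record):
    return "title:" + record["title"][:3].lower()

def key_by_year(record):
    return "year:" + str(record["year"])

def map_phase(records, blocking_functions):
    """Emit one (block_key, record_id) pair per record and blocking function."""
    for record in records:
        for fn in blocking_functions:
            yield fn(record), record["id"]

def reduce_phase(pairs):
    """Group record ids by block key, as a shuffle-and-reduce step would."""
    blocks = defaultdict(list)
    for key, record_id in pairs:
        blocks[key].append(record_id)
    return dict(blocks)

def balance_blocks(blocks, max_size):
    """Turn blocks into match tasks whose sides hold at most max_size ids.

    An oversized block is cut into chunks; one task compares each chunk
    with itself and one task compares each pair of chunks, so no
    comparison from the original block is lost.
    """
    tasks = []
    for ids in blocks.values():
        chunks = [ids[i:i + max_size] for i in range(0, len(ids), max_size)]
        for i, left in enumerate(chunks):
            tasks.append((left, left))
            for right in chunks[i + 1:]:
                tasks.append((left, right))
    return tasks

def candidate_pairs(tasks):
    """Collect the distinct record pairs that the match tasks would compare."""
    pairs = set()
    for left, right in tasks:
        if left is right:  # within-chunk task
            pairs.update(tuple(sorted(p)) for p in combinations(left, 2))
        else:  # cross-chunk task
            pairs.update(tuple(sorted((a, b))) for a in left for b in right)
    return pairs

# Made-up sample records for illustration only.
records = [
    {"id": 1, "title": "Canon EOS 5D", "year": 2008},
    {"id": 2, "title": "canon eos 5d mark ii", "year": 2008},
    {"id": 3, "title": "EOS 5D by Canon", "year": 2008},
    {"id": 4, "title": "Nikon D90", "year": 2009},
]

blocks = reduce_phase(map_phase(records, [key_by_title_prefix, key_by_year]))
tasks = balance_blocks(blocks, max_size=2)
print(sorted(candidate_pairs(tasks)))
```

In this sample, blocking on the title prefix alone would propose only the pair of records 1 and 2; adding the year-based blocking function also brings record 3 into comparison with both, which illustrates how several blocking functions can raise recall. The candidate pairs would then be passed to whatever similarity comparison and match decision the matching modules apply.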