The invention discloses a text multi-
granularity similarity comparison method based on semantic aggregation fingerprints. The method comprises the following steps: training word vector representation;extracting semantic features; performing multi-
feature aggregation; constructing a hierarchical index; calculating similarity. According to the method, word vector representation modeling is carriedout in combination with multi-dimensional semantic correlation;
semantic information among words is fully mined; characteristics are extracted by taking sentences as units, semantic features are represented by adopting multiple weights, text
library statistics and distribution information are mined by utilizing a
statistical learning method, finer division of a feature space is realized, a compacttext
fingerprint with high identification degree is generated on the basis of multi-
feature aggregation, and the description capability and the discrimination degree of the text
fingerprint are effectively improved. According to the method, text similarity comparison is carried out by adopting a top-down thought and using semantic aggregation
fingerprint and local semantic features, and global-to-local multi-
granularity similarity comparison of texts can be quickly and efficiently realized by constructing hierarchical indexes; the method has good expandability.