The invention provides a method for matching the similarity of Chinese text. An edit distance formula and keyboard fingering rules are used to obtain the editing similarity of the pinyin corresponding to the Chinese text, which reflects whether the Chinese text and its pinyin are easily confused during input. The pronunciation rules for the initial consonants and finals of Chinese characters are used to obtain the initial consonant similarity and the final similarity of character strings, and these are combined with the fuzzy sounds common in dialects and in everyday mispronunciation to calculate the pronunciation similarity between character strings. Because character shape is one of the most important characteristics of Chinese, a character shape coding, namely the Five-stroke Method (Wubi) coding, is used to calculate the character shape similarity between character strings. Information is collected while these calculations are performed and is used to update the data. The above similarities are combined to obtain the overall similarity of Chinese words. Factors such as Chinese spelling habits, user input habits, keyboard layout, Mandarin pronunciation rules, dialects, common mispronunciations, and Chinese character shapes are thus fully considered, together with statistical regularities, so that the similarity between Chinese words is evaluated comprehensively.
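
The combination of similarities described above may be illustrated by the following minimal sketch. It is not the implementation of the invention: the QWERTY adjacency rule, the 0.5 cost for neighbouring keys, the fuzzy sound pairs, the 0.8 fuzzy score, the weights (0.4, 0.4, 0.2), and all function names are assumptions chosen for illustration. The sketch handles one pinyin syllable per call and takes the Five-stroke codes as plain input strings rather than looking them up.

```python
from typing import Callable, Tuple

# Assumed QWERTY layout; keys in neighbouring rows/columns count as adjacent.
ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]
KEY_POS = {ch: (r, c) for r, row in enumerate(ROWS) for c, ch in enumerate(row)}

# Commonly cited fuzzy sound pairs (an assumed, non-exhaustive list).
FUZZY_INITIALS = {frozenset(p) for p in
                  [("z", "zh"), ("c", "ch"), ("s", "sh"), ("n", "l"), ("f", "h"), ("r", "l")]}
FUZZY_FINALS = {frozenset(p) for p in [("an", "ang"), ("en", "eng"), ("in", "ing")]}

# Two-letter initials come first so that "zh" is matched before "z".
# "y" and "w" are treated as initials here for simplicity.
INITIALS = ["zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l", "g",
            "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w"]


def unit_cost(a: str, b: str) -> float:
    """Plain substitution cost: 0 for equal symbols, 1 otherwise."""
    return 0.0 if a == b else 1.0


def keyboard_cost(a: str, b: str) -> float:
    """Substitution cost that is cheaper (0.5) for adjacent keys."""
    if a == b:
        return 0.0
    if a in KEY_POS and b in KEY_POS:
        (r1, c1), (r2, c2) = KEY_POS[a], KEY_POS[b]
        if abs(r1 - r2) <= 1 and abs(c1 - c2) <= 1:
            return 0.5
    return 1.0


def string_similarity(p: str, q: str,
                      cost: Callable[[str, str], float] = unit_cost) -> float:
    """Levenshtein distance with a pluggable substitution cost, scaled to [0, 1]."""
    if not p and not q:
        return 1.0
    prev = [float(j) for j in range(len(q) + 1)]
    for i, a in enumerate(p, 1):
        cur = [float(i)]
        for j, b in enumerate(q, 1):
            cur.append(min(prev[j] + 1.0,               # deletion
                           cur[j - 1] + 1.0,            # insertion
                           prev[j - 1] + cost(a, b)))   # substitution
        prev = cur
    return 1.0 - prev[-1] / max(len(p), len(q))


def split_syllable(s: str) -> Tuple[str, str]:
    """Split a toneless pinyin syllable into (initial, final)."""
    for ini in INITIALS:
        if s.startswith(ini):
            return ini, s[len(ini):]
    return "", s


def part_similarity(a: str, b: str, fuzzy: set, fuzzy_score: float = 0.8) -> float:
    """1 for identical parts, fuzzy_score for a fuzzy pair, 0 otherwise."""
    if a == b:
        return 1.0
    return fuzzy_score if frozenset((a, b)) in fuzzy else 0.0


def pronunciation_similarity(s1: str, s2: str) -> float:
    """Average of initial similarity and final similarity for two syllables."""
    (i1, f1), (i2, f2) = split_syllable(s1), split_syllable(s2)
    return (0.5 * part_similarity(i1, i2, FUZZY_INITIALS)
            + 0.5 * part_similarity(f1, f2, FUZZY_FINALS))


def word_similarity(pinyin1: str, pinyin2: str, code1: str, code2: str,
                    weights: Tuple[float, float, float] = (0.4, 0.4, 0.2)) -> float:
    """Weighted sum of editing, pronunciation, and character shape similarity."""
    w_edit, w_pron, w_shape = weights
    return (w_edit * string_similarity(pinyin1, pinyin2, keyboard_cost)
            + w_pron * pronunciation_similarity(pinyin1, pinyin2)
            + w_shape * string_similarity(code1, code2))
```

In this sketch, a call such as word_similarity("nan", "lan", code1, code2) would score the n/l pair as a fuzzy initial while still charging a full keyboard substitution, since "n" and "l" are not neighbouring keys. In practice the weights and fuzzy scores would presumably be tuned from the collected statistics rather than fixed by hand.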