The invention discloses a Chinese text automatic
correction method. The method comprises the following steps of: a) inputting a to-be-corrected Chinese text, and performing word segmentation preprocessing on the Chinese text
sentence by
sentence; b) searching for one-character words, two-character words or disperse strings of three or more than three characters occurring in the text subjected to word segmentation
sentence by sentence; c) performing continuous determination on the disperse strings occurring in the text subjected to word segmentation by adopting an N-
gram model, and checking text word level errors for each
single sentence in combination with a word forming probability of separate characters; and d) constructing an error correction
knowledge base to generate an error correction candidate text. According to the Chinese text automatic
correction method provided by the invention, the one-character words, two-character words or disperse strings of three or more than three characters occurring in the text subjected to word segmentation are searched for sentence by sentence, the disperse strings occurring in the text subjected to word segmentation are subjected to continuous determination by adopting the N-
gram model to determine identification errors, and the error correction
knowledge base is constructed to generate the error correction candidate text, so that
error checking and correcting processes are combined very well, and the method has the characteristics of high
error checking speed and high
error correcting efficiency.