The invention relates to a document classifying method based on a network measure index. The document classifying method comprises a sample
training phase and a document classifying phase. The sample
training phase comprises the first step of sample collecting, the second step of text segmenting, the third step of word class analyzing, the fourth step of
function word and name removing, the fifth step of word frequency counting, the sixth step of characteristic set Vd establishing, the seventh step of characteristic network peak establishing, the eighth step of characteristic network edge establishing, the ninth step of average degree calculating, the tenth step of
cluster coefficient calculating, the eleventh step of characteristic
path length calculating and the twelfth step of network measure index interval obtaining. The document classifying phase comprises the first step of
processing a document to be classified and the second step of judging
document classification. According to the document classifying method, classifying is accurate, classifying efficiency is high, the problem that according to an existing classifying method, scientific and
technical literature, novels and prose cannot be distinguished is solved, and a scientific classification method and a theoretical foundation is laid for automatic distinguishing of the scientific and
technical literature, the novels and the prose.