The invention discloses a clustering method for
network behavior habits based on K-means and LDA (
Latent Dirichlet Allocation) two-way
authentication. According to the clustering method, webpage properties, keywords and frequency in internet browsing records of persons are utilized to combine with a K-means
algorithm, an LDA document topic extracting model and an annealing
algorithm. The clustering method comprises the following steps: firstly, performing K-means
algorithm clustering and LDA document topic extracting model generation on a staff-
label-frequency set and a person browsing
record-person-keyword set; secondly, storing and calculating an intermediate result, and then performing K-means and LDA two-way
authentication by using the annealing algorithm; calculating a global best topic-classification
label sequence, and optimizing a
network behavior habit clustering result by taking the global best topic-classification
label sequence as a reference. By means of the K-means and LDA two-way
authentication, the sensitivity to person-classification labels is improved; by using the annealing algorithm, the optimizing efficiency of the clustering result can be improved, and further the clustering accuracy is improved.