Kmeans clustering method for efficacy of traditional Chinese medicinal materials based on node similarity

A clustering method and technology of traditional Chinese medicinal materials, which are applied in the fields of drug reference, medical data mining, instruments, etc., can solve the problems of difficulty in ensuring the quality of clustering, inability to ensure the degree of similarity, and difficulty in controlling the K value of the number of clusters. Achieve the effect of being beneficial to the determination of the number of clusters and the analysis of the results, the accuracy of the clustering results, and the improvement of the quality of the clustering

Pending Publication Date: 2020-07-03
HANGZHOU NORMAL UNIVERSITY
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the traditional Kmeans algorithm has two obvious disadvantages: one is that the K value of the number of clusters is difficult to control. For example, Chinese medicinal materials have many effects, and each medicinal material may have dozens or even hundreds of effects, so the clustering quality will be very low. Difficult to guarantee; second, the calculation of the distance between samples cannot guarantee the true similarity, especially for the special data of traditional Chinese medicine

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Kmeans clustering method for efficacy of traditional Chinese medicinal materials based on node similarity
  • Kmeans clustering method for efficacy of traditional Chinese medicinal materials based on node similarity
  • Kmeans clustering method for efficacy of traditional Chinese medicinal materials based on node similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] In order to make the purpose, implementation and advantages of the present invention more clearly understood, here in conjunction with specific implementation example, be described in further detail, as figure 1 Shown:

[0030] Step 1. Collect relevant traditional Chinese medicine data, and after data processing, form a prescription composition library, a medicinal material efficacy library, and a binary table of properties, flavors, and meridians of medicinal materials.

[0031] Through literature, databases, and other network resources, collect as complete as possible TCM-related data (such as prescriptions, Chinese medicinal materials, Chinese patent medicines, medicinal materials, and their properties and flavors) based on manual methods, web crawlers, etc., and integrate them into prescriptions Composition library, medicinal material efficacy library, and medicinal material property and flavor meridian distribution binary value table. The prescription composition ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Kmeans clustering method for the efficacy of traditional Chinese medicinal materials based on node similarity. The method comprises the following steps: collecting related traditional Chinese medicine data, and processing the data to form a prescription composition library, a medicinal material efficacy library and a channel-tropism binary table of the nature and taste ofmedicinal materials; summarizing and classifying the efficacy of the traditional Chinese medicinal materials according to 23 efficacy tables, and constructing a medicinal material efficacy matrix; constructing a prescription-medicinal material bipartite network based on the prescription composition library; calculating expected values of medicinal material pairs based on degree distribution, andtaking the expected values of the medicinal material pairs as the similarity of the traditional Chinese medicinal materials; establishing a Kmeans clustering model based on the similarity of the traditional Chinese medicinal materials; and clustering the traditional Chinese medicinal materials based on the clustering model to obtain potential effects possibly possessed by the traditional Chinese medicinal materials. According to the method, the accuracy of Kmeans clustering via a medicinal material similarity matrix can reach 0.728. Meanwhile, Kmeans is used for clustering the nature-taste channel-tropism data of traditional medicinal materials, an obtained final result is 0.646, which is about 0.08 higher; and therefore, clustering result is allowed to be more accurate through the method.

Description

technical field [0001] The invention relates to the field of computer-aided drug design, in particular to a Kmeans method for clustering the efficacy of Chinese medicinal materials based on node similarity. Background technique [0002] According to the information of prescriptions and medicinal materials in the present invention, medicinal materials with similar or identical efficacy are grouped into one category. The clustering problem is a typical partition-based problem, and the Kmeans clustering algorithm is relatively simple and commonly used among the partition-based clustering algorithms. Kmeans is an unsupervised learning algorithm, which is a method of group observation, with few adjustable parameters, fast clustering speed and simple method. However, the traditional Kmeans algorithm has two obvious disadvantages: one is that the K value of the number of clusters is difficult to control. For example, Chinese medicinal materials have many effects, and each medicina...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16H70/40G16H50/70G06K9/62
CPCG16H70/40G16H50/70G06F18/23213
Inventor 谭露露周银座吴晨程
Owner HANGZHOU NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products