FP growth algorithm model-based traditional Chinese medicine formula data mining method and system

A data mining method based on an algorithm model, applied in the field of data processing. It addresses the lack of objectivity and unified standards in data preprocessing and the increased search time caused by information interference, and it achieves the effect of reducing information interference.

Inactive Publication Date: 2017-05-10
康美中药材数据信息服务有限公司
3 Cites, 17 Cited by

AI Technical Summary

Problems solved by technology

[0002] At present, most prescription data preprocessing methods start from the content of dictionaries or textbooks and then classify and process the data manually, for example when handling symptom names and drug aliases. This approach lacks objectivity and uniform standards.
In addition, existing formula mining systems require the user to manually input the name of a prescription or medicinal material and then retrieve suitable prescription compatibility information from the database. Because commonly used medicinal materials appear in a large number of prescription combinations, the resulting information interference increases the search time when formulating a prescription.

Examples

Embodiment Construction

[0052] It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0053] The invention provides a traditional Chinese medicine formula data mining method based on an FP-growth algorithm model. Referring to Figure 1a, in one embodiment, the method includes:

[0054] Step S10: preprocess the input traditional Chinese medicine raw data, which includes a single-herb database and a formula database, to generate a first transaction set that takes effects or channel tropisms as characteristic items;

[0055] Data preprocessing is an important step in the data mining process. It includes data cleaning, data integration, data transformation and data reduction. Its purpose is to improve the quality of the data being mined and, ultimately, the quality of the pattern knowledge that data mining produces.
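The preprocessing step can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from the patent: the toy herb and formula records, the alias table, and the helper names `normalize` and `build_transactions`. It shows one plausible way to clean herb names (data cleaning), join the formula database against the single-herb database (data integration), and emit one transaction per formula whose items are effects or channel tropisms.

```python
# Hypothetical single-herb database: herb -> effects and channel tropisms.
SINGLE_HERBS = {
    "ginseng": {"effects": ["tonify qi"], "channels": ["spleen", "lung"]},
    "licorice": {"effects": ["tonify qi", "harmonize"],
                 "channels": ["spleen", "lung", "heart"]},
    "ephedra": {"effects": ["release exterior"], "channels": ["lung", "bladder"]},
}

# Hypothetical alias table used for data cleaning.
ALIASES = {"ren shen": "ginseng", "gan cao": "licorice", "ma huang": "ephedra"}

# Hypothetical formula database: formula -> raw herb names as recorded.
FORMULAS = {
    "formula_a": ["Ren Shen ", "gan cao"],
    "formula_b": ["ma huang", "Gan Cao", "unknown herb"],
}


def normalize(name):
    """Trim, lowercase, and map an alias to its canonical herb name."""
    key = name.strip().lower()
    return ALIASES.get(key, key)


def build_transactions(formulas, herbs, feature="effects"):
    """Return {formula: [characteristic items]} for the chosen feature."""
    transactions = {}
    for formula, raw_names in formulas.items():
        items = []
        for raw in raw_names:
            herb = herbs.get(normalize(raw))
            if herb is None:
                continue  # cleaning: skip herbs absent from the herb database
            for item in herb[feature]:
                if item not in items:  # keep each item once per transaction
                    items.append(item)
        transactions[formula] = items
    return transactions


first_transaction_set = build_transactions(FORMULAS, SINGLE_HERBS)
channel_transactions = build_transactions(FORMULAS, SINGLE_HERBS, "channels")
```

Passing `feature="channels"` produces the channel-tropism variant of the same transaction set, so one routine can serve both kinds of characteristic item.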

[0056] The original data includes a sing...

Abstract

The invention discloses a traditional Chinese medicine formula data mining method and system based on an FP-growth algorithm model. The method comprises the following steps: preprocessing the recorded traditional Chinese medicine raw data; constructing an FP-tree data structure that takes the characteristic items in a first transaction set as frequent items; arranging the characteristic items of each transaction in the first transaction set according to the order of the frequent items in the FP-tree data structure, and generating a second transaction set, wherein the second transaction set comprises a plurality of data sets, each data set takes one of its characteristic items as the transaction, and each transaction comprises the characteristic items arranged before that item in the data set; running the FP-growth algorithm on each data set to generate FP-tree sub-data structures in one-to-one correspondence with the data sets; and mining association rules between different effects and/or association rules between the mined effects and channel tropisms, based on the correspondence between the characteristic items in the different FP-tree sub-data structures. The method and system reduce information interference during user retrieval.
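The steps in the abstract follow the general shape of textbook FP-growth, which can be sketched as below. This is a generic illustration, not the patented implementation: the toy transactions, the thresholds, and the names `Node`, `build_tree`, `fp_growth` and `association_rules` are all assumptions. The conditional pattern bases built inside `fp_growth` (the items that precede a given item on each tree path) appear to play the role of the per-item "data sets" in the abstract, and each recursive call builds the corresponding sub-tree.

```python
from collections import defaultdict
from itertools import combinations


class Node:
    def __init__(self, item, parent):
        self.item, self.parent, self.count = item, parent, 0
        self.children = {}


def build_tree(transactions, min_support):
    """transactions: list of (items, count). Returns (header table, supports)."""
    freq = defaultdict(int)
    for items, count in transactions:
        for item in set(items):
            freq[item] += count
    freq = {i: c for i, c in freq.items() if c >= min_support}
    # Order frequent items by descending support, ties broken alphabetically.
    rank = {i: r for r, i in enumerate(sorted(freq, key=lambda i: (-freq[i], i)))}
    root, header = Node(None, None), defaultdict(list)
    for items, count in transactions:
        node = root
        for item in sorted((i for i in set(items) if i in rank), key=rank.get):
            child = node.children.get(item)
            if child is None:
                child = node.children[item] = Node(item, node)
                header[item].append(child)
            child.count += count
            node = child
    return header, freq


def fp_growth(transactions, min_support, suffix=frozenset()):
    """Yield (itemset, support) for every frequent itemset."""
    header, freq = build_tree(transactions, min_support)
    for item, nodes in header.items():
        itemset = suffix | {item}
        yield itemset, freq[item]
        # Conditional pattern base: the prefix path above each node of `item`.
        base = []
        for node in nodes:
            path, parent = [], node.parent
            while parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                base.append((path, node.count))
        yield from fp_growth(base, min_support, itemset)


def association_rules(support, min_confidence):
    """Return (antecedent, consequent, confidence) triples from itemset supports."""
    rules = []
    for itemset, sup in support.items():
        for size in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(sorted(itemset), size)):
                confidence = sup / support[antecedent]
                if confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent, confidence))
    return rules


# Toy effect transactions (hypothetical), one per formula.
TRANSACTIONS = [
    ["tonify qi", "harmonize"],
    ["tonify qi", "harmonize", "release exterior"],
    ["tonify qi", "clear heat"],
    ["harmonize", "release exterior"],
]
frequent = dict(fp_growth([(t, 1) for t in TRANSACTIONS], min_support=2))
rules = association_rules(frequent, min_confidence=0.9)
```

Here support is an absolute count and confidence is the support of the full itemset divided by the support of the antecedent. Mining rules between effects and channel tropisms would use the same routines on transactions that mix both kinds of item.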

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a traditional Chinese medicine formula data mining method and system based on an FP-growth algorithm model.

Background technique

[0002] At present, most prescription data preprocessing methods start from the content of dictionaries or textbooks and then classify the data manually, for example when processing symptom names and drug aliases. This approach lacks objectivity and uniform standards. In addition, existing formula mining systems require the user to manually input the name of a prescription or medicinal material and then retrieve suitable prescription compatibility information from the database. Because commonly used medicinal materials appear in many combinations, the resulting information interference increases the search time when formulating a prescription.

Summary of the invention

[0003] The main purpose of the present invention is to prov...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/00
CPC: G06F19/3456, G16H50/70
Inventor: 马兴田, 许冬瑾, 张纯, 黄凯明
Owner: 康美中药材数据信息服务有限公司