Core medicine mining method based on a complex network model and a parallelized PageRank algorithm

A complex network and core drug technology, applied in computing, special data processing applications, instruments, etc. It addresses the problems that the traditional PageRank algorithm cannot run in a distributed parallel environment and that conventional algorithms are no longer suitable for large-scale data, and it improves scalability and running speed.

Inactive Publication Date: 2015-05-13
HOHAI UNIV
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0007] The PageRank algorithm is a data mining method. The traditional PageRank algorithm cannot run in a distributed parallel environment, and as data volumes surge, conventional algorithms are no longer suitable for large-scale data.
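
For reference, the usual textbook form of the PageRank update is sketched below. The notation is an assumption introduced here and does not come from the patent text: d is the damping factor, N the number of nodes, In(v) the set of nodes linking to v, and L(u) the number of links leaving u. Each term of the sum uses only ranks from the previous iteration, so the per-node contributions can be computed independently and then summed per destination node, which is what makes a distributed formulation possible.

```latex
PR_{t+1}(v) = \frac{1 - d}{N} + d \sum_{u \in \mathrm{In}(v)} \frac{PR_t(u)}{L(u)}
```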


Examples

Embodiment Construction

[0054] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0055] As shown in Figure 1, core drug mining obtains traditional Chinese medicine compound prescription data through prescription database queries, extraction from irregular text data, and similar means, and generates text data through preprocessing such as data standardization and formatting. A traditional Chinese medicine network is built from this text data, and a parallelized PageRank algorithm is run on this network to discover core drugs.
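
The excerpt does not say how edges of the network are defined. As a minimal sketch, the code below assumes a co-occurrence network: two drugs are linked when they appear in the same prescription, and the edge weight counts how many prescriptions contain both. The function names, the tab-separated edge-list format, and the toy prescriptions are all hypothetical and serve only as illustration.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_network(prescriptions):
    """Return {(drug_a, drug_b): count} for every unordered drug pair sharing a prescription."""
    edge_weights = Counter()
    for drugs in prescriptions:
        # Deduplicate and sort so each unordered pair has one canonical key.
        for a, b in combinations(sorted(set(drugs)), 2):
            edge_weights[(a, b)] += 1
    return dict(edge_weights)


def to_edge_list_text(edge_weights):
    """Serialize the network as tab-separated lines: drug_a, drug_b, weight."""
    return "\n".join(f"{a}\t{b}\t{w}" for (a, b), w in sorted(edge_weights.items()))


# Hypothetical toy prescriptions (illustrative only, not data from the source).
prescriptions = [
    ["ginseng", "licorice", "ginger"],
    ["licorice", "ginger"],
    ["ginseng", "licorice"],
]
network = build_cooccurrence_network(prescriptions)
print(to_edge_list_text(network))
```

Each prescription is processed independently of the others, so the pair-emitting loop could serve as a map step and the counting as a reduce step in a distributed setting.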

[0056] Building the traditional Chinese medicine compound data network and mining core drugs with the PageRank algorithm are the main steps of the invention. The idea of the invention is to mine core drugs effectively through complex network modeling and a parallel PageRank algorithm, while improving the scalability and running speed of the algorithm.

[0057] The flow chart of the core drug mining method based on the complex network model and parallelized PageRank algorithm of the present invention is shown in Figure 2 ...

Abstract

The invention relates to a core medicine mining method based on a complex network model and a parallelized PageRank algorithm, comprising the following steps:

(1) Networking: a. preprocess to generate a TCM (traditional Chinese medicine) data set and format it as text data; b. deploy the initial text data on a Hadoop platform; c. build the TCM network in parallel; d. finish.

(2) Mining: a. obtain the text file of the TCM network generated in step (1)-c; b. deploy that text file on the Hadoop platform; c. find core medicine nodes by running the parallelized PageRank algorithm; d. finish.

The method establishes a complex network model of TCM, uses parallelization to improve the scalability and running speed of both network construction and the PageRank algorithm, effectively mines the key core medicine nodes in compound prescriptions, and supports the study of TCM compatibility rules.
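
The mining step (2)-c can be illustrated with a small single-process sketch that separates each PageRank iteration into a map phase and a reduce phase, the same shape a Hadoop job would take. This is not the patent's implementation and does not use any Hadoop API. It assumes an unweighted, undirected network in which every node has at least one neighbour; the function name, the damping factor of 0.85, the iteration count, and the toy network are assumptions made for illustration.

```python
from collections import defaultdict


def pagerank_mapreduce(adjacency, damping=0.85, iterations=30):
    """Iterate PageRank with explicit map and reduce phases.

    adjacency maps each node to its list of neighbours; every node is
    assumed to have at least one neighbour (no dangling nodes).
    """
    nodes = sorted(adjacency)
    n = len(nodes)
    ranks = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Map phase: each node emits an equal share of its rank to every neighbour.
        emitted = []
        for node in nodes:
            neighbours = adjacency[node]
            for neighbour in neighbours:
                emitted.append((neighbour, ranks[node] / len(neighbours)))
        # Shuffle and reduce phase: sum the shares received by each node.
        received = defaultdict(float)
        for target, share in emitted:
            received[target] += share
        ranks = {node: (1 - damping) / n + damping * received[node] for node in nodes}
    return ranks


# Hypothetical toy network (illustrative only, not data from the source).
adjacency = {
    "licorice": ["ginseng", "ginger", "cinnamon"],
    "ginseng": ["licorice", "ginger"],
    "ginger": ["licorice", "ginseng"],
    "cinnamon": ["licorice"],
}
ranks = pagerank_mapreduce(adjacency)
top = max(ranks, key=ranks.get)
print(top)
```

In this toy network the best-connected node ranks highest, which is the sense in which a high PageRank score marks a candidate core medicine node. A weighted variant would split each node's rank in proportion to edge weights instead of equally.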

Description

Technical field

[0001] The invention relates to complex network modeling of traditional Chinese medicine and to a technique for mining core medicines of traditional Chinese medicine by running a parallelized PageRank algorithm on that model.

Background

[0002] Data mining technology can discover latent, useful knowledge in large amounts of data and is an important part of artificial intelligence. Applied to traditional Chinese medicine compound prescription data, it enables intelligent analysis and the discovery of latent compatibility rules among Chinese medicines. Commonly used data mining models are transaction based: a compound prescription is treated as a transaction composed of multiple drugs and stored in a transaction database.

[0003] With the increase in the data scale of traditional Chinese medicine and the requirement for deeper mining, traditional Chinese medicine association rules, classification and clustering algor...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F19/00
Inventor: 吴骏, 刘正, 王志坚, 许峰
Owner: HOHAI UNIV