Drug target prediction method based on multiple similarity network walk

A technology of target prediction and similarity, which is applied in the interdisciplinary field of bioinformatics and computer science, can solve the problems of low prediction accuracy and achieve the effect of being easy to handle and helping drug target prediction

Active Publication Date: 2018-09-11
SUN YAT SEN UNIV
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] In order to solve the technical defect that the drug target prediction method provided by the prior art ignores the relationship between the drug and the target network and the network topology characteristics, the prediction accuracy is low, and it provides a drug based on multiple similarity...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Drug target prediction method based on multiple similarity network walk
  • Drug target prediction method based on multiple similarity network walk
  • Drug target prediction method based on multiple similarity network walk

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] like figure 1 Shown, the method provided by the invention comprises the following steps:

[0030] The first step is to obtain drug information from the DrugBank database, obtain protein information from the HPRD database, obtain disease information from the Comparative Toxicogenomics Database, and obtain drug side effects information from the SIDER database. According to the obtained information, the interaction relationship between drugs and drugs, targets and targets, drugs and diseases, targets and diseases, drugs and side effects is obtained, and a corresponding adjacency matrix is ​​constructed. That is, if there is an interaction relationship, it is 1, and if there is no relationship, it is 0.

[0031] In the second step, for each adjacency matrix in the first step, drug and target nodes are used as a set, and side effects and diseases are used as attributes in the set, and the Jaccard similarity coefficient of each node in the network is calculated, and then gen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a drug target prediction method based on multiple similarity network walk. The method comprises the following steps of S1, obtaining the interaction relationship between drugsand drugs, targets and targets, drugs and diseases, targets and diseases, drugs and side effects from relevant databases, and constructing corresponding adjacency matrix; S2, by calculating Jaccard similarity among nodes in the adjacency matrix, constructing similarity networks corresponding to each adjacency matrix; S3, using a node2vec method to walk and train each similarity network to generate corresponding network feature vectors; S4, splicing the feature vectors of each network obtained in S3 are to obtain a combined feature vector of multiple networks of the drugs and targets; S5, according to an existing drug target relationship, obtaining a positive sample of a drug target pair, generating a negative sample equivalent to the positive sample by randomly combining, and splicing thedrug feature vectors and the target feature vectors obtained in S4 according to the combination of the positive and negative samples, thereby obtaining a final feature vector of the drug target pair;S6, using a random forest algorithm to conduct training, ten-fold cross-check and classification prediction on the positive and negative samples in S5.

Description

technical field [0001] The invention relates to the interdisciplinary field of bioinformatics and computer science, and more specifically, relates to a drug target prediction method based on multiple similarity network walks. Background technique [0002] Predicting the drug-target relationship with methods in the computer field has become a very important step in the process of discovering new drugs and drug repositioning. The potential drug target relationship identified by machine learning can provide guidance for biochemical or clinical experiments, thereby greatly reducing the time and cost of biochemical experiments. [0003] In the traditional field of machine learning, the extraction and selection of features is a very critical part, and the quality of feature representation usually determines the performance of machine learning methods. The selection of features needs to be done effectively by experts in specific fields. Therefore, when conducting interdisciplinary...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/00G06K9/62
CPCG16C20/50G06F18/2411
Inventor 石越常会友
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products