Pharmacokinetic Properties and Toxicity Prediction Method of Drug Molecules Based on Capsule Network

A pharmacokinetic and drug molecule technology, applied in the field of drug molecule pharmacokinetic properties and toxicity prediction based on capsule network, which can solve the loss of original information of molecular fingerprints and molecular descriptors, poor prediction and classification effect, training set Large scale dependence, etc., to achieve the effect of automatic dimensionality reduction and good prediction effect

Inactive Publication Date: 2021-06-22
SICHUAN UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method is based on the molecular fingerprints and molecular descriptors of ligands, and uses deep learning capsule networks to establish the relationship between molecular fingerprints and molecular descriptors, pharmacokinetic properties and toxicity, and overcomes the inadequacy of predicting classification effects in the prior art. Good, the original information of molecular fingerprints and molecular descriptors that characterize ligands is seriously lost, and the accuracy of prediction is greatly dependent on the size of the training set.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pharmacokinetic Properties and Toxicity Prediction Method of Drug Molecules Based on Capsule Network
  • Pharmacokinetic Properties and Toxicity Prediction Method of Drug Molecules Based on Capsule Network
  • Pharmacokinetic Properties and Toxicity Prediction Method of Drug Molecules Based on Capsule Network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0138] Predict the active compound of the potassium ion channel encoded by hERG (human ether-a-go-go-related gene), and use the convolution operation as a feature extractor to obtain the low-level features of the molecule. The implementation process is as follows:

[0139] In the first step, data related to hERG activity are collected. Data related to hERG were obtained from the ChEMBL open source database (https: / / www.ebi.ac.uk / chembl / ). ChEMBL is a well-known biological activity database established by the European Institute of Bioinformatics. Anyone can download this public database from the website, so it is widely used by cheminformatics researchers. The workflow for establishing the initial ChEMBL-hERG dataset is as follows:

[0140] 1) According to the ID number of hERG in the data (ChEMBL 240), 17,952 compounds that were tested for hERG activity were extracted;

[0141] 2) Compounds identified as "Nonstandard unit for type", "Outside typical range" (outside typical...

Embodiment 2

[0152] Still predicting the active compounds of the potassium ion channel encoded by hERG (human ether-a-go-go-related gene), a restricted Boltzmann machine was used as a feature extractor.

[0153] The first and second steps are the same as in Example 1.

[0154] In the third step, the prediction model of hERG active / inactive molecules based on the capsule network was established with the ChEMBL-hERG training set. Initialize network weights randomly using a truncated normal distribution and set stddev to 0.01. The feature extractor uses a restricted Boltzmann machine. The probability distribution of the energy function is used as the activation function. To reduce internal covariate shift, the input distribution of each layer is normalized to a standard Gaussian distribution using a batch normalization method. Adam method is used for network optimization. By monitoring multiple evaluation indicators (accuracy, specificity, sensitivity and Matthews correlation coefficient,...

Embodiment 1、2

[0162] The present embodiment 1, 2 adopts the following formula to carry out evaluation verification:

[0163]

[0164]

[0165]

[0166] Where Q represents the overall prediction accuracy of the prediction model, SE represents sensitivity, which refers to the proportion of positive / active compounds correctly predicted by the prediction model, and SP represents specificity, which refers to the proportion of negative / inactive compounds correctly predicted by the prediction model .

[0167] When convolution and restricted Boltzmann machine are used as feature extractors, the overall prediction accuracy of the test set is about 90%, indicating that the established model also has a good predictive ability for compounds independent of the training set.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Pharmacokinetic Properties and Toxicity Prediction Method of Drug Molecules Based on Capsule Network. After constructing comprehensive molecular fingerprints and molecular descriptors and preparatory work for establishing models, the low-level feature content of molecules is extracted from the upper-level low-level features through convolution or restricted Boltzmann machine operations, and then the capsule network method is used in the next step. The high-level features of molecules are abstracted from a layer of high-level features, and the relationship between high-level features and active labels is fitted by a dynamic routing algorithm to predict the pharmacokinetic properties and toxicity classification of unknown small molecules. The present invention does not need to collect large-scale data sets for training, and realizes automatic dimensionality reduction through end-to-end optimization of the input, updates the coupling coefficient through iterative dynamic routing process, and dynamic routing transfers all the features of the upper layer capsule to any one of the lower layer In the capsule, the hierarchical position relationship between the low-level features and high-level features is greatly preserved. Better predictive performance than traditional machine learning methods.

Description

technical field [0001] The invention relates to the field of computer-aided drug molecule design, in particular to a method for predicting pharmacokinetic properties and toxicity of drug molecules based on a capsule network. Background technique [0002] The great success of a drug depends not only on its good efficacy, but also on its excellent pharmacokinetic properties and low toxicity. According to statistics, the poor absorption, distribution, metabolism, excretion and toxicity of candidate drugs account for more than 50% of the reasons for the failure of drug development. Therefore, it is possible to exclude and optimize compounds with poor pharmacokinetic properties and toxicity in the early stage of drug development Greatly improve the success rate of drug development. In recent years, although in vitro high-throughput screening methods can be used to measure the pharmacokinetic properties and toxicity of compounds, experimental-based assays are not only expensive a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16C10/00G16C20/30G16C20/70
Inventor 杨胜勇王译伟邹俊黄磊姜斯文
Owner SICHUAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products