Method for predicting dissociation constant of small molecule compounds based on graph convolutional neural network

A technology of convolutional neural network and small molecule compounds, which is applied in the field of computer-aided drug design, can solve the problems affecting the pharmacokinetics and biochemical properties of drug molecules, require large computing power, and slow calculation speed, etc., and achieve fast prediction speed , improve efficiency and avoid noise

Pending Publication Date: 2022-01-21
EAST CHINA NORMAL UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At the same time, it will also affect the pharmacokinetics and biochemical properties of drug molecules
[0003] At present, there are three main methods for predicting dissociation constants. One is the linear free energy method, which establishes a linear equation based on the substituent effect, but this type of method needs to extract parameters based on experience, and establish a parameter library and a matrix library; The QSPR method uses machine learning to model the descriptors of the chemical environment around the dissociation center. The descriptors determine the quality of this type of method; there is also a computing method based on quantum chemistry, which requires a lot of computing power. , the calculation speed is slow

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for predicting dissociation constant of small molecule compounds based on graph convolutional neural network
  • Method for predicting dissociation constant of small molecule compounds based on graph convolutional neural network
  • Method for predicting dissociation constant of small molecule compounds based on graph convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0220] Concrete steps of this embodiment:

[0221] (1) The Smiles representation of all compounds and the "acd_most_apKa" and "acd_most_bpKa" data of each compound were extracted from the ChEMBL database using the MySQL database search language. Finally, 1,624,715 compounds were obtained, including 963,969 acidic dissociation centers and 1,167,260 basic dissociation centers.

[0222] (2) Compound-based Smiles means using RDKit to generate a three-dimensional structure and add hydrogen atoms, neutralize the charge in the compound, and remove the compound containing salt and solvent.

[0223] (3) Use Epik to calculate the dissociation constants of all compounds extracted in (2), and use the most acidic and basic dissociation centers in the Epik calculation results as the dissociation centers to which the dissociation constants extracted in (2) belong center.

[0224](4) For the data obtained in (3), if the difference between the result calculated by Epik and the dissociation c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for predicting a dissociation constant of a small molecule compound based on a graph convolutional neural network, which is different from a traditional prediction method based on molecular fingerprints, and automatically learns a chemical mode related to the dissociation constant through the graph convolutional neural network to construct a prediction model. The dissociation constants of 16,000,000 compounds in the ChEMBL database are used for training the model. In addition, in order to quickly determine the dissociation center in each compound, a substructure database containing 144 SMARTS templates is constructed on the basis of a ChEMBL database. When a new compound is predicted, firstly, the SMARTS template library is used for matching dissociation centers in the compound, and then the dissociation constants of the dissociation centers are predicted in sequence. By utilizing the method, the dissociation constants of the compounds can be quickly and accurately predicted, and the efficiency of drug design and virtual screening is improved.

Description

technical field [0001] The invention relates to the field of computer-aided drug design, which is a method for predicting dissociation constants of small molecule compounds by computer based on graph convolutional neural network, which can be applied in the field of drug virtual screening and drug molecular design, and shortens the cycle of drug research and development. Background technique [0002] Dissociation constant is an important physicochemical property of small molecule drugs. It has been reported that about 79% of orally administered drugs contain at least one ionizable group. It determines the activity, solubility, lipophilicity, bioaccumulation and toxicity of drug molecules. At the same time, it will also affect the pharmacokinetics and biochemical properties of drug molecules. [0003] At present, there are three main methods for predicting dissociation constants. One is the linear free energy method, which establishes a linear equation based on the substitu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16C20/50G16C20/70G16C20/90G06N3/04
CPCG16C20/50G16C20/70G16C20/90G06N3/045
Inventor 潘肖林王昊张增辉季长鸽
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products