Method for predicting protein and ligand molecule binding free energy based on convolutional neural network

A technology of convolutional neural network and ligand molecules, which is applied in the field of predicting the binding free energy of proteins and ligand molecules based on convolutional neural networks, can solve problems such as limited application, difficult generalization of homologue ligands, and difficulties in large-scale application , to achieve the effects of small error, good generalization ability, and precise protein-ligand binding free energy calculation

Pending Publication Date: 2021-01-05
SHENZHEN JINGTAI TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, the FEP method faces problems such as the preparation of the simulation system, the processing of the solvent model, the selection of the force field, and the calculation cost, which makes it difficult to apply on a large scale in practice.
The scoring fun

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for predicting protein and ligand molecule binding free energy based on convolutional neural network
  • Method for predicting protein and ligand molecule binding free energy based on convolutional neural network
  • Method for predicting protein and ligand molecule binding free energy based on convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0048]Example 1

[0049]Binding free energy prediction for a kinase target:

[0050]First collect the small molecule inhibitors of the target, perform 3D conformation preparation and molecular docking calculation, then call the molecular descriptor module to process these molecules and proteins, and input them into the convolutional neural network model. Such asimage 3 As shown, all molecular data is divided into 5 parts, one of which is selected as the validation set each time, and the other four parts are used as the training set. In this way, the model is trained for 5 rounds, that is, 5-fold cross-validation, and finally all the validation scores are taken The average value is used as the final verification score.Figure 4 ,Figure 5 Represents the performance on the validation set during the model training, fromFigure 4 It can be seen that the error in the initial stage of the model is relatively large, but after about 50 rounds of training, the error is reduced to about 2.3, and it re...

Example Embodiment

[0053]Example 2

[0054]Optimize the structure of the lead compound for a target:

[0055]Some seed compounds with good initial activity are obtained through virtual screening, and the convolutional neural network model is used to optimize the structure of this batch of seed compounds to obtain lead compounds.

[0056]Firstly, the molecular docking of the seed compound and the target is carried out, and then they are encoded and molecular descriptors are calculated. Input it into the convolutional neural network model to predict the free energy of binding between the compound and the target. According to the binding mode between the seed compound and the target, as well as the binding free energy value and spatial hierarchical structure information predicted by the model, the structure of the seed compound is modified and optimized, and groups matching the spatial hierarchical structure of the target are added to the seed compound , To form a better structural complementation with the target...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for predicting protein and ligand molecule binding free energy based on a convolutional neural network. The method comprises the following steps: (1) obtaining a high-quality protein-ligand compound crystal structure from a PDB database, collecting a small molecular structure from literature, and performing 3D conformation preparation; (2) performing encoding and descriptor calculation on the protein and the small molecules, and processing the protein into a 3D image; (3) using the 3D image of the protein as an input, and designing a convolutional neural networkarchitecture which comprises an input layer, a hidden layer and an output layer; (4) carrying out feature extraction and fine tuning on the pre-trained model so as to be applied to a new data set; and (5) after all the tasks are completed, calling Panda and Matplotlib data analysis libraries, and directly drawing an analysis result curve in Jupyter. According to the method, rapid and accurate protein-ligand binding free energy calculation is achieved, compared with an experimental value, errors are small, and data analysis and visualization can be automatically conducted on the result.

Description

technical field [0001] The invention belongs to the field of drug research and development based on artificial intelligence, specifically a method for predicting the binding free energy of proteins and ligand molecules based on convolutional neural networks, and realizes the precise prediction of the binding freedom of receptors and drug molecules by applying convolutional neural networks (CNN) It can be applied to the design and development of new drugs. Background technique [0002] New drug design and development is a creative and exploratory research work. Drug molecular design is based on rational strategies and scientific planning to construct new molecular entities with expected pharmacological activities. Molecular design is to gradually optimize active compounds into compounds that are safe, effective, controllable and easy to obtain in the human body, and meet the requirements for multidimensional properties of drugs during the process of changing and modifying st...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B15/20G16B15/30G06N3/04G06N3/08
CPCG16B15/20G16B15/30G06N3/08G06N3/045
Inventor 王辉马健张佩宇方磊温书豪赖力鹏
Owner SHENZHEN JINGTAI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products