Deep learning prediction method of target-ligand binding affinity based on gated attention mechanism

A technology of deep learning and prediction methods, applied in neural learning methods, used to analyze two-dimensional or three-dimensional molecular structures, informatics, etc., can solve the problem that large data sets are difficult to balance accuracy and versatility, and 3D structural features are not easy to obtain and other problems to achieve the effect of saving computing time and cost and high interpretability

Pending Publication Date: 2022-07-12
DALIAN UNIV OF TECH
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Based on data-driven methods (such as machine learning methods), the calculation speed is extremely fast, however, most machine learning algorithms still rely on expert knowledge for feature extraction/selection, and it is difficult to trade off accuracy and versatility in large data sets
Driven by massive data and powerful parallel computing capa

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning prediction method of target-ligand binding affinity based on gated attention mechanism
  • Deep learning prediction method of target-ligand binding affinity based on gated attention mechanism
  • Deep learning prediction method of target-ligand binding affinity based on gated attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0068] The target-ligand complexes were randomly divided into training, validation, and test sets in a ratio of 8:1:1. The number of samples in the training, validation, and test sets are 6,711, 841, and 841, respectively. There are 3,102 low binding affinity complexes (label "0") and 3,609 high binding affinity complexes (label "1") in the training set, meeting the data balance requirement for deep learning classification models. The loss function in our deep learning model is the cross entropy loss (CEL), which is a commonly used loss function in classification tasks. To minimize the loss function, the model parameters are optimized by using the Adaptive Moment Estimation (Adam) optimizer with a learning rate setting varying from 0.0005 to 0.00005, where a decay factor (0.5) is used when the loss function on the validation set does not decrease with iterations to update the learning rate. Generally speaking, 50% is the threshold for judging true and false in a binary class...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a deep learning prediction method of target-ligand binding affinity based on a gated attention mechanism, and belongs to the field of computer-aided drug design technology and biology and pharmacoinformatics. The deep learning model starts from an SMILES character string of a ligand and an amino acid sequence of a protein, and then is converted into a ligand matrix and a protein matrix. The ligand matrix is sent to a full connection layer and an attention layer based on gate enhancement for feature extraction, and the protein matrix is sent to a one-dimensional convolution layer and a maximum pooling layer and then sent to the attention layer based on gate enhancement. Finally, the processing characteristics of the ligand matrix are polymerized by adding matrix rows, and the same process is performed on the protein matrix, and then the two are spliced together and sent to a subsequent full connection layer to predict the probability of high/low binding affinity of the protein-ligand complex. According to the method, the time and cost related to experimental analysis are effectively reduced, and the efficiency of drug design and virtual screening is improved.

Description

technical field [0001] The invention relates to computer-aided drug design technology and the fields of biology and drug informatics, in particular to a target-ligand binding affinity prediction method. Background technique [0002] Most biological processes are determined by biomolecular recognition, where proteins often act as targets to interact with ligands to regulate biological functions, such as enzymatic catalysis, signal transduction, etc. The study of target-ligand interactions is an important topic. The quantity of binding strength (measured as a real number) of a target-ligand interaction is usually defined as binding affinity, which can be determined by the inhibition constant K i , the dissociation constant K d and half-maximal inhibitory concentration IC 50 to quantify. Most of the existing drugs are small molecule compounds with biological activity. Identifying ligands with high affinity to target proteins (small molecule drug candidates) is a major task ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B40/00G16B5/00G16B15/00G16B50/00G06K9/62G06N3/04G06N3/08
CPCG16B40/00G16B5/00G16B15/00G16B50/00G06N3/08G06N3/047G06N3/045G06F18/241G06F18/2415Y02A90/10
Inventor 刘奇磊都健赵雨靓张磊吴心远孟庆伟
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products