Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Compound target protein binding prediction method based on multi-deep learning model consensus

A learning model and deep learning technology, applied in the field of pharmaceutical research and development, can solve problems such as numerous parameters to be adjusted, high false positives, and low success rate

Active Publication Date: 2021-04-02
LANZHOU UNIVERSITY
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Existing techniques exploit compound-target protein binding relationships (see figure 1 ) cheminformatics software, all manual, many parameters need to be adjusted, the success rate is low, and there is a problem of high false positive
Deng, N et al (J.Phys.Chem.B 2015,119,976-988.) and Nataraj S et al (Biophys Rev (2017)9:91–102.DOI10.1007 / s12551-016-0247-1) respectively From the two aspects of compound target protein binding free energy calculation (free energy calculation) and compound target protein binding software review, it is discussed that there are many parameters in the compound target protein binding prediction, the success rate is low, and there are high false positives.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Compound target protein binding prediction method based on multi-deep learning model consensus
  • Compound target protein binding prediction method based on multi-deep learning model consensus
  • Compound target protein binding prediction method based on multi-deep learning model consensus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] Multiple "recurrent neural network (positive)" models are connected in series to multiple "recurrent neural network (negative)" models to predict the binding relationship and system of compound target proteins.

[0073] Schematic diagram of the overall system structure: see Figure 3 ~ Figure 4 .

[0074] System operating environment:

[0075] Hardware: CPU+GPU;

[0076] Software: Windows or Linux, Python+Tensorflow, PyTorch, Keras, etc.

[0077] Technical solutions:

[0078] Such as Figure 3 ~ Figure 4 shown.

[0079] (1) Compound target protein binding data (positive sample), compound-target protein non-binding data (negative sample)

[0080] Compound target protein binding data (positive sample), compound-target protein non-binding data (negative sample). Among them, the positive sample is the target protein binding of the compound that humans have discovered in scientific research activities, usually selected from ChemSpider, PubChem, BindingDB, ZINC, ChEMBL...

Embodiment 2

[0127] Multiple "recurrent neural networks (positive)" are connected in series with multiple "encoder-decoder neural networks (negative)" deep learning models to predict the binding relationship and system of compound target proteins.

[0128] Schematic diagram of the overall system structure: see Figure 5 .

[0129] System operating environment: same as "Embodiment 1".

[0130] (1) Obtain compound target protein binding / non-binding data (positive sample / negative sample)

[0131] The same as the corresponding part in "Example 1".

[0132] (2-1) Extract compound target protein binding / unbinding data (positive sample / negative sample) according to nine positive and negative sample ratios and synthesize the total data set, which is divided into training set, test set and verification set

[0133] With (2-1) part of " embodiment 1 " technical scheme.

[0134] (2-2) Build multiple different compound target protein binding / non-binding deep learning models (positive / negative mode...

Embodiment 3

[0160] Multiple "encoder-decoder neural networks (positive)" connected in series with multiple "recurrent neural network (negative)" deep learning models to predict the binding relationship and system of compound target proteins

[0161] Schematic diagram of the overall system structure: see Figure 6 .

[0162] System operating environment: same as "Embodiment 1".

[0163] Technical solutions:

[0164] (1) Obtain compound target protein binding / non-binding data (positive sample / negative sample)

[0165] The same as the corresponding part in "Example 1".

[0166] (2-1) Extract compound target protein binding / unbinding data (positive sample / negative sample) according to nine positive and negative sample ratios and synthesize the total data set, which is divided into training set, test set and verification set

[0167] With (2-1) part of " embodiment 1 " technical scheme.

[0168] (2-2) Build multiple different compound target protein binding / non-binding deep learning model...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a compound target protein binding prediction method based on multi-deep learning model consensus. The method comprises the following steps: acquiring compound target protein binding / non-binding data; extracting a plurality of different compound target protein binding / non-binding data sets for training, testing and verifying; respectively constructing and training a plurality of compound target protein binding / non-binding deep learning models, extracting compound target protein binding / non-binding characteristics, and obtaining a plurality of final compound target protein binding / non-binding deep learning models; predicting to obtain a plurality of groups of compound target protein binding / non-binding relationships through the plurality of compound target protein binding / non-binding deep learning models in the previous step; integrating the obtained multiple groups of binding relationship results to obtain a consensus binding relationship, and integrating the obtained multiple groups of non-binding relationship results to obtain a consensus non-binding relationship. The method has the characteristics of low false positive rate and high accuracy, and is suitable for popularization and application.

Description

technical field [0001] The invention belongs to the technical field of pharmaceutical research and development, and relates to a compound target protein binding prediction method based on consensus of multiple deep learning models. Background technique [0002] Use the existing mainstream deep learning models (fully connected neural network model, convolutional neural network model, recurrent neural network model, encoder-decoder network and deep belief network) to extract the characteristics of the binding relationship of the compound target protein, so that Predicting the binding relationship of new compounds to target proteins has important practical significance for the discovery / development of new drugs and the study of the mechanism of action of traditional Chinese medicine. However, although these models can achieve high training accuracy (>90%), the high prediction false positive rate (>90%) prevents the further application of deep learning models in this field...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B15/20G06N3/04G06N3/08G06K9/62
CPCG16B15/20G06N3/08G06N3/045G06F18/2453
Inventor 郑光胡成杰刘昊乔安杰陈俊楠高雅杰吕诚李立
Owner LANZHOU UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products