Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Ligand compound rapid pre-screening model based on deep learning

A ligand compound and deep learning technology, which is applied in the field of rapid pre-screening model of ligand compounds based on deep learning, can solve the problems of low affinity prediction accuracy, different performance, and success rate.

Active Publication Date: 2021-09-14
石家庄鲜虞数字生物科技有限公司 +3
View PDF12 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The most commonly used method for the pre-screening of ligand compounds is molecular docking, through which the protein-ligand binding mode can be simulated, the ligand binding conformation can be predicted, and the affinity can be scored to determine potential active compounds. However, existing molecular The docking theory itself is not perfect. Molecular docking has low accuracy in predicting protein-small molecule binding mode and affinity. In addition, there are many docking programs to choose from, and their performances are different, resulting in a wide range of success rates in actual use.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Ligand compound rapid pre-screening model based on deep learning
  • Ligand compound rapid pre-screening model based on deep learning
  • Ligand compound rapid pre-screening model based on deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The following will be combined with figure 1 , clearly and completely describe the technical solution of the present invention,

[0036] Specifically, the present invention provides a rapid pre-screening model for ligand compounds based on deep learning, such as figure 1 As shown, it includes the following steps:

[0037] S1. Obtain the ligand small molecule compound through the DrugSpaceX database or ZINC database and the small molecule data set generated by the generated model, and obtain the receptor protein structure through the RCSB PDB database and PDBbind database;

[0038] S2. Use RDKit to generate SMILES strings of compound molecules, and use word2vec or similar algorithms to encode the compound molecules;

[0039] S3. Extract the amino acid sequence information of the receptor protein binding site (pocket), and encode it using word2vec or similar algorithms;

[0040] S4. Set the docking space range, combine the ligand and the receptor structure vector by a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a ligand compound rapid pre-screening model based on deep learning. The method includes the following steps: S1, constructing a data set, and encoding amino acid sequence information of binding sites of ligand compounds and receptor proteins; s2, combining the ligand and receptor structure vectors and inputting them into a deep neural network, outputting affinity scores, and training the model in a supervised learning mode; s3, sorting according to affinity scoring results, screening out positive compounds, and removing a large number of negative compounds; s4, obtaining molecular fingerprints (molecular characteristics) of positive compounds, and clustering based on a similarity or distance measurement method; and S5, taking the compound with the highest score in each cluster as a potential active compound. According to the method, rapid pre-screening of the ligand compound is achieved, redundancy can be removed through clustering analysis of the positive compound, the structural diversity of the ligand compound is guaranteed, and the prediction speed and accuracy are improved.

Description

technical field [0001] The invention belongs to the field of computer application technology, and in particular relates to a rapid pre-screening model for ligand compounds based on deep learning. Background technique [0002] As the starting point of drug development, the discovery of hit compounds is crucial to the entire drug development process. As an important technology for discovering hit compounds, virtual screening is the core technology of computer-aided drug design, that is, to quickly screen a small number of compounds from a large number of molecular databases. For target compounds, the pre-screening of ligand compounds can increase the pace of scientific research, reduce the number of compounds entering the biochemical experiment stage, and effectively improve the success rate and efficiency of drug development. [0003] The most commonly used method for the pre-screening of ligand compounds is molecular docking, through which the protein-ligand binding mode can...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G16C20/50G16C20/64G16C20/70G16C20/90G06N3/04G06N3/08
CPCG16C20/50G16C20/64G16C20/70G16C20/90G06N3/04G06N3/08
Inventor 张树科靳彦召王琪贾庆忠赵书良赵金金陈明
Owner 石家庄鲜虞数字生物科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products