Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic drug design method and system, computing equipment and computer readable storage medium

A design method and drug technology, applied in the computer field, can solve the problems of high novelty, low time, manpower, material resources, low effectiveness and uniqueness, etc., and achieve the effects of strong synthesizable, improved performance, and strong druggability.

Pending Publication Date: 2020-12-22
深圳智药信息科技有限公司
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Problem 1, the validity and uniqueness of molecules generated by the L2L framework model are low
[0006] Problem 2, the high novelty of molecules generated by the L2L framework model is low, and it is difficult to generate compounds with normal structures in the chemical space far away from the target molecule (molecular similarity Tc<0.4)
[0007] Problem 3, the L2L framework model is difficult to generate structurally normal compounds with a molecular weight (Molecular Weight, MW) greater than 500, so it cannot be applied to some lead compound design scenarios that require high molecular weight
[0008] Question 4. The L2L framework model cannot be reused across targets. When applied to different targets, it is necessary to add known active compounds of the target target for training to generate new structural compounds, resulting in waste of time, manpower, and material resources.
[0009] Problem 5, the L2L framework cannot allow the model to fix a certain part of the compound structure, so that automatic sampling can be performed in other parts
[0010] The above problems limit the practical application value of molecular generative models using L2L framework

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic drug design method and system, computing equipment and computer readable storage medium
  • Automatic drug design method and system, computing equipment and computer readable storage medium
  • Automatic drug design method and system, computing equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The inventor believes that under the premise of limited training data and computing power, trying to make the machine model correctly output SMILES strings (that is, a complete lead compound) with correct grammar and tens or even hundreds of characters is a difficult task. . Even a giant generative model like GPT-3 can generate text paragraphs with correct text, grammar, content, and logic after almost inputting a corpus of nearly 45TB, using 175 billion parameters, and using a high-performance computing platform for training. Still an extremely difficult thing to do. Therefore, in view of the problems existing in the prior art, combined with the practical experience of medicinal chemists in drug design, this application finds a new way, and proposes a new molecular generation algorithm framework with molecular fragments as input, which is called Lead-to -Fragment-to-Lead(L2F2L). The difference from the L2L framework is that the reference Figure 2Aand B, in the trai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic medicine design method and system, computing equipment and a computer readable storage medium. The method comprises the following steps of decomposing a target lead compound into fragments with synthesizable modules, and sequentially inputting the fragments into a trained drug design machine learning model for sampling, and reassembling the new fragment outputby the drug design machine learning model to obtain a new lead compound. According to the automatic drug design provided by the invention, the performance in the aspects of molecule generation effectiveness and uniqueness is greatly improved, and molecules with high novelty, strong synthesizability and strong druggability can be generated; molecules can be easily generated in a high molecular weight region; the method can be repeatedly used for different target point lead compound generation scenes only by using a specific data set to train once; the local structure of the immobilized compoundcan be easily realized, and the rest parts are optimized.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to an automatic drug design method, system, computing equipment and computer-readable storage medium. Background technique [0002] Designing lead compounds with desirable properties is a central task in the drug discovery phase. In fast-follow and similar (Me-too) drug design scenarios, the traditional process requires the collection of a large number of papers and patents, and on the basis of reading and understanding by medicinal chemists, design novel, synthesizable, and druggable drugs Strong compounds, and validated by chemical synthesis and biological characterization. [0003] Molecular Generation is an automatic drug design method based on deep generative learning that has developed rapidly in recent years. By letting the model learn the SMILES (a compound structure represented by a string) or Molecular Graph (atoms and chemical bond connections represented by m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16C20/50G16C20/70
CPCG16C20/50G16C20/70
Inventor 黄韬金锋魏文娟
Owner 深圳智药信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products