Molecular generation method based on subgraph-variational self-encoding structure

A technology of self-encoding and self-encoder, which is applied in the drug development of molecular generation and optimization, and in the field of artificial intelligence drug research and development, can solve the problems that the chemical properties cannot meet the actual needs and do not consider the chemical structure, etc., to overcome the lack of chemical properties, reduce the Reliance on Domain Knowledge, Effect of High Practical Value

Pending Publication Date: 2021-02-23
GALIXIR TECH (BEIJING) LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the molecules generated by this graph-based method have better expressiveness than SMILES in some chemical properties, but because the entire graph is gradually generated based on a single atom, some global chemical properties are not considered in the generation process. structure, so the obtained molecules usually do not meet the actual needs in some chemical properties

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Molecular generation method based on subgraph-variational self-encoding structure
  • Molecular generation method based on subgraph-variational self-encoding structure
  • Molecular generation method based on subgraph-variational self-encoding structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] Invention principle: In order to generate a brand-new drug molecule with one or several specified chemical properties, which can be synthesized in practice, the present invention proposes a molecular generation method based on the subgraph-variational self-encoding structure, namely First, through the gSpan algorithm (an algorithm for generating frequent subgraphs based on graph pattern search), all frequent subgraphs in the training set molecules are obtained, and the subgraphs that meet the conditions are selected as the total set of molecular subgraphs, and then the molecules are divided. is a ring or frequent sub-atlas, and then use the multiscale variational self-encoding model (multiscale VAE (variational autoencoder)) to reconstruct the original molecular map. We maintain and transfer relevant chemical structures and properties through subgraphs, and then explore more potential molecular structures through hidden variable space, and then perform automatic combinat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A molecular generation method based on a subgraph variation self-encoding structure comprises the steps: mining frequent subgraph sets of all molecules in a training set, and then screening frequent subgraphs through counting thresholds and pharmaceutical properties based on statistics to obtain a molecular subgraph total set S* meeting conditions; performing graph decomposition on each molecule Gin the training set to obtain a corresponding molecule sub-graph set S; inputting the molecular set S into an encoder for encoding to obtain a corresponding hidden space vector Z; and finally, inputting Z into a decoder for decoding to obtain a reconstructed molecule G. The method overcomes the defects of chemical property loss, difficulty in synthesis and the like caused by the traditional atom-by-atom generation method, ensures certain chemical properties of the molecule through the application of the substructure, and further generates a molecular structure with stable chemical propertiesand easiness in synthesis; the method brings great convenience to the discovery and design of novel drug small molecules, reduces the dependence of related workers on domain knowledge, improves the design and development efficiency of novel drugs, and shortens the research and development cycle.

Description

technical field [0001] The invention belongs to the field of artificial intelligence drug research and development, in particular to the field of drug research and development based on molecular subgraph-self-encoding structure molecular generation and optimization. Background technique [0002] In the field of drug molecule generation and optimization, due to the large number of molecular arrangements and combinations, the molecular space formed is too large, and the space for screening through artificial custom rules is limited. How to use existing molecular data to automatically find the most relevant target molecule? Molecules or molecular spaces are critical. Traditional methods usually analyze the chemical properties of known molecules and target molecules, and artificially design new molecular structures. This method is very inefficient, and this process is very dependent on the domain knowledge and experience of chemists, so some people propose a method based on mac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16C20/50G16C20/70
CPCG16C20/50G16C20/70
Inventor 李成涛柳俊宏
Owner GALIXIR TECH (BEIJING) LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products