Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Molecular property prediction method based on molecular spatial position coding attention neural network model

A technology of molecular properties and prediction methods, applied in the fields of machine learning/artificial intelligence, chemical informatics, and can solve problems such as inability to distinguish well, reduce model performance, and disadvantage

Active Publication Date: 2021-08-10
天津大学滨海工业研究院有限公司
View PDF13 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Secondly, using atoms directly as the basic unit of the graph is not conducive to maintaining molecular chemical information. Most existing graph networks use molecules as nodes for training, which often ignores the internal information between molecular substructures. Currently, based on sequences, chemical Molecules are regarded as sentence vectors. Through methods similar to natural language processing, representations of different molecules or the same substructure of the same molecule in different positions cannot be well distinguished, which reduces the performance of the model.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Molecular property prediction method based on molecular spatial position coding attention neural network model
  • Molecular property prediction method based on molecular spatial position coding attention neural network model
  • Molecular property prediction method based on molecular spatial position coding attention neural network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described below in conjunction with the accompanying drawings.

[0027] The method of the present invention encodes the spatial position in the 3D conformation of the molecule through machine learning technology, and better represents the influence of different positions in the molecule on the substructure, so as to better characterize the molecule and utilize the neural network structure and gate of the attention mechanism. A controlled loop network (GRU) was used to predict its chemical properties.

[0028] The present invention is based on the molecular space position coding attention neural network model, which is a machine learning model combining molecular space position information and topological information, such as: figure 1 As shown, the model consists of 3 parts: the embedding layer, the prediction model layer, and the output layer.

[0029] The present invention is based on the basic chemical basis that molecular struc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a molecular property prediction method based on a molecular spatial position coding attention neural network model. According to the method, spatial positions in a 3D conformation of a molecule are coded through a machine learning technology, so that the influence of different positions in the molecule on a substructure is better represented, the molecule is better represented, and meanwhile, a neural network structure of an attention mechanism and a gated loop network (GRU) are used for predicting chemical properties of the molecule. Herein, the topological relation of molecular substructures is fully utilized, experiments are carried out on a public data set, the effect of predicting the properties such as water solubility, toxicity and hydrophilicity of molecules is effectively improved, and thereby a new method is provided for drug molecule prediction.

Description

technical field [0001] The present invention relates to the fields of cheminformatics, machine learning / artificial intelligence, in particular to a theoretical method for predicting properties of unknown molecules by means of cheminformatics, machine learning / artificial intelligence based on spatial position coding. Background technique [0002] The drug R&D process has the characteristics of high capital density, high risk and long cycle, and requires a lot of capital, manpower and material resources. In the field of drug development, although many potential drug molecules have been extensively studied in animal models, there are still more than 30% of drug candidates that fail in practical applications because other intrinsic properties of the molecule do not meet the requirements. Property prediction work is of great value and can be used to better predict early molecular properties, which can greatly reduce the burden of later process failures, save a lot of resources an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16C20/30G16C20/70G16C20/20G06N3/04G06N3/08
CPCG16C20/30G16C20/70G16C20/20G06N3/04G06N3/08
Inventor 饶国政薛力源
Owner 天津大学滨海工业研究院有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products