Method and apparatus for molecular toxicity prediction based on multi-task graph neural network

A toxicity prediction, neural network technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as high thresholds, and achieve the effect of improving performance

Active Publication Date: 2021-08-13
NANJING UNIV OF POSTS & TELECOMM
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, when molecular toxicity is predicted based on traditional machine learning methods, the molecular characteristics of compounds are mainly extracted manually, which requires researchers to have a high professional knowledge background and a high threshold

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for molecular toxicity prediction based on multi-task graph neural network
  • Method and apparatus for molecular toxicity prediction based on multi-task graph neural network
  • Method and apparatus for molecular toxicity prediction based on multi-task graph neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0080] Example: such as figure 1 and figure 2 Described, this molecular toxicity prediction method based on multi-task graph neural network, comprises the following steps:

[0081] S1: Collect data sets about molecular toxicity in public databases and literature, remove inorganic and organic metals, salts and mixtures, discard chemical substances with missing label values ​​in the data set, remove duplicate molecules, and save the data set. Toxicity prediction labels are provided and molecular compounds are saved to the dataset as SMILES strings. :

[0082] S2: Preprocess the chemical molecular canonical expression SMILES, and divide the complete data set randomly into a training set and a test set according to a certain ratio, and a part of the training set is divided by k-fold cross-validation to verify the performance of the verification model set.

[0083] Use the chemical toolkit rdkit to preprocess the canonical expression SMILES molecular formula of each chemical m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and an apparatus for molecular toxicity prediction based on a multi-task graph neural network. The method comprises the following steps: S1, preparing a toxicity data set, and obtaining toxicity data represented by a chemical molecule standard expression; S2, generating an atomic node eigenvector by using the toxicity data which is obtained in the step S1 and represented by the chemical molecule standard expression; S3, generating a side information eigenvector by using the toxicity data which is obtained in the step S1 and represented by the chemical molecule standard expression; S4, on the basis of the atomic node eigenvector obtained in the step S2 and the side information eigenvector obtained in the step S3, constructing a molecular toxicity prediction model based on a multi-task graph neural network; S5, verifying performance of the model. According to the multi-task graph neural network designed for molecular toxicity datasets, an automatic learning molecular graph structure information model is constructed, and the performance of a toxicity prediction task can be improved by using a multi-task learning method in combination with the relevance between molecular toxicity tasks.

Description

technical field [0001] The invention relates to a molecular toxicity prediction method and device based on a multi-task graph neural network, which can be used in the technical field of artificial intelligence medicine. Background technique [0002] Drug toxicity refers to adverse effects on an organism due to the action or metabolism of a compound. In the early stages of drug discovery, molecular toxicity prediction is critical for early exclusion of drug candidates in clinical trials. About 30% of new drug development failures are due to safety and toxicity issues, so toxicity prediction is crucial in the drug discovery and development cycle. [0003] Traditionally, evaluating drug toxicity through in vivo biological experiments is usually time-consuming and labor-intensive, and drug toxicity prediction based on machine learning is an important supplement. Toxicity prediction based on machine learning starts from the molecular structure of the compound, and builds a mach...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16C20/70G06N3/04G06N3/08
CPCG16C20/70G06N3/084G06N3/047Y02P90/30
Inventor 姜榕吴建盛胡海峰朱燕翔
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products