Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Drug small molecule-protein target reaction prediction method based on multi-dimensional information

A prediction method and technology for drug molecules, applied in molecular design, chemical process analysis/design, chemical statistics, etc., can solve the problems of inconsistent, unreasonable size of distance matrix, inconsistent SMILES sequence length, etc., to improve speed and accuracy , the effect of high practical value

Pending Publication Date: 2021-02-05
GALIXIR TECH (BEIJING) LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Nevertheless, in the process of DTI prediction, there are still the following difficulties: 1) The size of the distance matrix of different proteins is inconsistent, since each element in the matrix represents the distance between a pair of amino acids, so different distance matrices It is unreasonable to directly down-sample and scale to the same size; 2) The SMILES sequence lengths of different drug molecules are inconsistent, and an appropriate model needs to be used to obtain semantic features; 3) The training data set for DTI tasks is relatively small, so Designing an appropriate network structure is critical

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Drug small molecule-protein target reaction prediction method based on multi-dimensional information
  • Drug small molecule-protein target reaction prediction method based on multi-dimensional information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] In order to accurately describe the action process of drug small molecule-protein target and improve the prediction accuracy of drug small molecule-protein target action, this application proposes a multi-dimensional information-based response prediction method. Such as figure 1 As shown, the method semantically encodes small drug molecules and protein targets, respectively. In terms of drug small molecule coding, in order to increase the expressive ability of features, we introduced the contextual semantic expression method based on ELMo and the pre-trained language model based on BERT to describe the general features of small drug molecules, and at the same time integrated the small molecule itself. Chemical properties and pharmaceutical properties. In terms of protein target coding, we use amino acid sequences as input, and also obtain the expression of semantic vectors based on BERT, and learn the local structure information and spatial structure information of pro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a drug small molecule-protein target reaction prediction method based on multi-dimensional information. The method comprises the following steps: (1), obtaining feature representation of a protein target, including an amino acid vector, a protein three-dimensional space structure feature and a protein chemical feature; (2), obtaining feature representations of the drug small molecules, including semantic features of the drug molecules, general network features of BERT and chemical features of the drug molecules; (3), fusing the features of protein targets and drug smallmolecules; (4), using the fused features as input of a classifier, and training the whole network in combination with labels in a training set. The fusion of three different types of features can greatly mine the potential features of drugs and proteins, and is convenient for more accurate DTI prediction. The method can improve the DTI prediction speed and accuracy at the same time, and is higherin practical value.

Description

technical field [0001] This application relates to artificial intelligence drug research and development, specifically to the field of drug small molecule-protein target interaction. Background technique [0002] Prediction of drug target interactions (drug target interactions, DTIs) is the basis for studying the therapeutic effect of drugs and the side effects of most drugs, and plays a vital role in the process of drug development. Although some experiments have verified that DTI (drug-target interaction) is a relatively reliable method, the combination space of drug small molecule-protein targets is too large, and it will consume huge manpower and computing resources to verify them one by one through experiments. Generally, DTI prediction methods can be divided into two categories: physics-based methods and machine learning-based methods. Among them, physics-based methods, such as molecular docking, utilize predetermined scoring functions to evaluate DTI at the atomic le...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16C20/10G16C20/50G16C20/70
CPCG16C20/10G16C20/50G16C20/70
Inventor 柳俊宏李成涛
Owner GALIXIR TECH (BEIJING) LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products