Protein multi-source feature fusion drug-target affinity prediction method

A feature fusion and protein technology, applied in the field of bioinformatics, can solve the problems of unresolved protein structure, ignoring the prior knowledge of protein biology, ignoring the expression of protein features, etc., and achieve the effect of improving the prediction accuracy

Pending Publication Date: 2022-07-08
OCEAN UNIV OF CHINA
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The existing one-dimensional protein sequence-based and two-dimensional spatial structure-based affinity prediction models only extract features for the sequence or structure of a single target protein, while ignoring the rich biological prior knowledge between proteins, such as protein The interaction and homology information between them also ignore the expression of protein characteristics such as subcellular location and family, and the accuracy of affinity prediction needs to be further improved; in addition, the affinity prediction model based on two-dimensional structure still needs to rely on protein structures, while a large number of protein structures remain unresolved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein multi-source feature fusion drug-target affinity prediction method
  • Protein multi-source feature fusion drug-target affinity prediction method
  • Protein multi-source feature fusion drug-target affinity prediction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Thus, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the invention as claimed, but is merely representative of selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative work fall within the protection scope of the present invention.

[0058] In order to maximize the use of biological prior knowledge, the present invention collected 18552 human proteins from the SwissProt database, which covered the limited number of target proteins required by the present invention...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for predicting drug-target affinity based on protein multi-source feature fusion, which comprises the following steps: firstly, constructing a PPI network and an SSN network, extracting protein features from the networks, and then collecting protein features such as subcellular positions, sequence codes, functional sites and structural domains for protein characterization; and fusing multi-source features by using a variational graph auto-encoder, and finally, inputting the multi-source features into a full-connection layer in combination with drug branches to carry out affinity prediction. According to the invention, a PPI network and an SSN network are constructed, so that biological priori knowledge between a target protein and other proteins is learned in addition to focusing on the characteristics of the target protein; according to the method, the protein characteristics are extracted and fused from the aspects of protein interaction, sequence similarity and protein subcellular positions for the first time, so that the drug-target affinity is predicted, and the prediction accuracy is improved; in addition, the characteristic source of the protein does not comprise a protein structure, so that the dependence on the protein structure is abandoned.

Description

technical field [0001] The invention relates to the technical field of biological information, in particular to a method for predicting drug-target affinity based on fusion of protein multi-source features. Background technique [0002] After the drug molecule enters the human body, it binds to the target protein, so as to produce drug effect and achieve the purpose of treating the disease. In the process of drug research and development, researchers will obtain a large number of candidate molecules, and it is very necessary to conduct virtual screening of these molecules. Predicting drug-target binding affinity is one of the important links, which indicates the relationship between drug molecules and target proteins. Bond strength. By predicting drug-target binding affinity to screen out molecules that do not meet affinity requirements, the search space for potential drug molecules is greatly reduced, eliminating the need for medical researchers to conduct extensive clinic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/30G16B50/30G16B40/30G16C20/30G06N3/04G06N3/08
CPCG16B15/30G16B50/30G16B40/30G16C20/30G06N3/088G06N3/045
Inventor 魏志强马文健张树刚
Owner OCEAN UNIV OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products