Protein residue contact diagram prediction method based on deep residual neural network

A neural network and prediction method technology, applied in the field of protein residue contact map prediction based on deep residual neural network, can solve the problems of high computational cost, lack of attention, and inability to guarantee prediction accuracy, so as to improve prediction accuracy. , the effect of improving forecasting efficiency

Inactive Publication Date: 2020-09-15
ZHEJIANG UNIV OF TECH
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the existing methods can be used to predict the contact map of protein residues, a large number of training data sets and machine learning algorithms are commonly used, so the calculation cost is high, and because the noise information in the training set has not received enough attention, the prediction accuracy is not enough. Guaranteed to be optimal
[0004] To sum up, the existing protein residue contact map prediction methods are still far from the requirements of practical application in terms of calculation cost and prediction accuracy, and urgently need to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein residue contact diagram prediction method based on deep residual neural network
  • Protein residue contact diagram prediction method based on deep residual neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described below in conjunction with the accompanying drawings.

[0047] refer to figure 1 and figure 2 , a method for predicting protein residue contact maps based on deep residual neural networks, including the following steps:

[0048] 1) Input a protein sequence P of length L to be predicted by residue contact map;

[0049] 2) Use the one-hot encoding method to represent 20 common amino acids, as follows:

[0050] 'A': [1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0051] 'C': [0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0052] 'D': [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]

[0053] 'E': [0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0054] 'F': [0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0055] 'G': [0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0056] 'H': [0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0057] 'I': [0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0]

[0058] 'K': [0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a protein residue contact diagram prediction method based on a deep residual neural network. The method comprises the following steps: firstly, inputting a protein sequence tobe subjected to protein residue contact diagram prediction; then, converting the protein sequence into a 20 * L matrix by utilizing one-hot representation forms of 20 common amino acids for the protein sequence, digitizing the protein sequence information, and calculating by utilizing the 20 * L matrix to obtain a 20 * L * L covariance tensor, i.e., the characteristics of the input network; secondly, establishing a deep residual neural network framework, collecting protein sequences and tags of existing protein contact tags from a PDB library, calculating feature tensors of the protein sequences, forming a data set by the feature tensors and the corresponding tags, and learning a prediction model on the data set by using the deep residual neural network framework; and finally, inputting the protein characteristic tensor to be subjected to protein residue contact diagram prediction into the model to obtain a protein sequence residue contact diagram. The method is low in calculation costand high in prediction precision.

Description

technical field [0001] The invention belongs to the fields of bioinformatics and computer applications, and in particular relates to a method for predicting protein residue contact maps based on deep residual neural networks. Background technique [0002] Protein is one of the most important macromolecules in biology and has a wide range of functions. The interaction between protein molecules is realized through the contact interaction between some residues. This interaction is ubiquitous in life activities. And indispensable. Therefore, accurately identifying the contacts between protein residues has important guiding significance for understanding protein functions, analyzing the relationship between biomolecules, and designing new drugs. [0003] Research literature found that many methods for predicting protein residue contact maps have been proposed, such as: DNCON2 (Adhikari, B., Hou, J. and Cheng, J. (2017) DNCON2: improved protein contact prediction using two-level ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/30G16B50/30G06N3/04
CPCG16B15/30G16B50/30G06N3/045
Inventor 胡俊樊学强郑琳琳白岩松张贵军
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products