Protein phosphorylation site prediction method based on inner product self-attention neural network

A prediction method and neural network technology, applied in neural learning methods, biological neural network models, neural architectures, etc., to achieve the effects of high prediction accuracy, guaranteed accuracy, and low computational cost

Pending Publication Date: 2021-07-09
ZHEJIANG UNIV OF TECH
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In summary, the existing protein phosphorylation site prediction methods are still far from the requirements of practical application in terms of calculation cost and prediction accuracy, and urgently need to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Protein phosphorylation site prediction method based on inner product self-attention neural network
  • Protein phosphorylation site prediction method based on inner product self-attention neural network
  • Protein phosphorylation site prediction method based on inner product self-attention neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention will be further described below in conjunction with the accompanying drawings.

[0050] refer to figure 1 and figure 2 , a protein phosphorylation site prediction method based on inner product self-attention neural network, comprising the following steps:

[0051] 1) Input a protein sequence whose number of amino acid residues is L to be predicted for phosphorylation site, denoted as S;

[0052] 2) Use the one-hot encoding method to digitally encode the 20 common amino acid types that make up the protein, as follows:

[0053] 'A': [1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0054] 'C': [0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0055] 'D': [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]

[0056] 'E': [0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0057] 'F': [0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0058] 'G': [0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0059] 'H': [0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0]

[0060] 'I': [0,0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a protein phosphorylation site prediction method based on an inner product self-attention neural network. The method comprises the steps that: firstly, a protein sequence which has an quantityL of amino acid residues and is to be subjected to phosphorylation site prediction is input, and the protein sequence is converted into an L * 20 feature matrix Mfea through one-hot coding expression forms of 20 common amino acids; then, for each residue of the protein, a characteristic matrix M with the size of w * 20 is acquired by using a sliding window according to the Mfea; protein sequences of known phosphorylation site tags are obtained from a database to construct a training set; a neural network framework is built, the network is trained by using the training set, and a prediction model is obtained; and finally, the feature matrix of the protein sequence residues to be predicted is input into the trained model, and whether target residues in the protein sequence are phosphorylation sites or not is predicted according to a probability value output by the model. The method is low in calculation cost and high in prediction accuracy.

Description

technical field [0001] The present invention relates to the field of bioinformatics, deep learning and computer application, in particular to a protein phosphorylation site prediction method based on inner product self-attention neural network. Background technique [0002] Protein phosphorylation is a post-translational modification process widely present in eukaryotes, which plays an important role in various biological processes such as energy metabolism, signal transduction pathways, neural activity, cell cycle and apoptosis. Accurate identification of protein phosphorylation sites can not only help us understand complex protein biological systems, but also guide the design of basic biomedical drugs. [0003] Currently, methods for predicting protein phosphorylation sites through deep learning include: NetPhos3.0 (Blom, N. et al. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence [J]. Proteomics, 2004: 4 , 1633–16...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/20G06K9/62G06N3/04G06N3/08
CPCG16B15/20G06N3/08G06N3/047G06N3/045G06F18/214
Inventor 胡俊贾宁欣曾文武殷文杰董明张贵军
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products