Supercharge Your Innovation With Domain-Expert AI Agents!

Prediction method and device of protein signal peptide based on geometric graph neural network

A neural network and prediction method technology, applied in the field of protein signal peptide prediction, can solve the problem of overfitting and cannot accurately identify the cleavage site, and achieve the effect of reducing overfitting and improving accuracy

Active Publication Date: 2021-02-23
WUHAN GENECREATE BIOLOGICAL ENG CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention aims at the problems of overfitting and inability to accurately identify the cleavage site existing in the existing methods for predicting protein signal peptides, and provides a prediction of protein signal peptides based on a geometric graph neural network in the first aspect of the present invention The method comprises the following steps: obtaining the protein sequence in the data set, extracting the character sequence of the signal peptide; performing feature extraction on the signal peptide to obtain the sequence evolution characteristics, physicochemical characteristics, structural characteristics, and statistical characteristics of the signal peptide; Fusing evolutionary features, physicochemical features, structural features, and statistical features to obtain a multidimensional vector; constructing a feature map of a signal peptide according to the character sequence and the multidimensional vector; using the feature map as an input to a geometric graph neural network model , train the geometric graph neural network until its error is lower than a threshold, save the geometric graph neural network model and use it to predict whether the amino acid sequence to be predicted contains a signal peptide fragment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prediction method and device of protein signal peptide based on geometric graph neural network
  • Prediction method and device of protein signal peptide based on geometric graph neural network
  • Prediction method and device of protein signal peptide based on geometric graph neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0026] refer to figure 1 , a method for predicting protein signal peptides based on a geometric graph neural network, comprising the following steps: S101. Obtain the protein sequence in the data set, and extract the character sequence of the signal peptide; S102. Perform feature extraction on the signal peptide to obtain the Sequence evolution features, physical and chemical features, structural features, and statistical features of the signal peptide; S103. Fusion evolutionary features, physical and chemical features, structural features, and statistical features to obtain a multidimensional vector; S104. According to the character sequence and the Constructing a feature map of the signal peptide from a multidime...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to a method and device for predicting protein signal peptides based on a geometric graph neural network. The method includes: acquiring protein sequences in a data set, extracting character sequences of signal peptides; performing feature extraction on the signal peptides to obtain the The sequence evolution features, physical and chemical features, structural features, and statistical features of the signal peptide; the evolutionary features, physical and chemical features, structural features, and statistical features are fused to obtain a multidimensional vector; and the signal is constructed according to the character sequence and the multidimensional vector. The feature map of the peptide; using the feature map as the input of the geometric graph neural network model, training the geometric graph neural network until its error is lower than a threshold, saving the geometric graph neural network model and using it to predict the signal to be predicted peptide. The invention combines the geometric graph neural network and the character sequence, reduces the over-fitting of the model while retaining the original information of the feature extraction, and improves the accuracy of signal peptide prediction.

Description

technical field [0001] The invention relates to the fields of biological information and deep learning, in particular to a method and device for predicting protein signal peptides based on geometric graph neural networks. Background technique [0002] In organisms, most proteins do not function in the form of monomers, but perform different biological functions in the form of interactions. Among them, protein-protein interaction (Protein-Protein Interaction, PPI) refers to the process of forming a protein complex by two or more molecular proteins through covalent bonds. Protein interactions play important roles in most biochemical functions. For example, signaling molecules interact with proteins to transmit signals from outside the cell to the inside of the cell, and signal transmission is the basis for many functions. For protein interaction, in essence, it is realized through the mutual binding of some residues on the protein, and these residues are called protein-prote...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16B40/10G16B15/00G16B30/00
CPCG16B15/00G16B30/00G16B40/10
Inventor 华权高舒芹赵愿安
Owner WUHAN GENECREATE BIOLOGICAL ENG CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More