Deep-learning-based protein function prediction method fusing multiple features

A technology of protein function and deep learning, which is applied in the field of protein function prediction that integrates multiple features, can solve problems such as low accuracy and no protein labeling function, and achieve the effect of good applicability and improved accuracy

Active Publication Date: 2019-07-30
CENT SOUTH UNIV
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although there are already a large number of computational methods for protein function prediction, the accuracy is generally low
[0005] Protein function prediction is considered to be a huge challenge, mainly for the following reasons: First, a large number of proteins have not yet been annotated. For example, in the UniProt database, less than 1% of proteins have been annotated with functions. method for protein function prediction to further improve the accuracy of protein function prediction
Second, researchers usually use Gene Ontology (Gene Ontology) to label protein functions. Gene Ontology contains more than 40,000 functional categories. Not only does a protein contain multiple functions, but complex biological processes and functions require multiple Proteins act together; that is, protein function prediction is a large-scale, multi-label, multi-classification problem
Third, different data are heterogeneous and complex, so how to use multiple data for protein function prediction is a difficult problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep-learning-based protein function prediction method fusing multiple features
  • Deep-learning-based protein function prediction method fusing multiple features
  • Deep-learning-based protein function prediction method fusing multiple features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to better explain the present invention and facilitate understanding, the present invention will be described in detail below through specific embodiments in conjunction with the accompanying drawings.

[0032] The present invention provides a protein function prediction method based on deep learning fusion of multiple features, such as figure 1 shown, including the following steps:

[0033] Step S1, obtaining the sequence information, function information and homology information of the target protein.

[0034] The pure amino acid sequence does not produce any life significance for organisms, and the specific spatial conformation formed by these linear sequences after rotation and folding has biological functions and regulates the life activities of the human body. Generally, the functional structure of proteins is analyzed from four levels: the primary protein structure is composed of a series of strings, and each character in the string represents the amino...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a deep-learning-based protein function prediction method fusing multiple features. The deep-learning-based protein function prediction method includes the steps: S1, extractingsemantic structure features, subsequence features and network topology structure features of proteins according to sequence information, action information and homologous information of the proteins;and S2, inputting the semantic structure features, the subsequence features and the network topology structure features of the proteins into a pre-trained protein function prediction model, and outputting a classification result, wherein the protein function prediction model comprises: extracting local semantic features of the proteins according to the semantic structure features of the proteins;according to subsequence features of the proteins, extracting denser and higher-level subsequence features of the proteins; fusing the local semantic features of the proteins, denser and higher-levelsubsequence features and network topological structure features of the proteins to obtain protein classification fusion features; and inputting the protein classification fusion features into a function classification module, and outputting a classification result. The deep-learning-based protein function prediction method fusing multiple features obviously improves accuracy of predicting the protein function.

Description

technical field [0001] The present invention relates to the technical field of bioinformatics, in particular to a protein function prediction method based on deep learning fusion of multiple features. Background technique [0002] The function of proteins plays a very important role in the research of biotechnology and medicine, such as the development of new drugs, the development of new crops and the development of synthetic biochemicals such as biofuels. [0003] The early method of predicting protein function was through in vivo and in vitro experiments, including gene knockout, targeted mutation and inhibition of gene expression, etc. These experimental methods required a lot of manpower and time. [0004] In order to alleviate the above problems, some computational methods have been used for protein function prediction, and the development of high-throughput sequencing technology has provided a large amount of effective data for computational methods. Examples include...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B20/00G16B40/00
CPCG16B20/00G16B40/00Y02A90/10
Inventor 李敏张富豪宋虹
Owner CENT SOUTH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products