Protein function prediction method, device, equipment and storage medium

A protein function prediction technology, applied to protein function prediction methods, devices, equipment and storage media. It addresses the low accuracy of protein function prediction caused by the lack of an effective prediction method, reducing the loss of feature information and improving the accuracy and efficiency of prediction.

Active Publication Date: 2022-03-22
SHENZHEN UNIV

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a protein function prediction method, device, equipment and storage medium, aiming to solve the problem that the accuracy of protein function prediction is low because the prior art fails to provide an effective protein function prediction method.

Examples

Embodiment 1

[0026] Figure 1 shows the implementation flow of the protein function prediction method provided by the first embodiment of the present invention. For convenience of explanation, only the parts related to this embodiment are shown, as detailed below:

[0027] In step S101, when a protein function prediction request is received, the protein sequence to be predicted input by the user is acquired.

[0028] The embodiments of the present invention are applicable to protein function prediction platforms or systems. When a protein function prediction request is received, the protein sequence input by the user and to be predicted is obtained, so as to perform function prediction on the protein sequence.

[0029] In step S102, the protein sequence is divided to obtain corresponding amino acid fragments.

[0030] In the embodiment of the present invention, the protein sequence is usually composed of hundreds of amino acids. In order to impr...
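
The division in step S102 can be sketched as follows. The excerpt above does not show how fragments are delimited, so this minimal example assumes overlapping fixed-length windows with a hypothetical length `k = 3`. The function name `split_into_fragments` is likewise illustrative and not taken from the patent.

```python
from typing import List


def split_into_fragments(sequence: str, k: int = 3) -> List[str]:
    """Split a protein sequence into overlapping fragments of length k.

    Assumption: fixed-length overlapping windows. The source text does not
    specify the division scheme or the fragment length.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    seq = sequence.strip().upper()
    # A sequence shorter than k yields an empty list.
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


# Toy input, not a real protein.
fragments = split_into_fragments("MKTAYIA")
print(fragments)
```

Overlapping windows keep every residue in the context of its neighbours, which is one plausible reason to prefer them over disjoint chunks. The patent may use a different scheme.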

Embodiment 2

[0040] Figure 2 shows the implementation flow of the protein function prediction method provided by the second embodiment of the present invention. For convenience of explanation, only the parts related to this embodiment are shown, as detailed below:

[0041] In step S201, a protein sequence set is obtained, and the protein sequence set includes protein training sequences and functional annotations of the protein training sequences.

[0042] In the embodiment of the present invention, the protein sequence set is a training sample set used for dictionary training and machine learning model training. For ease of distinction, a protein sequence in the protein sequence set is called a protein training sequence. The protein sequence set includes multiple protein training sequences and the functional annotation corresponding to each protein training sequence. The protein sequence set can be from the UniP...
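
One way to picture the training sample set of step S201: each record pairs a protein training sequence with its functional annotations, and the fragments of all training sequences form the corpus on which a dictionary could be trained. The sketch below is illustrative only. The record type `TrainingRecord`, the 3-residue overlapping fragments and the placeholder labels are assumptions, and the dictionary training itself is not shown.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TrainingRecord:
    """One entry of the protein sequence set (hypothetical layout)."""
    sequence: str            # protein training sequence
    annotations: List[str]   # functional annotations of that sequence


def build_corpus(records: List[TrainingRecord], k: int = 3) -> List[List[str]]:
    """Turn every training sequence into a 'sentence' of fragments.

    Assumption: overlapping windows of length k stand in for the division
    step, whose details are not given in this excerpt.
    """
    corpus = []
    for record in records:
        seq = record.sequence.upper()
        corpus.append([seq[i:i + k] for i in range(len(seq) - k + 1)])
    return corpus


# Toy records: sequences and labels are placeholders, not real data.
records = [
    TrainingRecord("MKTAY", ["FUNC_A"]),
    TrainingRecord("GAVLK", ["FUNC_A", "FUNC_B"]),
]
corpus = build_corpus(records)
labels = [record.annotations for record in records]
print(corpus)
print(labels)
```

Keeping the annotations next to each sequence lets the same records serve both purposes named in the text: the fragment corpus for dictionary training and the labels for model training.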

Embodiment 3

[0054] Figure 3 shows the structure of the protein function prediction device provided by the third embodiment of the present invention. For convenience of description, only the parts related to this embodiment are shown, including:

[0055] The sequence obtaining unit 31 is configured to obtain the protein sequence input by the user and to be predicted when a protein function prediction request is received.

[0056] In the embodiment of the present invention, when a protein function prediction request is received, the protein sequence to be predicted input by the user is obtained, so as to perform function prediction on the protein sequence.

[0057] The fragment division unit 32 is configured to divide the protein sequence to obtain corresponding amino acid fragments.

[0058] In the embodiment of the present invention, the protein sequence is usually composed of hundreds of amino acids. In order to improve the efficiency of protein function p...
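
The two units described so far could be composed roughly as follows. This is a sketch under stated assumptions, not the device's actual design: the class names, the `"sequence"` request key, the validation against the 20 standard amino acid letters and the injected `triplets` strategy are all hypothetical.

```python
from typing import Callable, List

STANDARD_AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


class SequenceObtainingUnit:
    """Counterpart of unit 31: obtains the sequence to be predicted."""

    def obtain(self, request: dict) -> str:
        # Assumption: the request carries the user input under "sequence".
        sequence = request["sequence"].strip().upper()
        unknown = set(sequence) - STANDARD_AMINO_ACIDS
        if not sequence or unknown:
            raise ValueError(f"invalid protein sequence: {sorted(unknown)}")
        return sequence


class FragmentDivisionUnit:
    """Counterpart of unit 32: divides the sequence into fragments."""

    def __init__(self, strategy: Callable[[str], List[str]]):
        self._strategy = strategy  # the division strategy is injected

    def divide(self, sequence: str) -> List[str]:
        return self._strategy(sequence)


def triplets(sequence: str) -> List[str]:
    # Placeholder strategy: overlapping windows of three residues.
    return [sequence[i:i + 3] for i in range(len(sequence) - 2)]


unit_31 = SequenceObtainingUnit()
unit_32 = FragmentDivisionUnit(triplets)
fragments = unit_32.divide(unit_31.obtain({"sequence": " mktay "}))
print(fragments)
```

Injecting the division strategy keeps unit 32 independent of any particular fragment scheme, which matters here because the excerpt does not say which one the device uses.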

Abstract

The present invention is applicable to the technical field of bioinformatics, and provides a protein function prediction method, device, equipment and storage medium. The method includes: obtaining a protein sequence to be predicted; dividing the protein sequence to obtain corresponding amino acid fragments; querying a trained dictionary for the word vectors corresponding to the amino acid fragments; generating the feature values of the protein sequence based on these word vectors; predicting the function of the protein sequence according to its feature values and a trained machine learning model; and generating and outputting the function prediction result of the protein sequence. By obtaining feature values of protein sequences that carry context characteristics and performing machine learning on them, the accuracy and efficiency of protein function prediction are effectively improved.
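
The steps listed in the abstract can be sketched end to end. The abstract does not say how word vectors are combined into feature values or which model is used, so this minimal example makes two loudly labelled assumptions: the feature vector is the plain average of the fragment vectors, and a nearest-centroid rule stands in for the trained machine learning model. The dictionary, centroids and labels are made-up toy values.

```python
from typing import Dict, List

Vector = List[float]


def fragments_of(sequence: str, k: int = 3) -> List[str]:
    # Assumption: overlapping windows of length k.
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def sequence_features(sequence: str, dictionary: Dict[str, Vector]) -> Vector:
    """Average the word vectors of all fragments found in the dictionary."""
    vectors = [dictionary[f] for f in fragments_of(sequence) if f in dictionary]
    if not vectors:
        raise ValueError("no fragment of the sequence is in the dictionary")
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def predict_function(features: Vector, centroids: Dict[str, Vector]) -> str:
    """Stand-in model: return the label of the nearest centroid."""
    def squared_distance(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: squared_distance(features, centroids[label]))


# Toy 2-dimensional dictionary and model: all values are invented.
dictionary = {"MKT": [1.0, 0.0], "KTA": [0.0, 1.0], "TAY": [1.0, 1.0]}
centroids = {"FUNC_A": [1.0, 1.0], "FUNC_B": [-1.0, -1.0]}

features = sequence_features("MKTAY", dictionary)
print(features, predict_function(features, centroids))
```

Fragments missing from the dictionary are skipped here rather than treated as errors. That is a design choice of this sketch, and a real system would need its own policy for unseen fragments.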

Description

Technical field

[0001] The invention belongs to the technical field of bioinformatics, and in particular relates to a protein function prediction method, device, equipment and storage medium.

Background

[0002] At present, the main research objects of bioinformatics are genes and proteins. Because gene sequences and protein sequences are disordered, it is difficult to determine the specific functions and biochemical attributes of individuals that have not been collected when using traditional test methods. Across the research fields of bioinformatics, protein function prediction has struggled to achieve high accuracy. The main prediction approach uses the Gene Ontology established by the Gene Ontology Consortium to annotate each protein in a protein database. A prediction model is then established based on the characterized properties of the protein itself, and finally the function of the uncollected individuals is predicted throug...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G16B20/20
Inventor: 杜智华, 贺宇峰
Owner: SHENZHEN UNIV