Prokaryotic acetylation site prediction method based on information fusion and deep learning

A prokaryotic and deep learning technology, applied in the field of bioinformatics, can solve the problems of underestimating the importance of multi-information fusion, limited prediction accuracy, and failure to integrate multiple feature information

Active Publication Date: 2020-04-24
QINGDAO UNIV OF SCI & TECH
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Secondly, when predicting acetylation sites, a single feature extraction algorithm cannot effectively extract protein sequence information, and does not integrate multiple feature information, which underestimates the importance of multi-information fusion
Finally, we found that classifiers for acetylation site prediction are limited by support vector machines, random forests, logistic regression, etc., with limited prediction accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prokaryotic acetylation site prediction method based on information fusion and deep learning
  • Prokaryotic acetylation site prediction method based on information fusion and deep learning
  • Prokaryotic acetylation site prediction method based on information fusion and deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to make the object, technical solution and advantages of the present invention clearer, the invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0055] A prokaryotic acetylation site prediction method based on information fusion and deep learning, such as figure 1 shown, including the following steps:

[0056] 1) Collect acetylation modification site information: Obtain 9 prokaryotic acetylation site data sets from relevant literature, and generate category labels corresponding to positive and negative samples.

[0057] 1-1) The data sets of lysine acetylation sites of 9 categories of prokaryotes were constructed, namely E.coli, S.typhimurium, Bacillus subtilis (B.subtilis), Vibrio parahemolvticus (V.parahemolvticus), Mycobacterium tuberculosis (...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a prokaryotic acetylation site prediction method based on information fusion and deep learning, and relates to the technical field of biological information. According to the prediction method, multi-information fusion is introduced, feature coding is carried out on protein sequences from multiple aspects of sequence information, physicochemical information and evolution information, and the difference between acetylation site and non-acetylation site protein sequences is fully reflected. The original special diagnosis space is screened and optimized by Group Lasso, important features of an acetylation site recognition model are reserved, the optimal feature subset is obtained, and the model calculation speed and quality are improved. The deep neural network generates features with discrimination through hierarchical learning of the optimal feature subset, the acetylation site and the non-acetylation site in the protein sequence are effectively distinguished, the calculation time is saved at the same time, other costs and related limitations are avoided, deep understanding of an acetylation molecular mechanism can be facilitated, and valuable reference information can be provided for experimental verification related to acetylation site recognition.

Description

technical field [0001] The invention relates to the technical field of biological information, in particular to a prokaryotic acetylation site prediction method based on information fusion and deep learning. Background technique [0002] Protein post-translational modification is a regulatory mechanism that plays an important role in normal and pathological cell physiology. At present, hundreds of protein post-translational modification types have been discovered, among which the regulation of lysine acetylation on metabolism is one of the important advances in the field of post-translational modification research in recent years. The process of covalently attaching acetyl groups to lysine residues enzymatically. That is, the acetyl group is covalently linked to a specific lysine residue by lysine acetyltransferase, and the acetyl group is removed by lysine deacetylase. [0003] Acetylation modification is extremely conserved during the evolution of life, and the regulatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B20/30G16B5/00G16B40/00G06K9/62G06N3/04G06N3/08
CPCG16B20/30G16B5/00G16B40/00G06N3/08G06N3/045G06F18/25Y02P90/30
Inventor 于彬禹昭敏陈成陈瑞欣王磊
Owner QINGDAO UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products