Prediction algorithm for recognizing tyrosine posttranslational modification sites

A post-translation modification and prediction algorithm technology, applied in computing, special data processing applications, instruments, etc., can solve problems such as few training samples, unsatisfactory prediction performance, and long time consumption, and achieve the effect of improving prediction ability.

Inactive Publication Date: 2017-12-12
NANCHANG UNIV
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] There are a variety of experimental methods to identify the post-translational modification sites of tyrosine, but these experimental techniques have low identification efficiency, time-consuming and high cost
The methods and tools for predicting tyrosine post-translational modification sites using bioinformatics methods, such as: GPS-NO2 platform for predicting tyrosine nitration sites constructed based on sequence information by Li et al.; Huang et al. , physicochemical properties and autocorrelation coefficient extraction features to identify the predictor of tyrosine sulfuration sites; Xue et al. based on GPS-based tools for hierarchically predicting kinase-specific phosphorylation; and Gao et al. based on amino acid sequence similarity The Musite tool established by features such as disorder scoring and amino acid frequency predicts kinase-specific phosphorylation sites; there are still the following defects: it can only predict tyrosine nitration or sulfuration or phosphorylation, but cannot predict all three at the same time Modification; the training samples collected when building the model are relatively small; the features are not optimized and screened, and the prediction performance is not ideal; except for GPS and Musite, the corresponding prediction software has not been developed for other methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prediction algorithm for recognizing tyrosine posttranslational modification sites
  • Prediction algorithm for recognizing tyrosine posttranslational modification sites

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0061] a kind of like figure 1 The specific steps of the prediction algorithm for identifying tyrosine post-translational modification sites shown are as follows:

[0062] 1) Data Collection

[0063] Collect data on tyrosine nitration, sulfuration and phosphorylation post-translational modifications from protein databases and related literature in recent years. The protein database is at least one of the PhosphoSitPlus database, UniProtKB / Swiss-Prot database, SYSPTM database and dbPTM database species, the PhosphoSitPlus database only collects data that can undergo tyrosine phosphorylation with relevant kinase annotations;

[0064] 2) Data processing

[0065] Use the CD-HIT program to remove the homology collected from several different protein databases, that is, highly homologous protein sequences with the same or similarity greater than 30%, to obtain non-redundant tyrosine nitrosylation, sulfuration and phosphorylation modifications The positive sample data set and the ...

Embodiment 2

[0092] The prediction software platform TyrPred was applied to predict the tyrosine nitration site and tyrosine sulfuration site of the protein named "B2RSH2".

[0093] The prediction software is TyrPred, a prediction software platform developed by using MATLAB software and C# programming language to build the optimal model based on SVM. The prediction software platform TyrPred, after the user submits at least one unknown protein sequence in FASTA format and selects the type of post-translational modification to be predicted, will efficiently return the prediction information of potential tyrosine post-translational modification sites, and realize simultaneous analysis of the complete protein. High-throughput prediction of tyrosine nitration, sulfuration and phosphorylation sites, prediction information includes protein name, modification site position, flanking residues of modification site and SVM probability value.

[0094] To predict the nitration site of the protein seque...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a prediction algorithm for recognizing tyrosine posttranslational modification sites. The algorithm comprises the steps of data collection, data processing, feature coding, feature optimization and model training and evaluation. The invention furthermore discloses application of the prediction algorithm. According to the algorithm, features of the tyrosine posttranslational modification sites are extracted comprehensively from the perspectives of protein sequence information, evolutional information and physical and chemical properties, Elastic Net is used as an optimization means to automatically select variables to screen multidimensional features, redundant information is removed, a prediction model of tyrosine nitration, sulfuration and phosphorylation sites is constructed in combination with an SVM, the prediction capability of the prediction model is improved, and the prediction quality of the tyrosine posttranslational modification sites is remarkably improved. Through a developed prediction software platform TyrPred, predictive analysis of nitration modification sites, sulfuration modification sites and phosphorylation modification sites of tyrosine on intact protein is realized, and a convenient, economical and rapid research tool and important reference are provided for research of tyrosine posttranslational modification.

Description

technical field [0001] The present invention relates to digital computing or data processing equipment or data processing methods especially for specific applications, and in particular to a prediction algorithm for identifying tyrosine post-translational modification sites. Background technique [0002] Tyrosine post-translational modifications include nitrosation, sulfuration and phosphorylation. Tyrosine nitration is mainly due to the interaction between reactive oxygen species and reactive nitrogen species in tissue cells. A large amount of reactive oxygen species and reactive nitrogen species can directly damage proteins, nucleic acids and lipid macromolecules. Studies have shown that a variety of human diseases such as atherosclerosis, Parkinson's disease, chronic renal failure, etc. are related to tyrosine nitration. Tyrosine sulfuration is mainly tyrosyl protein sulfotransferase catalyzing the sulfuration reaction of tyrosine residues in proteins. This enzyme can ca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/18G06F19/20G06F19/24
CPCG16B20/00G16B25/00G16B40/00
Inventor 施绍萍曹曼陈国东
Owner NANCHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products