Complete glycopeptide identifying method and system

A glycopeptide and complete technology, applied in the fields of glycoproteomics, mass spectrometry, and bioinformatics, can solve the problems of indistinguishability and poor reliability, and achieve the effects of accurate false discovery rate, improved reliability, and low computational complexity

Active Publication Date: 2016-10-12
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF5 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the above two methods, only the complete glycopeptide spectrum is used to identify the sugar chain part or peptide part of the glycopeptide, while the other part is directly inferred by mass, so the reliability is poor
For example, after using the complete glycopeptide spectrum to identify the sugar chain part of the glycopeptide, the mass of the peptide is estimated to be 999.5633, and there are many masses that can be matched within this error range, such as the two peptides of LTEAKPVDK and DVPKAETLK The quality is exactly the same, and both can be matched to 999.5633. If only the quality is used to match, the two cannot be distinguished at all.
Similarly, the method of inferring the composition of sugar chains only based on the quality of sugar chains also has the above problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Complete glycopeptide identifying method and system
  • Complete glycopeptide identifying method and system
  • Complete glycopeptide identifying method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] For ease of understanding, first give the meaning of some professional concepts involved in this article:

[0060] Glycosidic bond: a chemical bond formed by connecting monosaccharides and monosaccharides;

[0061] The reducing end of the sugar chain: refers to the end containing the free radical -CHO. In glycoproteomics, it refers to the end of the sugar chain that is connected to the peptide;

[0062] The reducing end ion of the sugar chain: after the glycosidic bond of the sugar chain is broken, the fragment ion at the reducing end is sometimes called the Y ion of the sugar chain

[0063] Y ion of glycopeptide: the reducing end ion of the sugar chain plus the peptide sequence of the complete glycopeptide;

[0064] B / y ion of glycopeptide: b / y ion of peptide part of glycopeptide, b / y ion does not contain any sugar chain structure;

[0065] Pentasaccharide core: fixed five monosaccharide structures generally present at the reducing end of N sugar chain, namely 2×HexNAc+3×Hex.

[0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a complete glycopeptide identifying method. The method includes the steps the a carbohydrate chain structure database is traversed for any actual measurement tandem mass spectrum to be identified, for each carbohydrate chain structure, the mass of glycopeptide Y ions possibly obtained in fragment tests is concluded according to the mass of parent ions in the current series connection spectrogram, then the number of spectrum peaks matched with the current second-level spectrum is calculated, and the number of the matched spectrum peaks is used as a coarse marking result of matching between the glycopeptide Y ions and the current second-level spectrum; carbohydrate chain structures with the top K coarse marking scores serve as candidate carbohydrate chain structures; for the current series connection spectrogram, all the candidate carbohydrate chain structures are traversed, spectrum-spectrum matching between actual measurement spectrums and theoretical spectrums of peptide fragments of each candidate carbohydrate chain structure is marked, spectrum-spectrum matching between actual measurement spectrums and theoretical spectrums of the carbohydrate chain structures is marked, and then the glycopeptide structure identifying result is obtained. Reliability of complete large-scale glycopeptide identification can be improved, and calculation complexity is low.

Description

Technical field [0001] The present invention relates to the technical field of bioinformatics. Specifically, the present invention relates to the technical field of glycoproteomics and mass spectrometry. Background technique [0002] Mass spectrometry is the main method to identify site-specific protein glycosylation modifications on a large scale. In mass spectrometry, the intact glycopeptide is usually identified by tandem mass spectrometry of the intact glycopeptide, and then the glycosylation modification on the protein is inferred. At present, for the identification of large-scale complete glycopeptide spectra, there are two identification strategies, namely ①The identification of sugar chains based on complete glycopeptide tandem spectra represented by GRIP, ArMone 2.0 and other systems, and then matching peptides based on peptide quality The method, as well as the method of identifying peptides based on intact glycopeptide spectrograms represented by Byonic and other syst...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G01N27/62
CPCG01N27/62
Inventor 曾文锋刘铭琪张晓今吴建强张扬孙瑞祥杨芃原贺思敏
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products