Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A de novo peptide sequencing method and system based on hcd and etd mass spectrometry

A mass spectrometry and peptide technology, applied in the field of bioinformatics, can solve problems such as slow calculation speed, large search space, and large number of amino acids, and achieve the effects of improving performance, improving accuracy, and avoiding interference

Active Publication Date: 2016-01-06
INST OF COMPUTING TECH CHINESE ACAD OF SCI +1
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the peptide de novo sequencing method is not as widely used as the database search method. There are two main reasons: (1) The fragmentation information of the mass spectrum is incomplete, and quite a few spectra cannot be obtained using the de novo sequencing method. There are too many amino acids without fragmentation information; (2) The de novo sequencing method is equivalent to searching for the correct peptide in all amino acid combinations that meet the mass error range of the parent ion, and the search space is large, so that the calculation speed is very slow
In addition, it is generally believed that a spectral peak can only match one type of ion, so searching for a path in a mass spectrum needs to consider antisymmetric constraints, and searching for an antisymmetric path in a directed acyclic graph is an NP-hard problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A de novo peptide sequencing method and system based on hcd and etd mass spectrometry
  • A de novo peptide sequencing method and system based on hcd and etd mass spectrometry
  • A de novo peptide sequencing method and system based on hcd and etd mass spectrometry

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments to further understand the purpose, solution and effect of the present invention, but it is not intended to limit the scope of protection of the appended claims of the present invention.

[0055] figure 1 It is a flowchart of the peptide de novo sequencing method based on HCD and ETD mass spectrograms of the present invention. As shown, the method includes:

[0056] Step 1. Corresponding the mass spectra of HCD and ETD to form a new mass spectrum, performing preprocessing on the new mass spectrum, detecting effective spectrum peaks and deleting interference spectrum peaks;

[0057]Step 2, constructing a directed acyclic graph according to the effective spectrum peak;

[0058] Step 3, searching for a path conforming to weight rules in the directed acyclic graph, and generating candidate peptides according to the path; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a peptide de novo sequencing method and system based on HCD and ETD mass spectrograms. The method includes: step 1, corresponding HCD and ETD mass spectrograms to form a new mass spectrogram, and pre-processing the new mass spectrogram. Processing, detecting effective spectral peaks and deleting interference spectral peaks; step 2, constructing a directed acyclic graph according to the effective spectral peaks; step 3, searching for a path that meets the weight rule in the directed acyclic graph, and according to The path generates candidate peptides; step 4, matching and scoring the candidate peptides with the new mass spectrum, sorting and outputting the candidate peptides according to the scoring results. The invention makes up for the shortcoming of incomplete spectral peaks caused by a single fragmentation type, combines the respective advantages of HCD and ETD spectral diagrams, and improves the accuracy of de novo sequencing. At the same time, preprocessing is performed before de novo sequencing to remove a large number of isotopic peaks and noise peaks to avoid interference with the de novo sequencing algorithm. And use a more discriminative scoring algorithm to improve the performance of de novo sequencing.

Description

technical field [0001] The invention relates to the field of biological information, in particular to a peptide de novo sequencing method and system based on HCD and ETD mass spectrograms. Background technique [0002] Peptide and protein identification using tandem mass spectrometry has become the core technology of current proteomics. Currently, there are two main technical routes for protein identification based on tandem mass spectrometry: database searching and de novo peptide sequencing. Due to the rapid development and improvement of protein databases, database search has become the main technology for protein identification. However, the de novo sequencing method does not rely on the existing database. According to the characteristics of the regular fragmentation of the peptide, the sequence of the peptide is directly deduced from the map, which can analyze the tandem mass spectrometry data of new species or species whose genome has not been sequenced, and has datab...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G01N27/62
Inventor 迟浩陈海丰孙瑞祥董梦秋贺思敏
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products