Tandem mass spectrometric identification method for protein based on matching between characteristic information of target database and decoy database

A feature information, two-stage mass spectrometry technology, applied in the field of protein two-stage mass spectrometry identification, can solve the problems of neglecting matching characteristics, affecting the efficiency and accuracy of database search, and errors of biological mass spectrometry instruments

Active Publication Date: 2016-04-27
YUNNAN MINZU UNIV
View PDF7 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

③ The error of the biological mass spectrometer itself: different error precision can greatly affect the efficiency and accuracy of database search
However, based on the premise of positive and negative libraries, exploring the matching characteristics of different types of fragment ions in different mass error ranges and intensity intervals is ignored in the existing protein secondary mass spectrometry identification algorithms.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tandem mass spectrometric identification method for protein based on matching between characteristic information of target database and decoy database
  • Tandem mass spectrometric identification method for protein based on matching between characteristic information of target database and decoy database
  • Tandem mass spectrometric identification method for protein based on matching between characteristic information of target database and decoy database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0101] The present invention will be described in further detail below in conjunction with the embodiments and accompanying drawings.

[0102] see Figure 4 As shown, in this embodiment, a protein secondary mass spectrometry identification method based on the matching of positive and negative library feature information includes the following steps:

[0103] (1) Download the protein reference sequence database, and reverse the protein reference sequence to obtain the protein sequence database including the positive library and the reverse library;

[0104] (2) virtual enzymolysis of the above-mentioned protein database sequence, and establish a peptide quality database and a peptide quality database index according to the mass number of the peptide after enzymolysis;

[0105] (3) Remove isotope peaks from the experimental spectrum to be analyzed, and select effective peaks reasonably to improve the signal-to-noise ratio of the experimental spectrum itself;

[0106] (4) Accor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a tandem mass spectrometric identification method for protein based on matching between characteristic information of a target database and a decoy database. The method mainly carries out statistics of matching between experimental peaks of different types and theoretical peaks of the target database and the decoy database in different error ranges and intensity intervals, then extracts novel characteristic information of tandem spectra and carries out mathematical quantification, and finally, fusing the quantified novel characteristic information into a protein tandem mass spectrometric identification algorithm scoring model. To verify reliability of PepFind algorithm, the PepFind algorithm is tested by using data sets generated by different mass spectrometric platforms; and identification results of the PepFind algorithm are compared with identification results obtained by using widely-applied commercial and related open-source tandem mass spectrometric identification software under the condition that FDR is 1%, and comparison results show that the PepFind algorithm has better identification number and sensitivity to experimental spectra. The tandem mass spectrometric identification method for protein based on matching between characteristic information of the target database and the decoy database can obviously improve the effective mass spectrometric number and the peptide fragment number of protein.

Description

technical field [0001] The invention relates to the field of protein secondary mass spectrometry identification, in particular to a protein secondary mass spectrometric identification method based on the matching of positive and negative library feature information. Background technique [0002] Tandem mass spectrometry (LC-MS / MS) is widely used in the identification and quantification of complex protein mixtures. In a traditional LC-MS / MS experiment, the peptide mixture obtained after enzymatic hydrolysis is separated by strong cation exchange chromatography and reversed-phase chromatography, and the obtained peptides flow into the biological mass spectrometer in sequence according to their own hydrophobicity. Using electrospray technology or Laser desorption technology ionizes and fragments the peptides entering the mass spectrometer, and simultaneously measures the mass information of the corresponding fragment ions, then selects the first few fragment ions with the highe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G01N30/72
CPCG01N30/72
Inventor 陈晓舟肖传乐李华梅陈君华
Owner YUNNAN MINZU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products