Supercharge Your Innovation With Domain-Expert AI Agents!

Tandem mass spectrometry data precursor ion detection model training method and precursor ion detection method

A tandem mass spectrometry and detection model technology, applied in the field of bioinformatics, can solve the problems of cumbersome use and difficult to meet real-time data analysis, and achieve the effect of high recall rate and fast detection speed

Active Publication Date: 2016-09-28
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current algorithms such as Hardklor, pParse, and MaxQuant all need to manually select the relevant parameters of each feature, and compare repeatedly on multiple data sets, and gradually adjust the value of each parameter to achieve a certain recall rate, so it is very cumbersome to use.
In terms of detection speed, even with the fastest Hardklor algorithm, it takes more than 15 minutes to export the precursor ions of 9000 spectra, which is difficult to meet the requirements of real-time data analysis
Moreover, the recall rate of the existing precursor ion detection algorithm also needs to be further improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tandem mass spectrometry data precursor ion detection model training method and precursor ion detection method
  • Tandem mass spectrometry data precursor ion detection model training method and precursor ion detection method
  • Tandem mass spectrometry data precursor ion detection model training method and precursor ion detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] According to an embodiment of the present invention, a method for training a parent ion detection model of tandem mass spectrometry data is provided. In this embodiment, the multivariate adaptive regression spline (MARSpline: Multivariate Adaptive Regression Spline, referred to herein as MARS) classification model is used as the basic model, and the training is carried out based on the 11-dimensional feature vector of the secondary spectrum-precursor ion combination, and the series Mass spectrometry precursor ion detection model, and then realize fast and sensitive precursor ion detection.

[0039] figure 1 The method for training the tandem mass spectrometry precursor ion detection model of this embodiment is shown. Include the following steps:

[0040] Step 1: Obtain the parent ion annotation spectrum dataset. The spectral data set contains a large number of secondary spectra and primary spectra, and the precursor ion of each secondary spectrum has been marked, tha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for training a tandem mass spectrometry parent ion detection model, comprising the following steps: 1) obtaining a known spectrum data set of the parent ion, and for each secondary spectrum, determining the candidate precursor ion of the secondary spectrum ; 2) Extract the eigenvector of each secondary spectrogram-candidate precursor ion combination, and carry out corresponding assignment according to whether the secondary spectrogram and candidate precursor ion are paired correctly; Wherein, the element of feature vector comprises: isotopic peak cluster is similar degree, the spectral peak intensity ratio in the fragmentation window, the chromatographic similarity and the virtual chromatographic similarity; 3) the eigenvectors of all secondary spectra-candidate precursor ion combinations are used as input, and the secondary spectra and candidate precursor ions are paired The correct or incorrect assignment is used as an output to train the MARS model and obtain a tandem mass spectrometry precursor ion detection model. The invention also provides a corresponding parent ion detection method. The invention can improve the recall rate of precursor ions and improve the detection speed of precursor ions.

Description

technical field [0001] The invention relates to the technical field of bioinformatics, in particular, the invention relates to a method for training a parent ion detection model of tandem mass spectrometry data in proteomics and a method for detecting precursor ions. Background technique [0002] The shotgun method is one of the important methods for identifying proteins in bioinformatics. It cuts proteins in biological samples into peptides, and then sends them to the tandem mass spectrometer to obtain the corresponding mass spectrometry data sets, and then searches through the tandem mass spectrometry database. Algorithms, such as SEQUEST, Mascot, pFind, etc., can identify peptides from tandem mass spectrometry data, and further infer peptides to proteins to obtain protein information in the sample. [0003] The generation of peptide biological samples to mass spectrometry data is divided into two stages: in the first stage, peptides enter the mass spectrometer in batches,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G01N30/88G01N30/86G06F19/00
Inventor 邬龙曾文锋袁作飞刘超孟佳明贺思敏
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More