Classification method of differential proteomics

A technology of proteomics and classification methods, applied in special data processing applications, instruments, electrical digital data processing, etc., to achieve high classification accuracy and robustness

A technology of proteomics and classification methods, applied in special data processing applications, instruments, electrical digital data processing, etc., to achieve high classification accuracy and robustness

CN101901345AInactive Publication Date: 2010-12-01FUDAN UNIV

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification method of differential proteomics
  • Classification method of differential proteomics
  • Classification method of differential proteomics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] 1) Using two sets of differential proteomics public datasets widely used in the world as research materials

[0032] The first set of samples came from the National Cancer Institute (NCI). The data were divided into ovarian cancer samples and normal samples. The data were generated by the SELDI-TOF-MS analysis method, including 162 ovarian cancer samples and 91 normal samples. Dataset address: http: / / home.ccr.cancer.gov / ncifdaproteomics / ppatterns.asp. The second group of samples comes from the Keck laboratory of Yale University in the United States, and is divided into 93 ovarian cancer samples and 77 normal samples, which are produced by Micromass MALDI-L / R. In the present invention, the linear mode (Linear Mode) m / Z value is selected at Data sets ranging from 3450 to 28000Da were analyzed. Dataset address: http: / / bioinformatics.med.yale.edu / MSDATA2. In order to observe the effect of classifying the noise data (control) that is randomly grouped into samples, the clas...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of proteomics classification, relating to a classification method of differential proteomics. The method comprises: selecting characteristics by univariate statistics analysis, sequential feature selection and a genetic algorithm; extracting characteristics by main ingredient analysis and a partial least squares method; connecting and integrating with linear discriminant analysis, a k-nearest neighbor classifier, a support vector machine, a decision tree, a naive Bayes classifier and an artificial neural network in series to obtain serial integrated classifiers which are connected in parallel and combined; endowing each base classifier with a weighting coefficient according to classification accuracy rate; and taking a fuzzy attribute value as a classification result output mode to obtain the classification judgment result of a target sample. The method has higher classification accuracy rate and robustness on classifying and judging abnormal and normal samples in the differential proteomics, and is suitable for classifying and analyzing the data of differential proteomics.

Description

Technical field: [0001] The invention belongs to the field of biotechnology and proteomics classification, and relates to a differential proteomics classification method. Background technique: [0002] An important direction in differential proteome research is to mine disease proteome data for bioinformatics and establish classification models to classify, judge and predict clinical samples. With the development of high-throughput research technologies, the scale of omics data output accumulation is growing rapidly. Therefore, it has become a general trend to use computer methods to automatically classify and judge clinical samples. The existing technology mainly includes the following steps in the computer classification and judgment of differential proteomes: after obtaining the original data, first perform data preprocessing, then perform feature space optimization on the training data set and select or extract feature variables, and then obtain the feature The data is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
01 Dec 2010
Publication
CN101901345A
IPC
G06K9/62; G06F19/00
Inventors
贺福初; 罗凯旋