A classifier model generation method for gene microarray data

A gene microarray and model generation technology, which is applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as high-noise cancer prediction, and achieve the effect of improving prediction efficiency and accuracy

Active Publication Date: 2017-05-03
西安电子科技大学重庆集成电路创新研究院
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to overcome the technical difficulties of accurate and efficient cancer prediction caused by the high dimensionality, high noise, high correlation and small samples of gene microarray data. Starting from mining hidden gene information and reducing the dimension of gene features, a new A cancer prediction method that combines independent component analysis and linear discriminant analysis. While reducing the dimension of gene features and removing redundant noise, it fully mines the hidden information between features and improves the accuracy and efficiency of cancer prediction.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A classifier model generation method for gene microarray data
  • A classifier model generation method for gene microarray data
  • A classifier model generation method for gene microarray data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0020] figure 1 It is a schematic diagram of gene chip production and analysis process. figure 2 It is a flowchart of the prediction method used in the embodiment of the present invention.

[0021] refer to figure 2 , the gene microarray data classifier model generation method in the embodiment of the present invention, the specific implementation steps are as follows:

[0022] (1) Preprocess gene microarray data with filter technology:

[0023] (1a) Filtering technology integrates four implementation strategies - student test analysis, entropy analysis, Chernoff bound analysis, unbiased statistical analysis, respectively as follows:

[0024] ①Student's test analysis: The assumption of the student's test analysis is that the two samples obey the normal distribution and have the same variance. Assume is from a normal population N(μ 1 , σ 2 ) samples, is from a normal population N(μ 2 , σ 2 ) samples, the two samples are independent, and the sample mean is The sa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an isolated component analysis and linear discriminant analysis combined cancer forecasting method which particularly includes the following steps: (1) pre-processing gene micro-array data with the filtering technology, (2) converting the pre-processed gene micro-array data with the isolated component analysis technology to obtain an isolated component set, (3) processing the isolated component set with the linear discriminant analysis technology, and projecting the gene micro-array data into low-dimensional space with the optimal separability, and (4) training a nearest neighbor classifier through the projected gene micro-array data to generate a classifier model. According to the isolated component analysis and linear discriminant analysis combined cancer forecasting method, the gene micro-array data are filtered and processed, implicit information of the gene micro-array data is mined through isolated component analysis, and the gene micro-array data are projected to the low-dimensional space with the optimal separability through linear discriminant analysis, the cancer forecasting accuracy is improved, and the cancer forecasting time is shortened.

Description

technical field [0001] The invention relates to a method for generating a classifier model of gene microarray data, which belongs to the cross technical field of machine learning and medical diagnosis. Background technique [0002] Gene microarray, also known as gene array or gene chip, is a special glass slide with gene microarray coating. Thousands or even tens of thousands of nucleic acid probes are installed on a chip with an area of ​​only a few square centimeters, and a large number of pre-designed complementary DNA or oligonucleotides are made into a dot array on the chip to match the homologous nucleic acid in the sample. Molecules are hybridized to obtain microarray data for gene sequence and gene expression information. [0003] Studies have found that there is a great correlation between the occurrence of cancer and genetic genes, and analyzing the genetic information contained in gene microarray data provides a new method for the prediction and diagnosis of canc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F19/00G06F19/20
Inventor 杨利英刘志敏李菲袁细国张军英黎成殷黎洋
Owner 西安电子科技大学重庆集成电路创新研究院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products