Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cancer gene expression profile data identification method based on integration of extreme learning machines

An extreme learning machine, tumor gene technology, applied in the field of computer analysis of tumor gene expression profiling data, which can solve the problems of high noise and high variation data analysis of gene expression profiling data, and few samples of gene expression profiling data.

Inactive Publication Date: 2015-03-25
JIANGSU UNIV OF SCI & TECH
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, gene expression profile data has fewer samples, which contradicts the huge number of genes that constitute an imbalance
Third, gene expression profile data inherently have data analysis difficulties such as high noise and high variation
However, these integrated ELMs simply integrate multiple ELMs, and the selection of member ELMs and the design of integration rules are too simple, so there is still a lot of room for improvement in their performance
So far, there is no corresponding report on the integration of ELM for gene expression profile data for tumor identification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cancer gene expression profile data identification method based on integration of extreme learning machines
  • Cancer gene expression profile data identification method based on integration of extreme learning machines
  • Cancer gene expression profile data identification method based on integration of extreme learning machines

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] A method for identifying tumor gene expression profile data based on an integrated extreme learning machine, including the selection of member ELMs and the integration steps between member ELMs, the ELM (Extreme learning machine) in the present invention is an extreme learning machine, and the present invention specifically includes the following steps :

[0050] Step 1: Preprocessing of tumor gene expression profile data sets, including gene selection and normalization of tumor expression profile data;

[0051] Step 2: Generate N sample sets according to a certain proportion through the Bagging method for the data set obtained in step 1, and then generate N training sets and verification sets according to a certain proportion of these N sample sets;

[0052] Step 3: Learn to generate N extreme learning machines on the N training sets in step 2, and select the highest L ELMs (L

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cancer gene expression profile data identification method based on integration of extreme learning machines. The method includes the steps of selection and integration of member extreme learning machines (ELMs). The method concretely includes the steps that preprocessing is carried out on a cancer gene expression profile data set, wherein the preprocessing includes gene selection and normalization of expression profile data; N sample sets are generated through a Bagging method, and each sample set is divided into a training set and a verification set according to a certain proportion; N ELMs are generated on the N training sets in a learning mode, and L ELMs (L&1t; N) with the highest recognition rate on the corresponding verification sets are selected to form an alternative member ELM base; K member ELMs (K&1t; L) forming an integrated system are selected from L ELMs based on the particle swarm optimization algorithm; the integrated vote weight of K member ELMs is worked out by utilizing the minimum-norm least square method; an integrated ELM system is obtained, and the integrated ELM system is used for performing tumor recognition on a newly increased cancer gene expression profile sample. Through the method, the cancer gene expression profile data can be quickly and accurately recognized.

Description

technical field [0001] The invention belongs to the application field of computer analysis technology of tumor gene expression profile data, and in particular relates to a method for identifying tumor gene expression profile data based on an integrated extreme learning machine. Background technique [0002] In life science research, while DNA microarray technology brings unprecedented opportunities for biological and medical research, the complex gene expression profile data it generates poses great challenges to existing data analysis and processing methods. First, gene expression profile data has very high dimensions (genes), and there are very complex relationships between these gene dimensions. Second, gene expression profile data have fewer samples, which contradicts the huge number of genes that constitute an imbalance. Third, gene expression profile data inherently have data analysis difficulties such as high noise and high variation. Fourth, a large amount of usefu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/66G06F19/24
CPCG16B40/00G06V30/194
Inventor 凌青华韩飞叶松林杨春崔宝祥
Owner JIANGSU UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products