Tumor key gene identification method based on prior information and parallel binary particle swarm optimization

A technology of particle swarm optimization and prior information, which is applied in the field of tumor key gene identification based on prior information and parallel binary particle swarm optimization, can solve the problem of few samples of gene expression profile data, no corresponding reports on tumor gene identification, gene expression Solve the problems of high noise and high variation data analysis of spectral data, so as to improve the effect of tumor identification

Inactive Publication Date: 2017-03-29
JIANGSU UNIV
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, gene expression profile data has fewer samples, which contradicts the huge number of genes that constitute an imbalance
Third, gene expression profile data inherently have data analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tumor key gene identification method based on prior information and parallel binary particle swarm optimization
  • Tumor key gene identification method based on prior information and parallel binary particle swarm optimization
  • Tumor key gene identification method based on prior information and parallel binary particle swarm optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] A tumor key gene identification method based on prior information and parallel binary particle swarm optimization, including K-means gene clustering based on particle swarm optimization (PSO), and using prior information and parallel binary particle swarm optimization (Binary Particle Swarm Optimization, BPSO) carries out the step of key gene identification, and the present invention specifically comprises the following steps:

[0053] Step 1. Preprocessing of tumor gene expression profile data, including normalization and preliminary dimensionality reduction of tumor gene expression profile data sets, and simultaneously dividing tumor gene expression profile data sets into training sets and test sets;

[0054] Step 2: On the training set, use the improved Elbow method to determine the optimal gene clustering number K through the self-defined criterion function;

[0055] Step 3: Use the PSO algorithm to select K optimal cluster centers, and use the K-means method on the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a tumor key gene identification method based on prior information and a parallel binary particle swarm optimization. The tumor key gene identification method comprises the steps of performing preprocessing on tumor gene expression profile data, on a training set, determining an optimal gene cluster number K through a user-defined criterion function by an improved Elbow method; preferably selecting K optimal cluster centers by the particle swarm optimization (PSO), and clustering the tumor genes into K classes on the training set by a K-mean value method; on the training set, obtaining gene to class sensitivity (GCS) information and gene regulation (GR) information separately; and taking the obtained K gene clusters as a searching spacing, by combining with the obtained two kinds of prior information, identifying the key tumor genes by adopting the parallel binary particle swarm optimization (BPSO). Compared with the existing tumor key gene identification method, the probability of losing key information genes related to classes of tumors is lowered by consideration of the two kinds of prior information in the tumor key gene identification method disclosed by the invention, so that subsequent tumor identification can be improved.

Description

technical field [0001] The invention belongs to the application field of computer analysis technology of tumor gene expression spectrum data, and specifically relates to a tumor key gene identification method based on prior information and parallel binary particle swarm algorithm. Background technique [0002] In life science research, while DNA microarray technology brings unprecedented opportunities for biological and medical research, the complex gene expression profile data it generates poses great challenges to existing data analysis and processing methods. First, gene expression profile data has very high dimensions (genes), and there are very complex relationships between these gene dimensions. Second, gene expression profile data have fewer samples, which contradicts the huge number of genes that constitute an imbalance. Third, gene expression profile data inherently have data analysis difficulties such as high noise and high variation. Fourth, a large amount of us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/18G06F19/24
CPCG16B20/00G16B40/00
Inventor 韩飞杨春凌青华崔宝祥宋余庆朱玉全周从华
Owner JIANGSU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products