Cancer disease gene characteristic selection method based on historical data

A technology of genetic characteristics and historical data, applied in the fields of instruments, character and pattern recognition, biostatistics, etc., can solve the problems of poor practicability, high time complexity, and many redundant features, so as to improve the accuracy of prediction, The effect of reducing data dimensionality

Active Publication Date: 2019-07-30
ANHUI UNIVERSITY
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing methods often have problems such as ineffective dimension reduction, too many redundant features, high time complexity, and poor practicability.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cancer disease gene characteristic selection method based on historical data
  • Cancer disease gene characteristic selection method based on historical data
  • Cancer disease gene characteristic selection method based on historical data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0039] Such as figure 1 As shown, this embodiment provides a method for selecting gene features of cancer diseases based on historical data, including the following steps:

[0040] Step A: divide the cancer disease gene data into a training data set and a test data set; specifically, divide the cancer disease gene data into ten parts, 7 parts are used as training data sets, and 3 parts are used as test data sets.

[0041] Step B: Calculate the total average error rate after all the features on the training data set are selected by using the 50-fold crossover, specifically: divide the training data set into five parts on average, select one as the test set, and the other four as the training set get error rate where CE ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cancer disease gene characteristic selection method based on historical data. The cancer disease gene characteristic selection method based on historical data includes the following steps: A, dividing cancer disease gene data into a training set and a test set; b, calculating a total average error rate after all characteristics on the training set are selected; C, generating an initial population, and constructing a fitness function; D, recording all characteristic selection schemes into a characteristic tree, adjusting the distribution of the characteristic selectionschemes, taking the characteristic selection scheme with the minimum fitness value as the optimal characteristic selection scheme, and returning a result to a genetic operator and a guide search operator; E, guiding the evolutionary direction of the characteristic population; and F, judging a termination condition, if the termination condition is not met, repeating the steps D-F, and if the termination condition is met, outputting an optimal solution. The cancer disease gene characteristic selection method based on historical data has the advantages that the data dimension can be effectivelyreduced; the prediction accuracy is improved; the related genes of diseases such as cancer are screened through the characteristic tree in combination with the genetic algorithm; and assistance is provided for diagnosis and treatment.

Description

technical field [0001] The invention relates to the technical field of disease-causing gene screening, in particular to a method for selecting gene features of cancer diseases based on historical data. Background technique [0002] As the most common malignant tumor in human beings, cancer seriously affects people's physical and mental health, and has attracted great attention from experts and scholars in different fields. Because cancer patients will generate a large amount of clinical data in the process of examination, treatment and medication. These data are of great significance for predicting the occurrence of malignant tumors and the development of diseases. However, these data often have the characteristics of high dimensionality, diversity, data chaos and small samples. [0003] With the rapid development of computer technology, people think of using computers to process these complex data. Through the establishment of relevant machine learning prediction models,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B40/20G06K9/62
CPCG16B40/20G06F18/2111Y02A90/10
Inventor 邱剑锋郭能张兴义苏延森
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products