Genetic and environmental correlation-based colorectal cancer data model analysis method

A colorectal cancer and data model technology, applied in the field of bioinformatics, can solve the problem of not improving the accuracy of predicting colorectal cancer, and achieve the effect of good robustness and reliability, and improved accuracy

Active Publication Date: 2017-08-18
SOUTHWEST UNIVERSITY
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above research methods have certain limitations and have not improved the accuracy of predicting colorectal cancer

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Genetic and environmental correlation-based colorectal cancer data model analysis method
  • Genetic and environmental correlation-based colorectal cancer data model analysis method
  • Genetic and environmental correlation-based colorectal cancer data model analysis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be clear that the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0069] Those skilled in the art should know that the following specific embodiments or specific implementation methods are a series of optimized configurations listed by the present invention to further explain the specific content of the invention, and these configurations can be combined with each other or used in association with each other, unless it is clearly stated in the present invention that some or a specific embodiment or implementation cannot be associated with or used in conjunction with other embodiments or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a genetic and environmental correlation-based colorectal cancer data model analysis method. The method comprises the steps of receiving colorectal cancer (CRC) data of a specified feature type of a reference population; preprocessing the data to obtain standardized data; based on the standardized data, classifying the data; performing feature selection on each subclass by using a sparse principal component analysis method, an information entropy method and/or a Relief method; obtaining an intersection of three methods by using a Venn diagram, and obtaining features with remarkable difference by using a U test; and dividing a feature gene data sample set into test samples and training samples, obtaining a trained classifier according to the training samples, injecting the test samples into the trained classifier, performing feature classification on the test samples, and performing statistics on classification accuracy of the classifier. According to the method, the accuracy of extracting carcinogens can be improved and the classification accuracy can be improved.

Description

technical field [0001] The present invention relates to the technical field of bioinformatics, mainly relates to methods of biological data analysis and biological data mining, and specifically relates to the establishment of a robust colorectal cancer data model based on large genetic and environment-related colorectal cancer data, and based on the data model data analysis and mining. Background technique [0002] Colorectal cancer, including colon and rectal cancer, is a leading cause of cancer-related morbidity and mortality worldwide. In 2002, there were approximately 1,023,152 newly diagnosed cases of colorectal cancer, and 528,978 patients died of colorectal cancer. Colorectal cancer ranks fourth in the incidence and death spectrum of malignant tumors in men, and ranks fourth in malignant tumors in women. It ranks third in the incidence spectrum and fifth in the death spectrum. That is to say, one person is newly diagnosed with colorectal cancer every half minute, an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/00
CPCG16H50/70
Inventor 章乐郑纯秋李甜周紫垣陈霸东邢磊李婷婷
Owner SOUTHWEST UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products