Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Gene data processing method and gene data processing device

A genetic data and processing method technology, applied in the direction of electronic digital data processing, special data processing applications, instruments, etc., can solve the problem of large impact on classification accuracy, low accuracy of feature gene selection, and low coverage of sample genetic data selection, etc. problems, to reduce impact and improve accuracy

Inactive Publication Date: 2015-03-11
SHENZHEN INST OF ADVANCED TECH
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the current commonly used gene data processing methods, the coverage rate of sample gene data selection is low, the accuracy of feature gene selection is low, and the selection of test samples and training samples has a greater impact on classification accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gene data processing method and gene data processing device
  • Gene data processing method and gene data processing device
  • Gene data processing method and gene data processing device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0024] see figure 1 , is a schematic flowchart of the first embodiment of the genetic data processing method provided by the present invention. The genetic data processing method described in this embodiment includes steps:

[0025] S101. Receive genetic data of a sample characteristic type of a reference population, and divide the genetic data into test data and training data based on a cross-validation method.

[0026] In some feasible embodiments, the fea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a gene data processing method and a gene data processing device. The method comprises the following steps: receiving gene data of a specified feature type of a reference population; preprocessing the gene data to obtain standardized gene data; performing feature gene selection on the standardized gene data by a LASSO method to obtain feature gene data; upon a cross validation method, dividing a sample set of the feature gene data into test samples and training samples; injecting the training samples into a classifier to obtain a trained classifier; injecting the test samples into the trained classifier; performing feature classification on the test samples, and performing statistics on the classification accuracy of the classifier. According to the gene data processing method and the gene data processing device provided by the embodiment of the invention, the accuracy of the feature gene selection can be improved, and the influence of the selection of the test samples and the training samples on the classification accuracy is lowered.

Description

technical field [0001] The invention relates to the technical field of gene data processing, in particular to a gene data processing method and device. Background technique [0002] DNA microarray (gene chip) technology is a major technological breakthrough in the field of molecular biology, and is widely used in various fields of biology and medical research, such as large-scale DNA sequencing, disease diagnosis, gene regulation and interaction relationship mining, etc. . Least Absolute Shrinkage and Selection Operator (LASSO) is a feature selection method based on a paradigm, which is used to describe a class of constrained optimization problems. The basic idea is to minimize the sum of squared residuals under the constraint that the sum of the absolute values ​​of the regression coefficients is less than a constant, so that some regression coefficients that are strictly equal to 0 can be generated and an interpretable model can be obtained. [0003] The expression level...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F19/24
Inventor 周丰丰赵苗苗
Owner SHENZHEN INST OF ADVANCED TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products