Plasma sample cancer early screening method based on ensemble learning

An integrated learning and cancer technology, applied in the field of early cancer screening of plasma samples based on integrated learning, can solve the problems of high cost, poor tolerance of detection methods, inability to effectively detect early cancer, etc., to achieve improved stability, stable prediction accuracy, The effect of optimizing model performance

Inactive Publication Date: 2021-11-05
哈尔滨智吾康科技有限公司
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It can be seen that the current methods for diagnosing cancer have great limitations, and there are still def

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Plasma sample cancer early screening method based on ensemble learning
  • Plasma sample cancer early screening method based on ensemble learning
  • Plasma sample cancer early screening method based on ensemble learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] Source of original data: Circulating Tumor DNA (ctDNA) is one of the important detection objects of liquid biopsy. ctDNA methylation levels and associated DNA mutations (SNVs, INDELS, and copy number mutations) are important sources of liquid biopsy characteristics. Due to the low content of ctDNA released by early malignant tumors, improving the signal-to-noise ratio of DNA mutation detection and methylation monitoring is a key problem to be solved. In cell-free DNA (Cell-Free DNA, cfDNA), in addition to ctDNA, there are also a large number of DNA fragments from the hematopoietic system, which is the main source of noise data affecting ctDNA analysis.

[0027] 1. Data cleaning

[0028] In the present invention, the ctDNA in the plasma is used as a marker, the methylation level is used as a detection measurement value, and considering the influence of sample noise, the bias of sample distribution and other factors, a method for cleaning tumor marker data based on noise...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a plasma sample cancer early screening method based on ensemble learning, and belongs to the field of cancer early screening. The cancer early screening method comprises the following steps: 1, taking data obtained by performing characteristic value extraction on ctDNA mutation and methylation analysis data in plasma as a training set and a verification set, and then respectively importing the training set into a gradient boosting tree model and a classification model of a support vector machine; 2, fusing the gradient boosting tree model trained in the step 1 and the classification model of the support vector machine trained in the step 1 to obtain an integrated classification model; 3, importing the verification set in the step 1 into the integrated classification model in the step 3, and obtaining a classification result through a voting mechanism, namely, a cancer early screening result. The performance of the model is optimized under different training conditions, the adaptability to sample size, sample feature distribution and the like during model training is enhanced, the stability of the model is effectively improved, the reliability in practical application is ensured, and stable prediction precision is generated.

Description

technical field [0001] The invention belongs to the field of early cancer screening, in particular to a plasma sample early cancer screening method based on integrated learning. Background technique [0002] Early diagnosis of cancer is one of the most effective means to improve the survival rate of cancer patients. At present, early screening and clinical diagnosis of cancer mainly rely on CT, PET-CT, MRI and other imaging methods, as well as ultrasound, endoscopy, cytology detection, invasive tissue sample collection and pathological detection. The detection is mostly based on one or a combination of the above detection methods, such as CT and X-ray technology for high-risk groups of lung cancer. However, due to the small size of tumors in the early stage of cancer, CT identification requires the scanning area of ​​the diseased tissue to be larger than 1 cm, so CT, X-ray, and ultrasound examinations are difficult to identify early tumors. In addition, the accumulation of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16H50/20G16B40/20G06K9/62
CPCG16H50/20G16B40/20G06F18/2148G06F18/2411G06F18/24323G06F18/259G06F18/254C12Q1/6827
Inventor 逄龙赵玲玲
Owner 哈尔滨智吾康科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products