Method and system for constructing disease risk prediction model based on sequencing and machine learning

A disease risk prediction model technology, applied in the field of biomedicine, that addresses the lack of good molecular markers for distinguishing early and advanced lesions and achieves high diagnostic sensitivity, high specificity, and high prediction accuracy.

Status: Pending. Publication date: 2021-06-18
QINGDAO MEDINTELL BIOMEDICAL CO LTD

AI Technical Summary

Problems solved by technology

At present, ulcers and cancer can largely be distinguished clinically by colonoscopy combined with biopsy and pathological diagnosis. However, there are no good molecular markers for identifying early and advanced lesions.



Examples


Embodiment 1

[0157] Embodiment 1 Construction of disease prediction model

[0158] 1. Get data

[0159] Collect the sequencing data for the sample.

[0160] 2. Processing of sequencing data

[0161] fastp software was used for adapter processing and quality control to obtain clean data.

[0162] 3. Sequence Alignment

[0163] Use ICGC software to align the clean data to the human reference genome (version GRCh38.d1.vd1) to obtain a BAM file.

[0164] 4. Construct expression matrix

[0165] Use htseq software together with the annotation file to quantify gene expression from the BAM files, and construct an M*N gene expression matrix from the gene IDs and the expression values of multiple samples. The value in the i-th row and j-th column of the matrix is the expression count of the i-th gene in the j-th sample, where 1≤i≤M and 1≤j≤N; M is the number of detected genes and N is the number of analyzed samples. Save the expression matrix a...
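The matrix layout described in [0165] can be illustrated with a minimal Python sketch. It is not the patent's implementation. It assumes each sample has a two-column, tab-separated count listing of the kind htseq-count writes (gene ID, count, followed by summary rows whose names start with `__`). The function names `parse_htseq_counts` and `build_expression_matrix` are hypothetical, and filling a gene missing from a sample with 0 is an assumption made only for this illustration.

```python
from typing import Dict, List, Tuple


def parse_htseq_counts(text: str) -> Dict[str, int]:
    """Parse a tab-separated 'gene_id<TAB>count' listing into a dict."""
    counts: Dict[str, int] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        fields = line.split("\t")
        gene_id, value = fields[0], fields[-1]
        # Summary rows such as __no_feature are not genes; skip them.
        if gene_id.startswith("__"):
            continue
        counts[gene_id] = int(value)
    return counts


def build_expression_matrix(
    per_sample: Dict[str, Dict[str, int]]
) -> Tuple[List[str], List[str], List[List[int]]]:
    """Return (genes, samples, matrix) with M gene rows and N sample columns.

    matrix[i][j] is the count of gene i in sample j.
    Genes absent from a sample are filled with 0 (an assumption).
    """
    samples = list(per_sample)
    genes = sorted({g for counts in per_sample.values() for g in counts})
    matrix = [[per_sample[s].get(g, 0) for s in samples] for g in genes]
    return genes, samples, matrix


if __name__ == "__main__":
    # Toy inputs with made-up gene IDs, for illustration only.
    s1 = parse_htseq_counts("G1\t5\nG2\t0\n__no_feature\t7\n")
    s2 = parse_htseq_counts("G1\t2\nG3\t9\n")
    genes, samples, matrix = build_expression_matrix({"s1": s1, "s2": s2})
    for gene, row in zip(genes, matrix):
        print(gene, row)
```

Rows correspond to genes and columns to samples, matching the i/j convention in the paragraph above.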

Embodiment 2

[0193] Embodiment 2 Construction of colorectal disease diagnosis model

[0194] 1. Data source and acquisition

[0195] All data used to build the colorectal cancer risk model were downloaded from the TCGA and NCBI-SRA databases: the expression files for colorectal cancer and adjacent non-tumor tissue came from TCGA, and the raw data for intestinal polyps came from NCBI-SRA. In total, 443 colorectal cancer samples, 31 intestinal polyp samples, and 72 normal samples were retrieved, giving 546 samples for further screening and quality control.
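The sample counts in [0195] can be tallied with a few lines of Python, which also shows how unevenly the three classes are represented. This is only an arithmetic sketch: the dictionary keys and the function name `class_fractions` are hypothetical, and the source does not state how, or whether, this imbalance is handled during modeling.

```python
from typing import Dict, Tuple

# Counts as stated in paragraph [0195]; the key names are illustrative.
SAMPLE_COUNTS: Dict[str, int] = {
    "colorectal_cancer": 443,
    "intestinal_polyp": 31,
    "normal": 72,
}


def class_fractions(counts: Dict[str, int]) -> Tuple[int, Dict[str, float]]:
    """Return the total sample count and each class's share of the total."""
    total = sum(counts.values())
    return total, {name: n / total for name, n in counts.items()}


if __name__ == "__main__":
    total, fractions = class_fractions(SAMPLE_COUNTS)
    print("total samples:", total)
    for name, share in fractions.items():
        print(f"{name}: {share:.3f}")
```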

[0196] 2. Raw data processing

[0197] Use fastp software for adapter processing and quality control to obtain clean data. The steps include:

[0198] a. Adapter processing

[0199] Use fastp's automatic adapter detection mode for paired-end sequences to process the adapters;

[0200] b. Data trimming and quality control

[0201] The minimum th...
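Step a above (adapter processing with paired-end auto-detection) might be invoked along the following lines. This is a hedged sketch, not the patent's actual command. It uses fastp's `--in1`/`--in2`/`--out1`/`--out2` options and the `--detect_adapter_for_pe` flag. The file names and the helper `fastp_command` are hypothetical, and the trimming and quality thresholds of step b are deliberately omitted because the source text is truncated at that point.

```python
import shlex
from typing import List


def fastp_command(r1_in: str, r2_in: str, r1_out: str, r2_out: str) -> List[str]:
    """Build a fastp argument list for paired-end adapter auto-detection.

    Only adapter handling is configured here; quality and length
    thresholds are left at fastp defaults because the source does not
    give them in the visible text.
    """
    return [
        "fastp",
        "--in1", r1_in,
        "--in2", r2_in,
        "--out1", r1_out,
        "--out2", r2_out,
        "--detect_adapter_for_pe",
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    cmd = fastp_command(
        "sample_R1.fastq.gz", "sample_R2.fastq.gz",
        "clean_R1.fastq.gz", "clean_R2.fastq.gz",
    )
    # Print the command; to run it, pass cmd to subprocess.run(cmd, check=True).
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids shell quoting problems if it is later passed to `subprocess.run`.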


Abstract

The invention relates to a method and a system for constructing a disease risk prediction model based on sequencing and machine learning. The system is embedded with a disease risk prediction model developed by utilizing machine learning, and the disease risk of a subject is judged according to the risk prediction model by receiving sequencing information from the subject.

Description

Technical field

[0001] The invention belongs to the field of biomedicine and relates to a method and system for constructing a disease risk prediction model based on sequencing and machine learning.

Background art

[0002] With the development of sequencing technology and falling costs, human genome sequencing is expected to become the mainstream trend in human health, with precision medicine as its ultimate goal. Accurately interpreting sequencing results has therefore become essential to realizing precision medicine.

[0003] Colorectal cancer (CRC) is the third most common cancer worldwide and the fourth most common cause of cancer-related death. Its onset is rapid, its prognosis is poor, and its incidence is increasing year by year. According to statistics, people with a positive family history of colorectal cancer and people over the age of 50 have a significantly increased risk of CRC, and patients with ...


Application Information

IPC(8): G16B40/20, G16H50/30, G06K9/62
CPC: G16B40/20, G16H50/30, G06F18/2135, G06F18/241
Inventors: 杨承刚, 李雨晨
Owner: QINGDAO MEDINTELL BIOMEDICAL CO LTD