Single-sample whole-genome allele specific copy number variation prediction method

An allele-specific and copy number variation technology, applied in the field of bioinformatics, can solve the problems of high sequencing costs, high false negatives, and increased costs, and achieve the effect of low sequencing depth and high detection accuracy

Active Publication Date: 2021-05-14
苏州吉因加医学检验有限公司
View PDF5 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the disadvantage is that high-depth sequencing is required, resulting in high sequencing costs; low-depth sequencing strategies can only accurately analyze LST (Large-sca

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Single-sample whole-genome allele specific copy number variation prediction method
  • Single-sample whole-genome allele specific copy number variation prediction method
  • Single-sample whole-genome allele specific copy number variation prediction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0136] In this embodiment, a tumor sample refers to a tumor sample, and a normal sample refers to a normal sample.

[0137] Such as figure 1 As shown, the steps of each module in this embodiment are as follows:

[0138] 1. Model building blocks

[0139]In this embodiment, 148 cases of paired tumor samples (that is, each tumor sample is paired with a normal sample from the same individual, the tumor sample is a tumor tissue sample, and the normal sample is a paracancerous tissue sample) high-depth whole-genome sequencing data (the sequencing depth is 30×). It covers healthy people (the tumors of healthy people are nodules or benign tumors) and the four major types of cancer (ovarian cancer, breast cancer, prostate cancer, bladder cancer). Data quality filtering (Q20>80%, N<5%); use BWA software to compare to the human reference genome hg19, evaluate the contamination rate of the sample, and remove samples with a high contamination rate (specifically remove samples with a com...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A single-sample whole-genome allele specific copy number variation prediction method comprises the following steps: analyzing and comparing sequencing data of a to-be-detected sample to a reference genome, and extracting tumor purity information, whole-genome replication information and total copy number variation information; performing classification processing according to the total copy number variation information of each segment of the chromosome, converting the total copy number variation information into allele specific copy number variation information, if the total copy number variation information of the chromosome segment is an odd number interval or a zero interval, directly calculating the allele specific copy number variation information, and if the total copy number variation information of the chromosome segment is an odd number interval or a zero interval, directly calculating the allele specific copy number variation information; and if the total copy number variation information of the chromosome segment is a non-zero even interval, obtaining allele specific copy number variation information of the interval through model prediction. According to the method, only a single sample is needed, a normal sample does not need to be paired, the sequencing depth of the needed to-be-detected sample is low, the detection accuracy is high, and the homologous recombination defect of the low-tumor-purity sample can be detected.

Description

technical field [0001] The invention relates to the field of bioinformatics, in particular to a method for predicting allele-specific copy number variation in a single-sample whole genome. Background technique [0002] In recent years, with the emergence and application of Poly ADP-ribose Polymerase inhibitors (Poly ADP-ribose Polymerase inhibitors, PARPi), major breakthroughs have been made in the maintenance treatment of patients with ovarian cancer and other cancers, BRCA mutations and homologous recombination defects (Homologous Recombination The guiding role of Deficiency (HRD) status as a marker is increasingly prominent in clinical practice. Clinically, the beneficiary population of PARP inhibitors has expanded from BRCA mutation patients to HRD positive populations, which means that more cancer patients have the opportunity to benefit from the treatment of PARP inhibitors. At the same time, the applicable cancer types of the drug have also expanded from ovarian canc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B20/20G16B20/30G16B30/10G16B40/00
CPCG16B20/20G16B20/30G16B30/10G16B40/00
Inventor 黄毅陈海新刘久成吴玲清刘青峰
Owner 苏州吉因加医学检验有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products