Rapid and ultrahigh-sensitivity DNA fusion gene detection method

A technology that combines genes and genes, applied in the field of bioinformatics analysis, can solve problems such as excessive false positive information, consumption of computing resources, and insufficient sensitivity, to achieve high sensitivity, reduce computing resource consumption, and shorten running time Effect

Active Publication Date: 2021-06-25
NANJING SIMCERE MEDICAL LAB CO LTD +3
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the above fusion detection methods have their own advantages and disadvantages.
Based on the method of sequence assembly, the detection effect depends on the quality of the sequence assembly, and it is easy to generate more false positive information, and the method of sequence assembly has the problem of consuming a large amount of computing resources
The method based on the read pair needs to estimate the size of the inserted fragment of the sequencing data and the difference in the distance between it and the read pair, which is likely to cause the problem of false negative and excessive false positive information, and this method can only give the approximate area of ​​fusion
Using split reads to determine the fusion breakpoint is a more accurate method, but in the specific implementation process of different detection methods, there are also problems of insufficient sensitivity and high false positive detection
[0006] In addition, existing fusion discovery methods often perform well in simulated data, but overestimate breakpoints in real tumor genomes in clinical samples, almost all of which suffer from high false positive rates
At the same time, in the case of low proportion of tumor cells, the current detection method has defects in the detection sensitivity of low fusion frequency, and it is difficult to meet the sensitivity requirements of current clinical production, especially liquid biopsy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Rapid and ultrahigh-sensitivity DNA fusion gene detection method
  • Rapid and ultrahigh-sensitivity DNA fusion gene detection method
  • Rapid and ultrahigh-sensitivity DNA fusion gene detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0080] Embodiment 1 method is established

[0081] (1) Breakpoint search and preliminary screening

[0082] a. Paired breakpoint search: directly search for reads with soft clip areas in the BAM file, and obtain all paired breakpoint information at one time: I. Determine the breakpoint 1 through the main comparison position and cigar value of soft clipped reads Position and CN region; II. Determine the position and CN region of breakpoint 2 according to the comparison position and cigar value of the secondary comparison information (SA tag) of soft clipped read; III. Statistics have the same paired breakpoint and CN region The number of reads is the number of Supply_Support support for paired breakpoints.

[0083] b. Fusion Supply_Support support number correction: Correct the false positive fusion problem caused by the high support number caused by PCR duplicate, and record it as the dupcount support number.

[0084] c. Preliminary filtering of breakpoints: It is generally ...

Embodiment 2

[0117] Embodiment 2 tissue sample simulation test

[0118] The specific fusion information of the positive standard product in this embodiment is shown in Table 2 below. In this embodiment, a total of 10 cases of HD753 were used as gDNA standard products to simulate fresh tissue samples to test the detection effect of this method. DNA sequencing data were obtained for all samples using targeted capture, and the target capture range covered the fusion breakpoint position in the standard. The sequencing depth is 2500X.

[0119] Table 2 Standard product fusion gene information

[0120]

[0121] The sequencing data of 10 samples were compared and analyzed using the BWA (v0.7.17) MEM algorithm, and the "-Y" parameter was set to retain the softclip region sequence for subsequent fusion sequence correction and fusion event verification. The rest of the parameters use the default parameters of the BWA MEM algorithm. The reference sequence uses the human reference genome version...

Embodiment 3

[0127] Example 3 Liquid biopsy sample simulation test

[0128] Liquid biopsy requires higher and higher sensitivity for fusion detection, so this example uses HD786 cfDNA standard for testing. There are SLC34A2-ROS1, CCDC6-RET fusions with a frequency of about 5% in the HD786 standard. In order to obtain a lower frequency of fusion-positive standards, HD786 standards were mixed with fusion-negative samples in different proportions, and fusion-positive cfDNA samples with expected frequencies of 1% and 0.5% were simulated. In this embodiment, three groups are set, each group has 10 repeated samples, and the information is shown in Table 5. The experimental sequencing depth was 10000X. The specific analysis process is the same as in Example 1, and this method is used for fusion analysis. The fusion detection results are shown in Table 6, all detected by this method. And the running time is significantly shortened under the same thread (20 threads), as shown in Table 7. The f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

According to the DNA fusion gene detection method provided by the invention, on the premise that high-sensitivity detection is not influenced, the operation time is short, high detection specificity is achieved, and meanwhile, fusion gene breakpoints, frequencies and splicing information in DNA sequencing data can be accurately represented.

Description

technical field [0001] The invention relates to the field of bioinformatics analysis, in particular to a bioinformatics analysis method for rapid and ultra-high sensitivity DNA fusion gene detection. Background technique [0002] Genome structure variation usually refers to changes in genome structure longer than 1kbp, mainly including large fragment deletion (Deletion), insertion (Insertion), inversion (Inversion) and translocation (Translocation) [1] . Sequence splicing between different genes caused by genome structure variation is usually called fusion gene. [0003] As an important biomarker, gene fusion provides important information in the diagnosis, prognosis and treatment of tumors. For example, ALK, ROS1, RET gene fusions are often independent oncogenic factors in non-small cell lung cancer and are targets for targeted drug action [2,3] . In the past, the detection of tumor biomarkers was mainly performed on tumor tissue samples. Liquid biopsy is a revolutiona...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B20/20
CPCG16B20/20
Inventor 寻雪颖叶雷邓望龙任用李诗濛卜范峰丁然陆光华
Owner NANJING SIMCERE MEDICAL LAB CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products