
Detection and analysis method of ultra-low frequency mutation sites based on duplex-seq

A method for detecting and analyzing mutation sites, applied in the field of second-generation high-throughput sequencing. It addresses the problems that the annotation process and related statistics are not systematic enough, that data are not effectively presented, and that no systematic analysis has been reported, so as to increase diversity, increase quantity, and improve analysis efficiency.

Active Publication Date: 2019-05-31
SHANGHAI PASSION BIOTECHNOLOGY CO LTD

AI Technical Summary

Problems solved by technology

[0005] 1. Data quality control: Existing Duplex-seq analysis workflows do not systematically assess upstream data quality, such as the data duplication rate, the types, counts and proportions of UMIs, and the R1/R2 balance.
[0006] 2. Differential analysis of UMIs: A comprehensive, separate analysis of single-strand-specific UMIs and double-strand complementary UMIs has not been reported in existing Duplex-seq data analysis methods.
[0007] 3. Variant site annotation process: The mutation sites called from Duplex-seq data are low-frequency mutation sites, so the parameters that existing methods use for variant detection can be optimized. The annotation process and related statistics are also not systematic enough.
[0008] 4. Readability of results: Existing Duplex-seq data analysis methods output only a few simple chart files and text reports, and much of the data is not effectively presented.
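
The quality-control quantities named in item 1 can be illustrated with a short sketch. This is not the patent's implementation: the function name `umi_qc_summary`, the input layout (a list of `(umi, mate, sequence)` tuples) and the definition of duplication rate as one minus the fraction of distinct sequences are all assumptions made for illustration.

```python
from collections import Counter

def umi_qc_summary(reads):
    """Summarise basic QC metrics for UMI-tagged reads.

    `reads` is a list of (umi, mate, sequence) tuples, where mate is 1 (R1)
    or 2 (R2). The layout is hypothetical, chosen to keep the sketch small.
    """
    total = len(reads)
    if total == 0:
        raise ValueError("no reads supplied")
    umi_counts = Counter(umi for umi, _, _ in reads)
    mate_counts = Counter(mate for _, mate, _ in reads)
    # Assumed definition: reads with identical mate and sequence are duplicates.
    distinct = len({(mate, seq) for _, mate, seq in reads})
    r2 = mate_counts[2]
    return {
        "total_reads": total,
        "duplication_rate": 1 - distinct / total,
        "umi_types": len(umi_counts),
        "umi_fraction": {u: n / total for u, n in umi_counts.items()},
        "r1_r2_ratio": mate_counts[1] / r2 if r2 else float("inf"),
    }
```

A ratio of R1 to R2 reads far from 1 would suggest that one mate was lost disproportionately during filtering, which is one way to read the "R1R2 balance" mentioned above.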



Examples


Embodiment Construction

[0074] To achieve the object of the present invention, as shown in Figure 1 and Figure 2, the duplex-seq-based ultra-low frequency mutation site detection and analysis method of the present invention comprises the following steps:

[0075] 1) Evaluate the quality of the raw sequencing data and reduce data noise, providing effective data for subsequent analysis;

[0076] 2) Extract the random barcode into the header line of each sequence in the sequence file, so that the barcode can be retrieved quickly later and consensus sequences can be created;

[0077] 3) Create consensus sequences based on the family barcode and the duplex barcode, excluding mutations introduced during library construction or PCR;

[0078] 4) Construct a double-strand consensus sequence according to the duplex-tag, further excluding asymmetric mutation sites in the sequence;

[0079] 5) Perform local quality correction on the aligned data, and detect low-frequency variant sites; ann...
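
Steps 2) and 3) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented pipeline: the barcode is assumed to be the first `UMI_LEN` bases of each read, the `|` separator in the read name is an arbitrary choice, reads are grouped by barcode only (a real workflow would also group by alignment position), and the function names and thresholds are hypothetical.

```python
from collections import Counter, defaultdict

UMI_LEN = 4  # assumed barcode length; real libraries vary

def move_umi_to_header(name, seq, qual, umi_len=UMI_LEN):
    """Trim the leading barcode from a read and append it to the read name."""
    umi, rest = seq[:umi_len], seq[umi_len:]
    return f"{name}|{umi}", rest, qual[umi_len:]

def single_strand_consensus(reads, min_family_size=3, min_agreement=0.7):
    """Call one consensus sequence per barcode family.

    `reads` is a list of (name, seq) pairs whose names end in "|<barcode>".
    Positions without a clear majority base are reported as 'N'.
    """
    families = defaultdict(list)
    for name, seq in reads:
        families[name.rsplit("|", 1)[1]].append(seq)
    consensus = {}
    for umi, seqs in families.items():
        if len(seqs) < min_family_size:
            continue  # too few copies to out-vote a PCR or sequencing error
        length = min(len(s) for s in seqs)
        bases = []
        for i in range(length):
            base, count = Counter(s[i] for s in seqs).most_common(1)[0]
            bases.append(base if count / len(seqs) >= min_agreement else "N")
        consensus[umi] = "".join(bases)
    return consensus
```

Because every read in a family descends from the same original molecule, a base seen in only a minority of the family is more likely a library-construction or PCR artifact than a true variant, which is why the majority vote removes it.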


Abstract

The present invention discloses a duplex-seq-based ultra-low frequency mutation site detection and analysis method comprising the following steps: 1) evaluate the quality of the raw sequencing data and reduce data noise, providing effective data for subsequent analysis; 2) extract the random barcode into the header line of each sequence in the sequence file, so that the barcode can be retrieved quickly later and consensus sequences can be created; 3) create consensus sequences based on the family barcode and the duplex barcode, excluding mutations introduced during library construction or PCR; 4) construct a double-strand consensus sequence according to the duplex-tag, further excluding asymmetric mutation sites in the sequence; 5) perform local quality correction on the aligned data, detect low-frequency mutation sites, and annotate them at three levels: gene structure, function, and clinical phenotype; 6) compile statistics on the numbers of SSCS and DCS sequences, the alignment results, and the variant site information, and output visual charts.
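
Step 4), building a double-strand consensus sequence (DCS) from single-strand consensus sequences (SSCS), can be sketched as below. The source does not spell out its tag format, so the convention that the two strands of one molecule carry tags with their halves swapped is an assumption borrowed from common Duplex Sequencing tag layouts. The function names are hypothetical, and both SSCS are assumed to be already in reference orientation.

```python
def partner_tag(tag):
    """Return the tag expected on the opposite strand (two halves swapped)."""
    half = len(tag) // 2
    return tag[half:] + tag[:half]

def duplex_consensus(sscs):
    """Combine SSCS pairs with complementary tags into DCS.

    `sscs` maps tag -> consensus sequence, all in reference orientation.
    A base is kept only where both strands agree; otherwise it becomes 'N'.
    """
    dcs = {}
    for tag, seq in sscs.items():
        mate = partner_tag(tag)
        key = min(tag, mate)
        if mate not in sscs or mate == tag or key in dcs:
            continue  # unpaired, self-paired, or already processed
        other = sscs[mate]
        dcs[key] = "".join(a if a == b else "N" for a, b in zip(seq, other))
    return dcs
```

A change present on only one strand, the "asymmetric mutation site" of step 4), is masked here rather than reported, since a true variant should appear in both strands of the original molecule.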

Description

Technical field

[0001] The invention relates to a bioinformatics data processing method, in particular to a duplex-seq-based ultra-low frequency mutation site detection and analysis method. It is mainly used in the field of second-generation high-throughput sequencing to detect and analyze ultra-low frequency mutation sites in ctDNA, based on duplex-seq whole exome sequencing.

Background

[0002] Next-generation sequencing technology is developing rapidly and is profoundly changing traditional genetics research, helping to give rise to precision medicine. Compared with traditional experimental techniques, it can detect thousands of genetic mutations in a single run. Its drawback is that it still has a relatively high error rate (0.1-1%). For the detection of high-frequency genetic mutations, this error is accepta...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G16B20/20
CPC: G16B30/00
Inventor: 刘港飚, 朱月艳, 孙子奎
Owner: SHANGHAI PASSION BIOTECHNOLOGY CO LTD