Methods and compositions for reducing redundant molecular barcodes created in primer extension reactions

a primer extension and barcode technology, applied in the field of nucleotide sequence amplification, can solve the problems of reducing affecting the amplification efficiency of primer extension reactions, so as to reduce the number of molecular barcodes

Active Publication Date: 2018-10-16
PARAGON GENOMICS INC
View PDF31 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0010]Described herein are methods, compositions, systems and kits for reducing redundant molecular barcodes produced in a template-dependent multiplex primer extension reaction. These methods and apparatuses (e.g., kits) for performing them may include one or more enzymes to remove single-stranded DNA regions containing unmatched base pairs on the amplified DNA targets after three cycles of a primer extension reaction, thereby to remove redundant molecular barcodes. The end molecular barcodes may not be directly traced back to the original individual molecules, however, the end sequences may be grouped into a pool of sense strands and a pool of antisense strands, upon sorting the molecular barcodes at the 5′ and 3′ ends of the amplified DNA targets. By confirming variant frequencies in these two pools, this method can further eliminate random errors that occur on either strand. This method may be incorporated into a three-hour process. The simplicity of the method enables high efficiency and detection of rare mutations from low amount of DNA samples.
[0016]There are eight offspring molecules created at cycle three. After cleaving the heteroduplex region and the terminal single-stranded regions, or only the terminal single-stranded regions, only one or two molecules go into the secondary amplification, respectively, thus producing one or four uniquely barcoded offspring, respectively, per original target after the secondary amplification. While reducing the number of molecular barcodes (or their combinations if molecular barcode cassette is used in both primers) to 1 or 4 per original target, this method reduces the required total sequencing reads by 8 to 2-fold. This means lower cost of sequencing, or more DNA targets can be analyzed on a specified size of flow cell.
[0019]In any of the methods described herein, the method may also include removing the cleaved single-stranded DNA and the degraded non-specific amplification products, leaving the substantial proportion of said plurality of target-specific amplification products with reduced number of molecular barcodes.

Problems solved by technology

One of major problems of high sensitivity sequencing is the presence of a vast number of random errors.
These random errors are produced in PCR during sample prep, in hybridization capture through chemical modification of bases, or in the sequencing step by the error of the DNA polymerase.
Usually, hundreds to thousands of random errors occur at 0.1-0.2% frequency and lower, making it impossible to find low-frequency de novo variants.
These methods take several hours to two days of work time and numerous reagents and equipment.
They demand up to several hundreds of nanograms of DNA, and have low efficiency to detect rare mutations.
It seems impossible to use PCR-based methods to add identical molecular barcodes onto the double strands of the same targets for duplex sequencing.
The key problem is that an original target molecule is amplified into multiple molecules, each with a different molecular barcode.
These redundant barcodes make it impossible to trace themselves to the sense and antisense strand of the original double stranded DNA.
These methods cannot support duplex sequencing due to that fact that either only one strand of the target DNA was labeled with molecular barcode, or redundant barcodes were produced from one original DNA molecule.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods and compositions for reducing redundant molecular barcodes created in primer extension reactions
  • Methods and compositions for reducing redundant molecular barcodes created in primer extension reactions
  • Methods and compositions for reducing redundant molecular barcodes created in primer extension reactions

Examples

Experimental program
Comparison scheme
Effect test

example 1

Dual Molecular Barcodes

[0101]To demonstrate the feasibility of our method of removing redundant molecular barcodes from a primer extension reaction involving molecular barcodes on BOTH primers (depicted in FIG. 1), and the consequent removing of random errors and variant calling from both strands of target DNA (FIG. 3), the effect of single-stranded DNA specific exonuclease on the yield of DNA libraries was tested, and the subsequent variant calling after sequencing the libraries.

[0102]To enable a library to be sequenced in Illumina sequencing machines, the library structure was designed as shown in FIG. 6A. DNA targets were amplified with a plurality of pairs of primers (also referred as primer panel or panel) in the multiplex PCR reaction for three cycles. After removing redundant barcodes, the products of multiplex PCR were further amplified in the second round of PCR with a pair of primers to produce the library. A primer panel containing 40 pairs of primers (40 plex) was used. ...

example 2

Single Molecular Barcode

[0110]In this example, the feasibility of the method to remove redundant molecular barcodes from a primer extension reaction involving molecular barcode on ONE primer was demonstrated, as depicted in FIGS. 4 and 5. A primer panel containing 205 pairs of primers were used to amplify corresponding DNA regions in 0.2% NA12878 / NA18507. Each primer of these 205 pairs of primers contained a 3′ end target-specific nucleotide sequence and a 5′ fixed nucleotide sequences. The 5′ fixed nucleotide sequences were complimentary to the pair of primers used in the secondary PCR. Each forward primer additionally contained a 10-nucleotide random sequence, serving as molecular barcode, between the 3′ end target-specific sequence and the 5′ end fixed nucleotide sequence. 100 ng of 0.2% NA12878 / NA18507 and 25 nM each of the 205 pairs of primers were used in a multiplex PCR reaction. The reagents and the method of multiplex PCR were provided by Paragon Genomics Inc. (CleanPlex™ T...

example 3

y of Amplification

[0114]Uniformity is a measure of how well every amplicon in a library is equally amplified in a multiplex PCR reaction. In order words, it measures the difference of copy numbers of amplicons between the under-amplified amplicons and the over-amplified amplicons. We found that the 40-plex panel and 205-plex panel used in Example 1 and 2 did not properly measure the uniformity. These two panels easily generated highly uniform libraries (FIG. 18, table 1). The number of amplicons defined by these panel were not sufficiently large to accommodate various kinds of easy and difficult-to-amplify amplicons. We then used a panel of 629 pairs of primer and made libraries with the reagent used above. We sequenced these libraries at mean reads around 700. The uniformity measured at 0.2× mean reads was larger than 99%, while uniformity at 0.5× mean reads was higher than 87%. Furthermore, amplicons with GC content from over 20% to 80% were equally amplified. Three amplicons cove...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

PropertyMeasurementUnit
work timeaaaaaaaaaa
temperatureaaaaaaaaaa
temperatureaaaaaaaaaa
Login to view more

Abstract

Methods, compositions, systems and kits to introduce a controlled-number of molecular barcodes onto DNA fragments by reducing redundant molecular barcodes formed in template-dependent primer extension reactions, and to find variant frequencies from both strands of DNA. The methods, compositions, systems and kits described herein may include, or include the use of, one or more single-stranded DNA specific nucleases that cleave the single-stranded regions containing unmatched base pairs in amplification products.

Description

CROSS REFERENCE TO RELATED APPLICATIONS[0001]This patent application claims priority to U.S. Provisional Patent Application No. 62 / 444,704, filed Jan. 10, 2017 (titled “METHODS AND COMPOSITIONS FOR REDUCING REDUNDANT MOLECULAR BARCODES CREATED IN PRIMER EXTENSION REACTIONS”), and herein incorporated by reference in its entirety.[0002]This patent application may also be related to U.S. patent application Ser. No. 15 / 290,981, filed on Oct. 11, 2016, which claims priority as a continuation-in-part to U.S. patent application Ser. No. 15 / 041,644, filed on Feb. 11, 2016, now U.S. Pat. No. 9,464,318, and also titled “METHODS AND COMPOSITIONS FOR REDUCING NON-SPECIFIC AMPLIFICATION PRODUCTS”, which claims priority to U.S. provisional patent applications: U.S. Provisional Patent Application No. 62 / 114,788, titled “A METHOD FOR ELIMINATING NONSPECIFIC AMPLIFICATION PRODUCTS IN MULTIPLEX PCR” and filed on Feb. 11, 2015; and U.S. Provisional Patent Application No. 62 / 150,600, titled “METHODS AN...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(United States)
IPC IPC(8): C12Q1/68C12N15/10C12Q1/686C12Q1/6806C12Q1/6848C12Q1/6874
CPCC12Q1/6874C12N15/1065C12N15/1068C12Q1/686C12Q1/6806C12Q1/6848C12Q2521/325C12Q2525/191C12Q2537/143C12Q2537/159C12Q2563/179
Inventor LIU, ZHITONGLIU, GUOYINGLIU, JEFFREY JUEHUICHEN, TAO
Owner PARAGON GENOMICS INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products