Safe sequencing system

a sequencing system and sequencing technology, applied in the field of nucleic acid sequencing, can solve the problem that mass parallel sequencing cannot be used to detect rare variants, and achieve the effect of sensitive and accurate determination of nucleic acid features or sequences

Active Publication Date: 2014-08-14
THE JOHN HOPKINS UNIV SCHOOL OF MEDICINE
View PDF3 Cites 302 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0011]These and other embodiments which will be apparent to those of skill in the art upon reading the specification provide the art with tools and methods for sensitively and accurately determining nucleic acid features or sequences.

Problems solved by technology

However, massively parallel sequencing cannot generally be used to detect rare variants because of the high error rate associated with the sequencing process.
Some of these errors presumably result from mutations introduced during template preparation, during the pre-amplification steps required for library preparation and during further solid-phase amplification on the instrument itself.
Other errors are due to base mis-incorporation during sequencing and base-calling errors.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Safe sequencing system
  • Safe sequencing system
  • Safe sequencing system

Examples

Experimental program
Comparison scheme
Effect test

example 1

Endogenous UIDs

[0039]UIDs, sometimes called barcodes or indexes, can be assigned to nucleic acid fragments in many ways. These include the introduction of exogenous sequences through PCR (40, 41) or ligation (42, 43). Even more simply, randomly sheared genomic DNA inherently contains UIDs consisting of the sequences of the two ends of each sheared fragment (FIG. 2 and FIG. 5). Paired-end sequencing of these fragments yields UID-families that can be analyzed as described above. To employ such endogenous UIDs in Safe-SeqS, we used two separate approaches: one designed to evaluate many genes simultaneously and the other designed to evaluate a single gene fragment in depth (FIG. 2 and FIG. 5, respectively).

[0040]For the evaluation of multiple genes, we ligated standard Illumina sequencing adapters to the ends of sheared DNA fragments to produce a standard sequencing library, then captured genes of interest on a solid phase (44). In this experiment, a library made from the DNA of ˜15,000...

example 2

Exogenous UIDs

[0044]Though the results described above show that Safe-SeqS can increase the reliability of massively parallel sequencing, the number of different molecules that can be examined using endogenous UIDs is limited. For fragments sheared to an average size of 150 bp (range 125-175), 36 base paired-end sequencing can evaluate a maximum of ˜7,200 different molecules containing a specific mutation (2 reads×2 orientations×36 bases / read×50 base variation on either end of the fragment). In practice, the actual number of UIDs is smaller because the shearing process is not entirely random.

[0045]To make more efficient use of the original templates, we developed a Safe-SeqS strategy that employed a minimum number of enzymatic steps. This strategy also permitted the use of degraded or damaged DNA, such as found in clinical specimens or after bisulfite-treatment for the examination of cytosine methylation (45). As depicted in FIG. 3, this strategy employs two sets of PCR primers. The...

example 3

Analysis of DNA Polymerase Fidelity

[0046]Measurement of the error rates of DNA polymerases is essential for their characterization and dictates the situations in which these enzymes can be used. We chose to measure the error rate of Phusion polymerase, as this polymerase has one of the lowest reported error frequencies of any commercially available enzyme and therefore poses a particular challenge for an in vitro-based approach. We first amplified a single human DNA template molecule, comprising a segment of an arbitrarily chosen human gene, through 19 rounds of PCR. The PCR products from these amplifications, in their entirety, were used as templates for Safe-SeqS as described in FIG. 3. In seven independent experiments of this type, the number of UID-families identified by sequencing was 624,678±421,274, which is consistent with an amplification efficiency of 92±9.6% per round of PCR.

[0047]The error rate of Phusion polymerase, estimated through cloning of PCR products encoding β-g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The identification of mutations that are present in a small fraction of DNA templates is essential for progress in several areas of biomedical research. Though massively parallel sequencing instruments are in principle well-suited to this task, the error rates in such instruments are generally too high to allow confident identification of rare variants. We here describe an approach that can substantially increase the sensitivity of massively parallel sequencing instruments for this purpose. One example of this approach, called “Safe-SeqS” for (Safe-Sequencing System) includes (i) assignment of a unique identifier (UID) to each template molecule; (ii) amplification of each uniquely tagged template molecule to create UID-families; and (iii) redundant sequencing of the amplification products. PCR fragments with the same UID are truly mutant (“super-mutants”) if ≧95% of them contain the identical mutation. We illustrate the utility of this approach for determining the fidelity of a polymerase, the accuracy of oligonucleotides synthesized in vitro, and the prevalence of mutations in the nuclear and mitochondrial genomes of normal cells.

Description

[0001]This invention was made using support from the National Institutes of Health, grants CA62924, CA43460, and CA57345. Certain rights to the invention are retained by the U.S. government under the terms of the grant.TECHNICAL FIELD OF THE INVENTION[0002]This invention is related to the area of nucleic acid sequencing. In particular, it relates to manipulative and analytic steps for analyzing and verifying the products of low frequency events.BACKGROUND OF THE INVENTION[0003]Genetic mutations underlie many aspects of life and death—through evolution and disease, respectively. Accordingly, their measurement is critical to several fields of research. Luria and Delbrück's classic fluctuation analysis is a prototypic example of the insights into biological processes that can be gained simply by counting the number of mutations in carefully controlled experiments (1). Counting de novo mutations in humans, not present in their parents, have similarly led to new insights into the rate at...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): C12Q1/68
CPCC12Q2563/179C12Q1/6869C12Q1/6874C12Q2535/122C12Q2565/514C12Q1/6806C12Q2521/501C12Q2525/155C12Q2525/179C12Q2525/191C12Q1/6876C12Q2600/158
Inventor VOGELSTEIN, BERTKINZLER, KENNETH W.PAPADOPOULOS, NICKOLASKINDE, ISAAC
Owner THE JOHN HOPKINS UNIV SCHOOL OF MEDICINE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products