PHI29 enzyme variants and uses thereof

Phi29 enzyme variants with targeted amino acid substitutions and pegylation enhance processivity and stability, addressing the limitations of existing Phi29 polymerases by improving amplification efficiency and reducing off-target effects.

WO2026136385A1PCT designated stage Publication Date: 2026-06-25SINGULAR GENOMICS SYSTEMS INC

Patent Information

Authority / Receiving Office
WO · WO
Patent Type
Applications
Current Assignee / Owner
SINGULAR GENOMICS SYSTEMS INC
Filing Date
2025-12-16
Publication Date
2026-06-25

AI Technical Summary

Technical Problem

Existing efforts to improve the processivity, stability, and strand displacing activity of Phi29 DNA polymerase without sacrificing accuracy or efficiency have not been adequately addressed.

Method used

Development of Phi29 enzyme variants with specific amino acid substitutions that enhance processivity and stability while maintaining high accuracy, including pegylation to reduce off-target amplification.

Benefits of technology

The enzyme variants demonstrate improved amplification efficiency and reduced off-target amplification, enabling more reliable and precise nucleic acid amplification processes.

✦ Generated by Eureka AI based on patent content.

Smart Images

  • Figure US2025059881_25062026_PF_FP_ABST
    Figure US2025059881_25062026_PF_FP_ABST
Patent Text Reader

Abstract

Disclosed herein, inter alia, are mutant enzymes, kits, and methods of use thereof.
Need to check novelty before this filing date? Find Prior Art

Description

Atorney Docket No.: 051385-638001WOPHI29 ENZYME VARIANTS AND USES THEREOFCROSS-REFERENCES TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. Provisional Application No. 63 / 736,227, filed December 19, 2024 and U.S. Provisional Application No. 63 / 743,343, filed January 9, 2025, each of which is incorporated herein by reference in their entirety and for all purposes.SEQUENCE LISTING

[0002] The Sequence Listing written in file 00638001WO.xml, created December 15, 2025, 75,323 bytes, machine format IBM-PC, MS Windows operating system, is hereby incorporated by reference.BACKGROUND

[0003] DNA polymerases replicate the genomes of living organisms. DNA polymerases add nucleotide triphosphate (dNTP) residues to the 3 ’-end of the growing DNA chain, using a complementary DNA as template. One such DNA polymerase, Phi29 DNA polymerase, is a monomeric enzyme of about 66 kDa, and is the replicative polymerase from the Bacillus subtilis phage phi29 belonging to the eukaryotic-type family of DNA polymerases (family B). Referred to as proofreading, phi29 contains an exonuclease domain that catalyzes 3’— >5’ exonucleolysis of mismatched nucleotides preferentially on single-stranded DNA or RNA, thereby enhancing replication fidelity at least 100-fold. Additionally, wild-type phi29 DNA polymerase reliably binds to single stranded DNA, and performs DNA synthesis without processivity cofactors, accounting for the highest known processivity (>70 kb) among other DNA polymerases. Strong processivity, robust strand displacement activity, and high accuracy allow the enzyme to amplify whole genomes with minimal amplification bias compared to PCR based amplification methods. Therefore, it has been widely used for rapidly amplifying targets (e.g., multiple displacement amplification (MDA) or rolling cycle amplification (RCA)), and enabling point-of-care analyses and immunoassays. Efforts to improve the processivity, stability, and strand displacing activity of phi29, without sacrificing the accuracy or efficiency of phi29 enzymes remains a challenge. Disclosed herein, inter alia, are solutions to these and other problems in the art.BRIEF SUMMARYAtorney Docket No.: 051385-638001WO

[0004] In an aspect is provided a polymerase including one or more amino acid substitutions as described herein. In embodiments, the polymerase includes an amino acid sequence that is at least 50% identical to SEQ ID NO: 1 (e.g., 50% identical, 60% identical, 70% identical, 80% identical, 90% identical, or greater); and includes an amino acid substitution, wherein the amino acid substitution is as described herein.

[0005] In an aspect is provided a method of incorporating a nucleotide into a primed nucleic acid template (e.g., a primer hybridized to a template nucleic acid). In embodiments, the method includes combining in a reaction vessel: (i) a primer hybridized to a nucleic acid template, (ii) a nucleotide solution including a plurality of nucleotides, and (iii) a polymerase, wherein the polymerase is a polymerase as described herein.BRIEF DESCRIPTION OF THE DRAWINGS

[0006] FIG. 1 provides an alignment of seven sequences described herein. A portion of the entire amino acid sequence is shown, aligning the wild type polymerase (SEQ ID NO:1) to BSTP6 (SEQ ID NO: 12), MinWT (SEQ ID NO:2), BSTP4 (SEQ ID NO:8), Whitingl8 (SEQ ID NO: 17), Phage M2 (SEQ ID NO:20), and Beecentumtrevirus (SEQ ID NO:22). The alignment highlights a negative three (-3) frameshift in amino acid positions of MinWT, BSTP4, Whitingl8, Phage M2, and Beecentumtrevirus relative to wild type, wherein the amino acid positions are shifted by three positions. For example, the amino acid position 312 of the wild type sequence corresponds to the amino acid position 309 if BSTP4 (SEQ ID NO:8). Amino acid substitutions relative to the wild type sequence are shaded. Marking of these amino acids makes clear that their homologous position in related proteins is readily identified, both for the proteins aligned herein and for other proteins readily identified by one of skill in the art using standard sequence search capabilities such as a BLAST search, available at the NCBI website affiliated with the National Institutes of Health and the National Library of Medicine (ncbi.nlm.nih.gov).

[0007] FIGS. 2A-2B. Data testing enzyme amplification following an incubation time at 42°C in reaction buffer at pH 7.5, and rolling circle amplification (RCA) at 37°C. FIG. 2A provides a measurement of the amplification (as measured in relative fluorescent units, RFU) for MS-65, MS-238, MS-254, MS-262, and MS-296 as measured at different time points, 0 minutes, 5 minutes, 10 minutes, and 20 minutes. FIG. 2B provides a measurement of the amplification forAtorney Docket No.: 051385-638001WOMS-65, MS-238, MS-311 , and MS-312 as measured at different time points, 0 minutes, 5 minutes, 10 minutes, and 20 minutes.

[0008] FIG. 3. Data from an experiment testing enzyme amplification following an incubation time at 42°C in reaction buffer at pH 7.5, and RCA at 42°C.

[0009] FIGS. 4A-4C report on tonsil tissue transcript detection using MS-65 (FIG. 4A), MS- 65+PEG (FIG. 4B), and MS-65 in a buffer containing free PEG (FIG. 4C). The detected transcripts off tissue (i.e., indicated as dots on the black background and not overlaid on the tissue) are significantly reduced in FIG. 4B indicating that pegylated amplification enzymes eliminates amplification off tissue.

[0010] FIG. 5 provides a side-by-side comparison showing the transcript map (i.e., detected transcripts overlaid on a tonsil image) next to an H&E stained tissue (right). The top row shows an unmodified variant, MS-65, produces a significant number of amplification products both on and off tissue, however the pegylated version significantly reduces the amount of amplification products generated and detected off-tissue.DETAILED DESCRIPTIONI. Definitions

[0011] All patents, patent applications, articles and publications mentioned herein, both supra and infra, are hereby expressly incorporated herein by reference in their entireties.

[0012] Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Various scientific dictionaries that include the terms included herein are well known and available to those in the art. Although any methods and materials similar or equivalent to those described herein find use in the practice or testing of the disclosure, some preferred methods and materials are described. Accordingly, the terms defined immediately below are more fully described by reference to the specification as a whole. It is to be understood that this disclosure is not limited to the particular methodology, protocols, and reagents described, as these may vary, depending upon the context in which they are used by those of skill in the art. The following definitions are provided to facilitate understanding of certain terms used frequently herein and are not meant to limit the scope of the present disclosure.Atorney Docket No.: 051385-638001WO

[0013] As used herein, the singular terms “a”, “an”, and “the” include the plural reference unless the context clearly indicates otherwise. Reference throughout this specification to, for example, “one embodiment”, “an embodiment”, “another embodiment”, “a particular embodiment”, “a related embodiment”, “a certain embodiment”, “an additional embodiment”, or “a further embodiment” or combinations thereof means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Thus, the appearances of the foregoing phrases in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

[0014] It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.

[0015] As used herein, the term “about” means a range of values including the specified value, which a person of ordinary skill in the art would consider reasonably similar to the specified value. In embodiments, the term “about” means within a standard deviation using measurements generally acceptable in the art. In embodiments, about means a range extending to + / - 10% of the specified value. In embodiments, about means the specified value.

[0016] Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art. See, e.g., Singleton et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY 2nd ed., J. Wiley & Sons (New York, NY 1994); Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, Cold Springs Harbor Press (Cold Springs Harbor, NY 1989). Any methods, devices and materials similar or equivalent to those described herein can be used in the practice of this invention. The following definitions are provided to facilitate understanding of certain terms used frequently herein and are not meant to limit the scope of the present disclosure.Atorney Docket No.: 051385-638001WO

[0017] ‘ ‘Nucleic acid” refers to nucleotides (e g., deoxyribonucleotides or ribonucleotides) and polymers thereof in either single-, double- or multiple-stranded form, or complements thereof. The terms “polynucleotide,” “oligonucleotide,” “oligo” or the like refer, in the usual and customary sense, to a sequence of nucleotides. The term “nucleotide” refers, in the usual and customary sense, to a single unit of a polynucleotide, z.e., a monomer. Nucleotides can be ribonucleotides, deoxyribonucleotides, or modified versions thereof. Examples of polynucleotides contemplated herein include single and double stranded DNA, single and double stranded RNA, and hybrid molecules having mixtures of single and double stranded DNA and RNA with linear or circular framework. Non-limiting examples of polynucleotides include a gene, a gene fragment, an exon, an intron, intergenic DNA (including, without limitation, heterochromatic DNA), messenger RNA (mRNA), transfer RNA, ribosomal RNA, a ribozyme, cDNA, a recombinant polynucleotide, a branched polynucleotide, a plasmid, a vector, isolated DNA of a sequence, isolated RNA of a sequence, a nucleic acid probe, and a primer. Polynucleotides useful in the methods of the disclosure may comprise natural nucleic acid sequences and variants thereof, artificial nucleic acid sequences, or a combination of such sequences. As may be used herein, the terms “nucleic acid oligomer” and “oligonucleotide” are used interchangeably and are intended to include, but are not limited to, nucleic acids having a length of 200 nucleotides or less. In some embodiments, an oligonucleotide is a nucleic acid having a length of 2 to 200 nucleotides, 2 to 150 nucleotides, 5 to 150 nucleotides or 5 to 100 nucleotides. The terms “polynucleotide,” “oligonucleotide,” “oligo” or the like refer, in the usual and customary sense, to a linear sequence of nucleotides. Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up to about 100 nucleotides in length. In some embodiments, an oligonucleotide is a primer configured for extension by a polymerase when the primer is annealed completely or partially to a complementary nucleic acid template. A primer is often a single stranded nucleic acid. In certain embodiments, a primer, or portion thereof, is substantially complementary to a portion of an adapter. In some embodiments, a primer has a length of 200 nucleotides or less. In certain embodiments, a primer has a length of 10 to 150 nucleotides, 15 to 150 nucleotides, 5 to 100 nucleotides, 5 to 50 nucleotides or 10 to 50 nucleotides. In some embodiments, an oligonucleotide may be immobilized to a solid support.Atorney Docket No.: 051385-638001WO

[0018] The term “duplex” in the context of polynucleotides refers, in the usual and customary sense, to double-strandedness. Nucleic acids can be linear or branched. For example, nucleic acids can be a linear chain of nucleotides or the nucleic acids can be branched, e.g., such that the nucleic acids comprise one or more arms or branches of nucleotides. Optionally, the branched nucleic acids are repetitively branched to form higher ordered structures such as dendrimers and the like. Different polynucleotides may have different three-dimensional structures, and may perform various functions, known or unknown. Nucleic acids, including e.g., nucleic acids with a phosphothioate backbone, can include one or more reactive moieties. As used herein, the term reactive moiety includes any group capable of reacting with another molecule, e.g., a nucleic acid or polypeptide through covalent, non-covalent or other interactions. By way of example, the nucleic acid can include an amino acid reactive moiety that reacts with an amio acid on a protein or polypeptide through a covalent, non-covalent or other interaction.

[0019] The term “base” and “nucleobase” as used herein refers to a purine or pyrimidine compound, or a derivative thereof, that may be a constituent of nucleic acid (i.e. DNA or RNA, or a derivative thereof). In embodiments, the base is a derivative of a naturally occurring DNA or RNA base (e.g., a base analogue). In embodiments, the base is a base-pairing base. In embodiments, the base pairs to a complementary base. In embodiments, the base is capable of forming at least one hydrogen bond with a complementary base (e.g., adenine hydrogen bonds with thymine, adenine hydrogen bonds with uracil, guanine pairs with cytosine). Non-limiting examples of a base includes cytosine or a derivative thereof (e.g., cytosine analogue), guanine or a derivative thereof (e.g., guanine analogue), adenine or a derivative thereof (e.g., adenine analogue), thymine or a derivative thereof (e.g., thymine analogue), uracil or a derivative thereof (e.g., uracil analogue), hypoxanthine or a derivative thereof (e g., hypoxanthine analogue), xanthine or a derivative thereof (e.g., xanthine analogue), guanosine or a derivative thereof (e.g., 7 -methylguanosine analogue), deaza-adenine or a derivative thereof (e.g., deaza-adenine analogue), deaza-guanine or a derivative thereof (e.g., deaza-guanine), deaza-hypoxanthine or a derivative thereof, 5,6-dihydrouracil or a derivative thereof (e.g., 5,6-dihydrouracil analogue), 5- methylcytosine or a derivative thereof (e.g., 5-methylcytosine analogue), or 5- hydroxymethylcytosine or a derivative thereof (e.g., 5-hydroxymethylcytosine analogue) moieties. In embodiments, the base is thymine, cytosine, uracil, adenine, guanine, hypoxanthine,Atorney Docket No.: 051385-638001WO xanthine, theobromine, caffeine, uric acid, or isoguanine. In embodiments, the base is

[0020] A polynucleotide is typically composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); and thymine (T) (uracil (U) for thymine (T) when the polynucleotide is RNA. Thus, the term “polynucleotide sequence” is the alphabetical representation of a polynucleotide molecule; alternatively, the term may be applied to the polynucleotide molecule itself. This alphabetical representation can be input into databases in a computer having a central processing unit and used for bioinformatics applications such as functional genomics and homology searching. Polynucleotides may optionally include one or more non-standard nucleotide(s), nucleotide analog(s) and / or modified nucleotides.

[0021] The term “isolated”, when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is essentially free of other cellular components with which it is associated in the natural state. It can be, for example, in a homogeneous state and may be in either a dry or an aqueous solution. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified.

[0022] The terms “analog” and “analogue” and “derivative” in reference to a chemical compound, refers to compounds having a structure similar to that of another one, but differing from it in respect of one or more different atoms, functional groups, or substructures that are replaced with one or more other atoms, functional groups, or substructures. In the context of a nucleotide useful in practicing the invention, a nucleotide analog refers to a compound that, like the nucleotide of which it is an analog, can be incorporated into a nucleic acid molecule (e.g., an extension product) by a suitable polymerase, for example, a DNA polymerase in the context of a dNTP analogue. The terms also encompass nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-Atorney Docket No.: 051385-638001WO naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, include, without limitation, phosphodiester derivatives including, e.g., phosphorami date, phosphorodi ami date, phosphorothioate (also known as phosphothioate having double bonded sulfur replacing oxygen in the phosphate), phosphorodithioate, phosphonocarboxylic acids, phosphonocarboxylates, phosphonoacetic acid, phosphonoformic acid, methyl phosphonate, boron phosphonate, or O-methylphosphoroamidite linkages (see Eckstein, OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL APPROACH, Oxford University Press) as well as modifications to the nucleotide bases such as in 5-methyl cytidine or pseudouridine; and peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with positive backbones; non-ionic backbones, modified sugars, and non-ribose backbones (e.g. phosphorodiamidate morpholino oligos or locked nucleic acids (LNA) as known in the art), including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, CARBOHYDRATE MODIFICATIONS IN ANTISENSE RESEARCH, Sanghui & Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. In embodiments, the internucleotide linkages in DNA are phosphodiester, phosphodiester derivatives, or a combination of both.

[0023] As used herein, a “native” nucleotide is used in accordance with its plain and ordinary meaning and refers to a naturally occurring nucleotide that does not include an exogenous label (e.g., a fluorescent dye, or other label) or chemical modification such as may characterize a nucleotide analog. Examples of native nucleotides useful for carrying out procedures described herein include: dATP (2’ -deoxy adenosine-5’ -triphosphate); dGTP (2’-deoxyguanosine-5’- triphosphate); dCTP (2’ -deoxy cytidine-5’ -triphosphate); dTTP (2’-deoxythymidine-5’- triphosphate); and dUTP (2’-deoxyuridine-5’-triphosphate).Atorney Docket No.: 051385-638001WO

[0024] The term “complement,” as used herein, refers to a nucleotide (e ., RNA or DNA) or a sequence of nucleotides capable of base pairing with a complementary nucleotide or sequence of nucleotides. As described herein and commonly known in the art, the complementary (matching) nucleoside of adenosine is thymidine and the complementary (matching) nucleoside of guanosine is cytidine. Thus, a complement may include a sequence of nucleotides that base pair with corresponding complementary nucleotides of a second nucleic acid sequence. The nucleotides of a complement may match, partially or completely, the nucleotides of the second nucleic acid sequence. Where the nucleotides of the complement completely match each nucleotide of the second nucleic acid sequence, the complement forms base pairs with each nucleotide of the second nucleic acid sequence. Where the nucleotides of the complement partially match the nucleotides of the second nucleic acid sequence, only some of the nucleotides of the complement form base pairs with nucleotides of the second nucleic acid sequence. Examples of complementary sequences include coding and non-coding sequences, wherein the non-coding sequence contains complementary nucleotides to the coding sequence and thus forms the complement of the coding sequence. A further example of complementary sequences are sense and antisense sequences, wherein the sense sequence contains complementary nucleotides to the antisense sequence and thus forms the complement of the antisense sequence.

[0025] As described herein, the complementarity of sequences may be partial, in which only some of the nucleic acids match according to base pairing, or complete, where all the nucleic acids match according to base pairing. Thus, two sequences that are complementary to each other may have a specified percentage of nucleotides that are complementary (i.e., about 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher complementarity over a specified region).

[0026] “DNA” refers to deoxyribonucleic acid, a polymer of deoxyribonucleotides (e.g., dATP, dCTP, dGTP, dTTP, dUTP, etc.) linked by phosphodiester bonds. DNA can be singlestranded (ssDNA) or double-stranded (dsDNA), and can include both single and double-stranded (or “duplex”) regions. “RNA” refers to ribonucleic acid, a polymer of ribonucleotides linked by phosphodiester bonds. RNA can be single-stranded (ssRNA) or double-stranded (dsRNA), and can include both single and double-stranded (or “duplex”) regions. Single-stranded DNA (orAtorney Docket No.: 051385-638001WO regions thereof) and ssRNA can, if sufficiently complementary, hybridize to form doublestranded DNA / RNA complexes (or regions).

[0027] The term “DNA primer” or simply “primer” refers to any DNA molecule that may hybridize to a DNA template and be bound by a DNA polymerase and extended in a template- directed process for nucleic acid synthesis. The primer may be a separate polynucleotide from the polynucleotide template, or both may be portions of the same polynucleotide (e.g., as in a hairpin structure having a 3’ end that is extended along another portion of the polynucleotide to extend a double-stranded portion of the hairpin). Primers (e.g., forward or reverse primers) may be attached to a solid support. A primer can be of any length depending on the particular technique it will be used for. For example, PCR primers are generally between 10 and 40 nucleotides in length. The length and complexity of the nucleic acid fixed onto the nucleic acid template may vary. In some embodiments, a primer has a length of 200 nucleotides or less. In certain embodiments, a primer has a length of 10 to 150 nucleotides, 15 to 150 nucleotides, 5 to 100 nucleotides, 5 to 50 nucleotides or 10 to 50 nucleotides. A primer typically has a length of 10 to 50 nucleotides. For example, a primer may have a length of 10 to 40, 10 to 30, 10 to 20, 25 to 50, 15 to 40, 15 to 30, 20 to 50, 20 to 40, or 20 to 30 nucleotides. In some embodiments, a primer has a length of 18 to 24 nucleotides. One of skill can adjust these factors to provide optimum hybridization and signal production for a given hybridization procedure. The primer permits the addition of a nucleotide residue thereto, or oligonucleotide or polynucleotide synthesis therefrom, under suitable conditions. In an embodiment, the primer is a DNA primer, i.e., a primer consisting of, or largely consisting of, deoxyribonucleotide residues. The primers are designed to have a sequence that is the complement of a region of template / target DNA to which the primer hybridizes. The addition of a nucleotide residue to the 3’ end of a primer by formation of a phosphodiester bond results in a DNA extension product. The addition of a nucleotide residue to the 3’ end of the DNA extension product by formation of a phosphodi ester bond results in a further DNA extension product. In another embodiment the primer is an RNA primer. In embodiments, a primer is hybridized to a target polynucleotide. A “primer” is complementary to a polynucleotide template, and complexes by hydrogen bonding or hybridization with the template to give a primer / template complex for initiation of synthesis by a polymerase, which is extended by the addition of covalently bonded bases linked at its 3' end complementary to the template in the process of DNA synthesis.Atorney Docket No.: 051385-638001WO

[0028] As used herein, the term “primer binding sequence” refers to a polynucleotide sequence that is complementary to at least a portion of a primer (e.g., a sequencing primer or an amplification primer). Primer binding sequences can be of any suitable length. In embodiments, a primer binding sequence is about or at least about 10, 15, 20, 25, 30, or more nucleotides in length. In embodiments, a primer binding sequence is 10-50, 15-30, or 20-25 nucleotides in length. The primer binding sequence may be selected such that the primer (e.g., sequencing primer) has the preferred characteristics to minimize secondary structure formation or minimize non-specific amplification, for example having a length of about 20-30 nucleotides; approximately 50% GC content, and a Tm of about 55°C to about 65°C.

[0029] The term “DNA template” refers to any DNA molecule that may be bound by a DNA polymerase and utilized as a template for nucleic acid synthesis.

[0030] The term “dATP analogue” refers to an analogue of deoxyadenosine triphosphate (dATP) that is a substrate for a DNA polymerase. The term “dCTP analogue” refers to an analogue of deoxycytidine triphosphate (dCTP) that is a substrate for a DNA polymerase. The term “dGTP analogue” refers to an analogue of deoxyguanosine triphosphate (dGTP) that is a substrate for a DNA polymerase. The term “dNTP analogue” refers to an analogue of deoxynucleoside triphosphate (dNTP) that is a substrate for a DNA polymerase. The term “dTTP analogue” refers to an analogue of deoxythymidine triphosphate (dUTP) that is a substrate for a DNA polymerase. The term “dUTP analogue” refers to an analogue of deoxyuridine triphosphate (dUTP) that is a substrate for a DNA polymerase.

[0031] The term “extendible” means, in the context of a nucleotide, primer, or extension product, that the 3 ’-OH group of the molecule is available and accessible to a DNA polymerase for extension or addition of nucleotides derived from dNTPs or dNTP analogues.“Incorporation” means joining of the modified nucleotide to the free 3’ hydroxyl group of a second nucleotide via formation of a phosphodiester linkage with the 5’ phosphate group of the modified nucleotide. The second nucleotide to which the modified nucleotide is joined will typically occur at the 3’ end of a polynucleotide chain.

[0032] The term “modified nucleotide” refers to nucleotide or nucleotide analogue modified in some manner. Typically, a nucleotide contains a single 5-carbon sugar moiety, a singleAtorney Docket No.: 051385-638001WO nitrogenous base moiety and 1 to three phosphate moieties. In particular embodiments, a nucleotide can include a blocking moiety or a label moiety. A blocking moiety (e.g., a reversible terminator moiety) on a nucleotide prevents formation of a covalent bond between the 3’ hydroxyl moiety of the nucleotide and the 5’ phosphate of another nucleotide. A blocking moiety on a nucleotide can be reversible (i.e., a reversible terminator), whereby the blocking moiety can be removed or modified to allow the 3’ hydroxyl to form a covalent bond with the 5’ phosphate of another nucleotide. A blocking moiety can be effectively irreversible under particular conditions used in a method set forth herein. A label moiety of a nucleotide can be any moiety that allows the nucleotide to be detected, for example, using a spectroscopic method. Exemplary label moieties are fluorescent labels, mass labels, chemiluminescent labels, electrochemical labels, detectable labels and the like. One or more of the above moieties can be absent from a nucleotide used in the methods and compositions set forth herein. For example, a nucleotide can lack a label moiety or a blocking moiety or both.

[0033] A “removable” group, e.g., a label or a blocking group or protecting group, refers to a chemical group that can be removed from a dNTP analogue such that a DNA polymerase can extend the nucleic acid (e.g., a primer or extension product) by the incorporation of at least one additional nucleotide. Removal may be by any suitable method, including enzymatic, chemical, or photolytic cleavage. Removal of a removable group, e.g., a blocking group, does not require that the entire removable group be removed, only that a sufficient portion of it be removed such that a DNA polymerase can extend a nucleic acid by incorporation of at least one additional nucleotide using a dNTP of dNTP analogue.

[0034] “Reversible blocking groups” or “reversible terminators” include a blocking moiety located, for example, at the 3’ position of the nucleotide and may be a chemically cleavable moiety such as an allyl group, an azidomethyl group or a methoxymethyl group, or may be an enzymatically cleavable group such as a phosphate ester. Suitable nucleotide blocking moieties are described in applications WO 2004 / 018497, U.S. Pat. Nos. 7,057,026, 7,541,444, WO 96 / 07669, U.S. Pat. Nos. 5,763,594, 5,808,045, 5,872,244 and 6,232,465 the contents of which are incorporated herein by reference in their entirety. The nucleotides may be labelled or unlabeled. They may be modified with reversible terminators useful in methods provided herein and may be 3’-O-blocked reversible or 3 ’-unblocked reversible terminators. In nucleotides withAtorney Docket No.: 051385-638001WO3’-O-blocked reversible terminators, the blocking group -OR [reversible terminating (capping) group] is linked to the oxygen atom of the 3 ’-OH of the pentose, while the label is linked to the base, which acts as a reporter and can be cleaved. The 3’-O-blocked reversible terminators are known in the art, and may be, for instance, a 3’-ONH2 reversible terminator, a 3’-O-allyl reversible terminator, or a 3’-O-azidomethy reversible terminator.3’-0-blocked reversible terminator o o o base — cleavable linker .. dyetermteteisg group

[0035] The term “non-covalent linker” is used in accordance with its ordinary meaning and refers to a divalent moiety which includes at least two molecules that are not covalently linked to each other but are capable of interacting with each other via a non-covalent bond (e.g. electrostatic interactions (e.g. ionic bond, hydrogen bond, halogen bond) or van der Waals interactions (e.g. dipole-dipole, dipole-induced dipole, London dispersion). In embodiments, the non-covalent linker is the result of two molecules that are not covalently linked to each other that interact with each other via a non-covalent bond.

[0036] The terms “cleavable linker” or “cleavable moiety” as used herein refers to a divalent or monovalent, respectively, moiety which is capable of being separated (e g., detached, split, disconnected, hydrolyzed, a stable bond within the moiety is broken) into distinct entities. A cleavable linker is cleavable (e.g., specifically cleavable) in response to external stimuli (e.g., enzymes, nucleophilic / basic reagents, reducing agents, photo-irradiation, electrophilic / acidic reagents, organometallic and metal reagents, or oxidizing reagents). A chemically cleavable linker refers to a linker which is capable of being split in response to the presence of a chemical (e.g., acid, base, oxidizing agent, reducing agent, Pd(0), tris-(2-carboxyethyl)phosphine, dilute nitrous acid, fluoride, tri s(3 -hydroxy propyl)phosphine), sodium dithionite (Na2S2C>4), hydrazine (N2H4)). A chemically cleavable linker is non-enzymatically cleavable. In embodiments, the cleavable linker is cleaved by contacting the cleavable linker with a cleaving agent. InAtorney Docket No.: 051385-638001WO embodiments, the cleaving agent is sodium dithionite (Na2S2C>4), weak acid, hydrazine (N2H4), Pd(0), or light-irradiation (e.g., ultraviolet radiation).

[0037] The term “orthogonal detectable label” or “orthogonal detectable moiety” as used herein refer to a detectable label (e.g. fluorescent dye or detectable dye) that is capable of being detected and identified (e.g., by use of a detection means (e.g., emission wavelength, physical characteristic measurement)) in a mixture or a panel (collection of separate samples) of two or more different detectable labels. For example, two different detectable labels that are fluorescent dyes are both orthogonal detectable labels when a panel of the two different fluorescent dyes is subjected to a wavelength of light that is absorbed by one fluorescent dye but not the other and results in emission of light from the fluorescent dye that absorbed the light but not the other fluorescent dye. Orthogonal detectable labels may be separately identified by different absorbance or emission intensities of the orthogonal detectable labels compared to each other and not only be the absolute presence of absence of a signal. An example of a set of four orthogonal detectable labels is the set of Rox™-Labeled Tetrazine, Alexa Fluor® 488-Labeled SHA, Cy®5-Labeled Streptavidin, and R6G-Labeled Dibenzocyclooctyne. ROX™ is a trademark of Applera Corporation. Alexa Fluor® is a trademark of Life Technologies Corporation. Cy® is a trademark of Cytiva.

[0038] A “detectable agent” or “detectable compound” or “detectable label” or “detectable moiety” is a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, chemical, magnetic resonance imaging, or other physical means. For example, detectable agents include fluorophores (e.g. fluorescent dyes), modified oligonucleotides (e.g., moi eties described in PCT / US2015 / 022063, which is incorporated herein by reference), electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, paramagnetic molecules, paramagnetic nanoparticles, ultrasmall superparamagnetic iron oxide (“USPIO”) nanoparticles, USPIO nanoparticle aggregates, superparamagnetic iron oxide (“SPIO”) nanoparticles, SPIO nanoparticle aggregates, monochrystalline iron oxide nanoparticles, monochrystalline iron oxide, nanoparticle contrast agents, liposomes or other delivery vehicles containing Gadolinium chelate (“Gd-chelate”) molecules, Gadolinium, radioisotopes, radionuclides (e.g. carbon-11, nitrogen-13, oxygen-15, fluorine-18, rubidium-82), fluorodeoxyglucose (e.g. fluorine-18 labeled), any gamma ray emitting radionuclides, positron-Atorney Docket No.: 051385-638001WO emitting radionuclide, radiolabeled glucose, radiolabeled water, radiolabeled ammonia, biocolloids, microbubbles (e.g. including microbubble shells including albumin, galactose, lipid, and / or polymers; microbubble gas core including air, heavy gas(es), perfluorcarbon, nitrogen, octafluoropropane, perflexane lipid microsphere, perflutren, etc.), iodinated contrast agents (e.g. iohexol, iodixanol, ioversol, iopamidol, ioxilan, iopromide, diatrizoate, metrizoate, ioxaglate), barium sulfate, thorium dioxide, gold, gold nanoparticles, gold nanoparticle aggregates, fluorophores, two-photon fluorophores, or haptens and proteins or other entities which can be made detectable, e.g., by incorporating a radiolabel into a peptide or antibody specifically reactive with a target peptide. Examples of detectable agents include imaging agents, including fluorescent and luminescent substances, including, but not limited to, a variety of organic or inorganic small molecules commonly referred to as “dyes,” “labels,” or “indicators.” Examples include fluorescein, rhodamine, acridine dyes, Alexa Fluor® dyes, and cyanine dyes. In embodiments, the detectable moiety is a fluorescent molecule (e.g., acridine dye, cyanine, dye, fluorine dye, oxazine dye, phenanthridine dye, or rhodamine dye). In embodiments, the detectable moiety is a fluorescent molecule (e.g., acridine dye, cyanine, dye, fluorine dye, oxazine dye, phenanthridine dye, or rhodamine dye). In embodiments, the detectable moiety is a fluorescein isothiocyanate moiety, tetramethylrhodamine-5-(and 6)-isothiocyanate moiety, Cy®2 moiety, Cy®3 moiety, Cy®5 moiety, Cy®7 moiety, 4’,6-diamidino-2-phenylindole moiety, Hoechst 33258 moiety, Hoechst 33342 moiety, Hoechst 34580 moiety, propidium-iodide moiety, or acridine orange moiety. In embodiments, the detectable label is a fluorescent dye. In embodiments, the detectable label is a fluorescent dye capable of exchanging energy with another fluorescent dye (e.g., fluorescence resonance energy transfer (FRET) chromophores).

[0039] A “cleavable site” or “scissile linkage” in the context of a polynucleotide is a site which allows controlled cleavage of the polynucleotide strand (e.g., the linker, the primer, or the polynucleotide) by chemical, enzymatic, or photochemical means known in the art and described herein. A scissile site may refer to the linkage of a nucleotide between two other nucleotides in a nucleotide strand (i.e., an internucleosidic linkage). In embodiments, the scissile linkage can be located at any position within the one or more nucleic acid molecules, including at or near a terminal end (e.g., the 3' end of an oligonucleotide) or in an interior portion of the one or more nucleic acid molecules. In embodiments, conditions suitable for separating a scissile linkage include a modulating the pH and / or the temperature. In embodiments, a scissile site can includeAtorney Docket No.: 051385-638001WO at least one acid-labile linkage. For example, an acid-labile linkage may include a phosphoramidate linkage. In embodiments, a phosphoramidate linkage can be hydrolysable under acidic conditions, including mild acidic conditions such as trifluoroacetic acid and a suitable temperature (e.g., 30°C), or other conditions known in the art, for example Matthias Mag, et al Tetrahedron Letters, Volume 33, Issue 48, 1992, 7319-7322. In embodiments, the scissile site can include at least one photolabile internucleosidic linkage (e.g., o-nitrobenzyl linkages, as described in Walker et al, I. Am. Chem. Soc. 1988, 110, 21, 7170-7177), such as o- nitrobenzyloxymethyl or p-nitrobenzyloxymethyl group(s). In embodiments, the scissile site includes at least one uracil nucleobase. In embodiments, a uracil nucleobase can be cleaved with a uracil DNA glycosylase (UDG) or Formamidopyrimidine DNA Glycosylase Fpg. In embodiments, the scissile linkage site includes a sequence-specific nicking site having a nucleotide sequence that is recognized and nicked by a nicking endonuclease enzyme or a uracil DNA glycosylase.

[0040] The term “amino acid” refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y-carboxy glutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid. The terms “non-naturally occurring amino acid” and “unnatural amino acid” refer to amino acid analogs, synthetic amino acids, and amino acid mimetics, which are not found in nature.

[0041] Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB BiochemicalAtorney Docket No.: 051385-638001WONomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.

[0042] The terms “polypeptide,” “peptide” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues, wherein the polymer may in embodiments be conjugated to a moiety that does not consist of amino acids. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers. A “fusion protein” refers to a chimeric protein encoding two or more separate protein sequences that are recombinantly expressed as a single moiety.

[0043] “Conservatively modified variants” applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, “conservatively modified variants” refers to those nucleic acids that encode identical or essentially identical amino acid sequences. Because of the degeneracy of the genetic code, a number of nucleic acid sequences will encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are “silent variations,” which are one species of conservatively modified variations. Every nucleic acid sequence herein, which encodes a polypeptide, also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid that encodes a polypeptide is implicit in each described sequence.

[0044] As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modifiedAtorney Docket No.: 051385-638001WO variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the disclosure.

[0045] The following groups each contain amino acids that are conservative substitutions for one another: 1) Non-polar - Alanine (A), Leucine (L), Isoleucine (I), Valine (V), Glycine (G), Methionine (M); 2) Aliphatic - Alanine (A), Leucine (L), Isoleucine (I), Valine (V); 3) Acidic - Aspartic acid (D), Glutamic acid (E); 4) Polar - Asparagine (N), Glutamine (Q); Serine (S), Threonine (T); 5) Basic - Arginine (R), Lysine (K); 7) Aromatic - Phenylalanine (F), Tyrosine (Y), Tryptophan (W), Histidine (H); 8) Other - Cysteine (C) and Proline (P).

[0046] “Percentage of sequence identity” is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Percent identity often refers to the percentage of matching positions of two sequences for a contiguous section of positions, wherein the two sequences are aligned in such a way to maximize matching positions and minimize gaps of non-matching positions. In some embodiments, alignments are conducted wherein there are no gaps between the two sequences. In some instances, the alignment results in less than 5% gaps, less than 3% gaps, or less than 1% gaps. Additional methods of sequence comparison or alignment are also consistent with the disclosure.

[0047] The terms “identical” or percent “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithm with default parameters described below,Atorney Docket No.: 051385-638001WO or by manual alignment and visual inspection (see, e.g., NCBI web site www.ncbi.nlm.nih.gov / BLAST / or the like). Such sequences are then said to be “substantially identical.” This definition also refers to, or may be applied to, the complement of a test sequence. The definition also includes sequences that have deletions and / or additions, as well as those that have substitutions. As described below, the preferred algorithms can account for gaps and the like. Preferably, identity exists over a region that is at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 50-100 amino acids or nucleotides in length. As used herein, percent (%) amino acid sequence identity is defined as the percentage of amino acids in a candidate sequence that is identical to the amino acids in a reference sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the level of skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared can be determined by known methods.

[0048] For sequence comparisons, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Preferably, default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.

[0049] A “comparison window”, as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 10 to 700, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well- known in the art. Optimal alignment of sequences for comparison can be conducted, e g., by the local homology algorithm of Smith & Waterman, Adv. AppL Math. 2:482 (1981), by theAtorney Docket No.: 051385-638001WO homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat ’I. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds. 1995 supplement)).

[0050] An amino acid or nucleotide base “position” is denoted by a number that sequentially identifies each amino acid (or nucleotide base) in the reference sequence based on its position relative to the N-terminus (or 5 ’-end). Due to deletions, insertions, truncations, fusions, and the like that must be taken into account when determining an optimal alignment, in general the amino acid residue number in a test sequence determined by simply counting from the N- terminus will not necessarily be the same as the number of its corresponding position in the reference sequence. For example, in a case where a variant has a deletion relative to an aligned reference sequence, there will be no amino acid in the variant that corresponds to a position in the reference sequence at the site of deletion. Where there is an insertion in an aligned reference sequence, that insertion will not correspond to a numbered amino acid position in the reference sequence. In the case of truncations or fusions there can be stretches of amino acids in either the reference or aligned sequence that do not correspond to any amino acid in the corresponding sequence.

[0051] The terms “position”, “numbered with reference to” or “corresponding to,” when used in the context of the numbering of a given amino acid or polynucleotide sequence, refer to the numbering of the residues of a specified reference sequence when the given amino acid or polynucleotide sequence is compared to the reference sequence. Similarly, the term “functionally equivalent to” in relation to an amino acid position refers to an amino acid residue in a protein that corresponds to a particular amino acid in a reference sequence. An amino acid “corresponds” to a given residue when it occupies the same essential structural position within the protein as the given residue. One skilled in the art will immediately recognize the identity and location of residues corresponding to a specific position in a protein (e.g., polymerase) in other proteins with different numbering systems. For example, by performing a simple sequence alignment with a protein (e.g., polymerase) the identity and location of residues corresponding toAtorney Docket No.: 051385-638001WO specific positions of said protein are identified in other protein sequences aligning to said protein. For example, a selected residue in a selected protein corresponds to methionine at position 129 when the selected residue occupies the same essential spatial or other structural relationship as a methionine at position 129. In some embodiments, where a selected protein is aligned for maximum homology with a protein, the position in the aligned selected protein aligning with methionine 129 is said to correspond to methionine 129. Instead of a primary sequence alignment, a three-dimensional structural alignment can also be used, e.g., where the structure of the selected protein is aligned for maximum correspondence with the methionine at position 129, and the overall structures compared. In this case, an amino acid that occupies the same essential position as methionine 129 in the structural model is said to correspond to the methionine 129 residue. Sequence alignments may be compiled using any of the standard alignment tools known in the art, such as for example BLAST and DIAMOND (Buchfink et al. Nat Methods 12, 59-60 (2015)), and the like.

[0052] The term “DNA polymerase” and “nucleic acid polymerase” are used in accordance with their plain ordinary meaning and refer to enzymes capable of synthesizing nucleic acid molecules from nucleotides (e.g., deoxyribonucleotides). Typically, a DNA polymerase adds nucleotides to the 3’ - end of a DNA strand one nucleotide at a time.

[0053] The term “exonuclease activity” is used in accordance with its ordinary meaning in the art, and refers to the removal of a nucleotide from a nucleic acid by a DNA polymerase. For example, during polymerization, nucleotides are added to the 3’ end of the primer strand. Occasionally a DNA polymerase incorporates an incorrect nucleotide to the 3 ’-OH terminus of the primer strand, wherein the incorrect nucleotide cannot form a hydrogen bond to the corresponding base in the template strand. Such a nucleotide, added in error, is removed from the primer as a result of the 3’ to 5’ exonuclease activity of the DNA polymerase. In embodiments, exonuclease activity may be referred to as “proofreading.” When referring to 3 ’-5’ exonuclease activity, it is understood that the DNA polymerase facilitates a hydrolyzing reaction that breaks phosphodiester bonds at the 3’ end of a polynucleotide chain to excise the nucleotide, thereby releasing deoxyribonucleoside 5 ’-monophosphates one after another. One having skill in the art understands that an enzyme having 3 ’-5’ exonuclease activity does not cleave DNA strands without terminal 3 ’-OH moi eties. In embodiments, 3 ’-5’ exonuclease activity refers to theAtorney Docket No.: 051385-638001WO successive removal of nucleotides in single-stranded DNA in a 3’ — > 5’ direction, releasing deoxyribonucleoside 5 ’-monophosphates one after another. Methods for quantifying exonuclease activity are known in the art, see for example Southworth et al, PNAS Vol 93, 8281- 8285 (1996).

[0054] The terms “measure”, “measuring”, “measurement” and the like refer not only to quantitative measurement of a particular variable, but also to qualitative and semi-quantitative measurements. Accordingly, “measurement” also includes detection, meaning that merely detecting a change, without quantification, constitutes measurement.

[0055] A “polymerase-template complex” refers to a functional complex between a DNA polymerase and a DNA primer-template molecule (e.g., nucleic acid). In embodiments, the polymerase is non-covalently bound to a nucleic acid primer and the template nucleic acid molecule.

[0056] The terms “sequencing”, “sequence determination”, “determining a nucleotide sequence”, and the like include determination of partial as well as full sequence information of the polynucleotide being sequenced. That is, the term includes sequence comparisons, fingerprinting, and like levels of information about a target polynucleotide, as well as the express identification and ordering of nucleotides in a target polynucleotide. The term also includes the determination of the identification, ordering, and locations of one, two, or three of the four types of nucleotides within a target polynucleotide.

[0057] The term “sequencing reaction mixture” refers to an aqueous mixture that contains the reagents necessary to allow a dNTP or dNTP analogue to add a nucleotide to a DNA strand by a DNA polymerase. Exemplary mixtures include buffers (e.g., saline-sodium citrate (SSC), tris(hydroxymethyl)aminomethane or “Tris”), salts (e.g., KC1 or (NHfhSCE)), nucleotides, polymerases, cleaving agent (e.g., tri-n-butyl-phosphine, triphenyl phosphine and its sulfonated versions (i.e., tris(3-sulfophenyl)-phosphine, TPPTS), and tri(carboxyethyl)phosphine (TCEP) and its salts, cleaving agent scavenger compounds (e.g., 2'-Dithiobisethanamine or 11-Azido- 3,6,9-trioxaundecane-l-amine), detergents and / or crowding agents or stabilizers (e.g., PEG, Tween®, BSA). Tween® is a registered trademark of Croda International PLC.Atorney Docket No.: 051385-638001WO

[0058] The term “solid substrate” means any suitable medium present in the solid phase to which an antibody or an agent can be covalently or non-covalently affixed or immobilized. Preferred solid substrates are glass. Non-limiting examples include chips, beads and columns. The solid substrate can be non-porous or porous. Exemplary solid substrates include, but are not limited to, glass and modified or functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon™, cyclic olefins, polyimides, etc.), nylon, ceramics, resins, Zeonor®, silica or silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses, optical fiber bundles, and polymers. Zeonor® is a registered trademark of Zeon Corporation.

[0059] The term “species”, when used in the context of describing a particular compound or molecule species, refers to a population of chemically indistinct molecules. When used in the context of taxonomy, “species” is the basic unit of classification and a taxonomic rank. For example, in reference to the microorganism Pyrococcus horikoshii horikoshii is a species of the genus Pyrococcus.

[0060] The term “expression” includes any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post- translational modification, and secretion. Expression can be detected using conventional techniques for detecting protein (e.g., ELISA, Western blotting, flow cytometry, immunofluorescence, immunohistochemistry, etc.).

[0061] A “cell” as used herein, refers to a cell carrying out metabolic or other function sufficient to preserve or replicate its genomic DNA. A cell can be identified by well-known methods in the art including, for example, presence of an intact membrane, staining by a particular dye, ability to produce progeny or, in the case of a gamete, ability to combine with a second gamete to produce a viable offspring. Cells may include prokaryotic and eukaryotic cells. Prokaryotic cells include but are not limited to bacteria. Eukaryotic cells include but are not limited to yeast cells and cells derived from plants and animals, for example mammalian, insect (e.g., spodoptera) and human cells.Atorney Docket No.: 051385-638001WO

[0062] “ Control” or “control experiment” is used in accordance with its plain ordinary meaning and refers to an experiment in which the subjects (e.g., enzymes) or reagents of the experiment are treated as in a parallel experiment except for omission of a procedure, reagent, or variable of the experiment (e.g., a polymerase not having one or more mutations relative to the polymerase being tested). In some instances, the control is used as a standard of comparison in evaluating experimental effects. In some embodiments, a control is the measurement of the activity of a protein in the absence of a mutation as described herein (including embodiments and examples). “Control polymerase” is defined herein as the polymerase against which the activity of the altered polymerase is compared. In one embodiment of the invention the control polymerase may comprise a wild type polymerase or an exo-variant thereof. Unless otherwise stated, by “wild type” it is generally meant that the polymerase comprises its natural amino acid sequence, as it would be found in nature. The invention is not limited to merely a comparison of activity of the polymerases as described herein against the wild type equivalent or exo-variant of the polymerase that is being altered. Many polymerases exist whose amino acid sequence has been modified (e.g., by amino acid substitution mutations) and which can prove to be a suitable control for use in assessing the modified nucleotide incorporation efficiencies of the polymerases as described herein. The control polymerase can, therefore, comprise any known polymerase, including mutant polymerases known in the art. The activity of the chosen “control” polymerase with respect to incorporation of the desired nucleotide analogues may be determined by an incorporation assay. In embodiments, the control includes performing the experiment with a wild type polymerase.

[0063] The term “modulate” is used in accordance with its plain ordinary meaning and refers to the act of changing or varying one or more properties. “Modulation” refers to the process of changing or varying one or more properties.

[0064] The term “kit” is used in accordance with its plain ordinary meaning and refers to any delivery system for delivering materials or reagents for carrying out a method of the invention. Such delivery systems include systems that allow for the storage, transport, or delivery of reaction reagents (e.g., nucleotides, enzymes, nucleic acid templates, etc. in the appropriate containers) and / or supporting materials (e.g., buffers, written instructions for performing the reaction, etc.) from one location to another location. For example, kits include one or moreAtorney Docket No.: 051385-638001WO enclosures (e.g., boxes) containing the relevant reaction reagents and / or supporting materials. Such contents may be delivered to the intended recipient together or separately. For example, a first container may contain an enzyme, while a second container contains nucleotides. In embodiments, the kit includes vessels containing one or more enzymes, primers, adaptors, or other reagents as described herein. Vessels may include any structure capable of supporting or containing a liquid or solid material and may include tubes, vials, jars, containers, tips, etc. In embodiments, a wall of a vessel may permit the transmission of light through the wall. In embodiments, the vessel may be optically clear. The kit may include the enzyme and / or nucleotides in a buffer. In embodiments, the buffer includes an acetate buffer, 3-(N- morpholino)propanesulfonic acid (MOPS) buffer, N-(2-Acetamido)-2-aminoethanesulfonic acid (ACES) buffer, phosphate-buffered saline (PBS) buffer, 4-(2-hy droxy ethyl)- 1- piperazineethanesulfonic acid (HEPES) buffer, N-(l,l-Dimethyl-2-hydroxyethyl)-3-amino-2- hydroxypropanesulfonic acid (AMPSO) buffer, borate buffer (e.g., borate buffered saline, sodium borate buffer, boric acid buffer), 2-Amino-2-m ethyl- 1,3 -propanediol (AMPD) buffer, N- cyclohexyl-2-hydroxyl-3 -aminopropanesulfonic acid (CAPSO) buffer, 2-Amino-2-methyl-l- propanol (AMP) buffer, 4-(Cyclohexylamino)-l -butanesulfonic acid (CABS) buffer, glycine- NaOH buffer, N-Cyclohexyl-2-aminoethanesulfonic acid (CHES) buffer, tri s(hydroxymethyl)aminom ethane (Tris) buffer, or a N-cy cl ohexyl-3 -aminopropanesulfonic acid (CAPS) buffer. In embodiments, the buffer is a borate buffer. In embodiments, the buffer is a CHES buffer.

[0065] Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly indicates otherwise, between the upper and lower limit of that range, and any other stated or unstated intervening value in, or smaller range of values within, that stated range is encompassed within the invention. The upper and lower limits of any such smaller range (within a more broadly recited range) may independently be included in the smaller ranges, or as particular values themselves, and are also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.Atorney Docket No.: 051385-638001WO

[0066] The phrase “stringent hybridization conditions” refers to conditions under which a primer will hybridize to its target subsequence, typically in a complex mixture of nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic Probes, “Overview of principles of hybridization and the strategy of nucleic acid assays” (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tmis the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive signal is at least two times background, preferably 10 times background hybridization. Exemplary stringent hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 0.1% SDS at 65°C.

[0067] “Synthetic” DNA polymerases refer to non-naturally occurring DNA polymerases such as those constructed by synthetic methods, mutated parent DNA polymerases such as truncated DNA polymerases and fusion DNA polymerases (e.g., as described in U.S. Pat. No. 7,541,170). Variants of the parent DNA polymerase have been engineered by mutating residues using site- directed or random mutagenesis methods known in the art. The variant is expressed in an expression system such as E. coli by methods known in the art.

[0068] The term “isolated” means altered or removed from the natural state. For example, a nucleic acid or a polypeptide naturally present in a living animal is not isolated, but the same nucleic acid or polypeptide partially or completely separated from the coexisting materials of its natural state is isolated. An isolated nucleic acid or protein can exist in substantially purified form, or can exist in a non-native environment such as, for example, a host cell. In embodiments, “isolated” refers to a nucleic acid, polynucleotide, polypeptide, protein, or other component thatAtorney Docket No.: 051385-638001WO is partially or completely separated from components with which it is normally associated (other proteins, nucleic acids, cells, etc.).

[0069] As used herein, the terms “biomolecule” or “analyte” refer to an agent (e.g., a compound, macromolecule, or small molecule), and the like derived from a biological system (e.g., an organism, a cell, or a tissue). The biomolecule may contain multiple individual components that collectively construct the biomolecule, for example, in embodiments, the biomolecule is a polynucleotide wherein the polynucleotide is composed of nucleotide monomers. The biomolecule may be or may include DNA, RNA, organelles, carbohydrates, lipids, proteins, or any combination thereof. These components may be extracellular. In some examples, the biomolecule may be referred to as a clump or aggregate of combinations of components. In some instances, the biomolecule may include one or more constituents of a cell but may not include other constituents of the cell. In embodiments, a biomolecule is a molecule produced by a biological system (e.g., an organism). The biomolecule may be any substance (e.g. molecule) or entity that is desired to be detected by the method of the invention. The biomolecule is the “target” of the assay method of the invention. The biomolecule may accordingly be any compound that may be desired to be detected, for example a peptide or protein, or nucleic acid molecule or a small molecule, including organic and inorganic molecules. The biomolecule may be a cell or a microorganism, including a virus, or a fragment or product thereof. Biomolecules of particular interest may thus include proteinaceous molecules such as peptides, polypeptides, proteins or prions or any molecule which includes a protein or polypeptide component, etc., or fragments thereof. The biomolecule may be a single molecule or a complex that contains two or more molecular subunits, which may or may not be covalently bound to one another, and which may be the same or different. Thus, in addition to cells or microorganisms, such a complex biomolecule may also be a protein complex. Such a complex may thus be a homo- or hetero-multimer. Aggregates of molecules e.g., proteins may also be target analytes, for example aggregates of the same protein or different proteins. The biomolecule may also be a complex between proteins or peptides and nucleic acid molecules such as DNA or RNA. Of particular interest may be the interactions between proteins and nucleic acids, e.g., regulatory factors, such as transcription factors, and interactions between DNA or RNA molecules.Atorney Docket No.: 051385-638001WO

[0070] As used herein, “biomaterial” refers to any biological material produced by an organism. In some embodiments, biomaterial includes secretions, extracellular matrix, proteins, lipids, organelles, membranes, cells, portions thereof, and combinations thereof. In some embodiments, cellular material includes secretions, extracellular matrix, proteins, lipids, organelles, membranes, cells, portions thereof, and combinations thereof. In some embodiments, biomaterial includes viruses. In some embodiments, the biomaterial is a replicating virus and thus includes virus infected cells. In embodiments, a biological sample includes biomaterials.

[0071] As used herein, the term “primed template DNA molecule” refers to a template DNA molecule which is associated with a primer (a short polynucleotide) that can serve as a starting point for DNA synthesis.

[0072] As used herein, the term “incorporating a nucleotide into a nucleic acid sequence” refers to the process of joining a cognate nucleotide to a nucleic acid primer by formation of a phosphodiester bond. As described herein, methods of incorporating a nucleotide into a nucleic acid sequence comprises combining in a reaction vessel: (i) a nucleic acid template, (ii) a nucleotide solution comprising a plurality of nucleotides, and (iii) a polymerase as described herein.

[0073] As used herein, the term “primer-template hybridization complex” refers to a double stranded nucleic acid complex formed as a result of a hybridization event between a DNA template molecule and a primer. In embodiments, the formation of a template complex enables elongation at the 3’ end of the primer.

[0074] A nucleic acid can be amplified by a suitable method. The term “amplified” as used herein refers to subjecting a target nucleic acid in a sample to a process that linearly or exponentially generates amplicon nucleic acids having the same or substantially the same e.g., substantially identical) nucleotide sequence as the target nucleic acid, or segment thereof, and / or a complement thereof. In embodiments an amplified product (e. ., an amplicon) can contain one or more additional and / or different nucleotides than the template sequence, or portion thereof, from which the amplicon was generated (e.g., a primer can contain “extra” nucleotides (such as a 5’ portion that does not hybridize to the template), or one or more mismatched bases within a hybridizing portion of the primer). Amplification according to the present disclosure encompasses any means by which at least a part of at least one target nucleic acid is reproduced,Atorney Docket No.: 051385-638001WO typically in a template-dependent manner, including without limitation, a broad range of techniques for amplifying nucleic acid sequences, either linearly or exponentially. Illustrative means for performing an amplifying step include ligase chain reaction (LCR), ligase detection reaction (LDR), ligation followed by Q-replicase amplification, PCR, primer extension, strand displacement amplification (SDA), hyperbranched strand displacement amplification, multiple displacement amplification (MDA), nucleic acid strand-based amplification (NASBA), two-step multiplexed amplifications, rolling circle amplification (RCA), and the like, including multiplex versions and combinations thereof, for example but not limited to, OLA (oligonucleotide ligation assay ) / PCR, PCR / OLA, LDR / PCR, PCR / PCR / LDR, PCR / LDR, LCR / PCR, PCR / LCR (also known as combined chain reaction — CCR), and the like.

[0075] In some embodiments, amplification includes at least one cycle of the sequential procedures of annealing at least one primer with complementary or substantially complementary sequences in at least one target nucleic acid; synthesizing at least one strand of nucleotides in a template-dependent manner using a polymerase; and optionally denaturing the newly -formed nucleic acid duplex to separate the strands. The cycle may or may not be repeated. Amplification can include thermocycling or can be performed isothermally.

[0076] As used herein, the term “rolling circle amplification (RCA)” refers to a nucleic acid amplification reaction that amplifies a circular nucleic acid template (e.g., single-stranded DNA circles) via a rolling circle process. Rolling circle amplification reaction is initiated by the hybridization of a primer to a circular, often single-stranded, nucleic acid template. The nucleic acid polymerase then extends the primer that is hybridized to the circular nucleic acid template by continuously progressing around the circular nucleic acid template to replicate the sequence of the nucleic acid template over and over again (rolling circle mechanism). The rolling circle amplification typically produces concatemers including tandem repeat units of the circular nucleic acid template sequence. The rolling circle amplification may be a linear RCA (LRCA), exhibiting linear amplification kinetics (e g., RCA using a single specific primer), or may be an exponential RCA (ERCA) exhibiting exponential amplification kinetics. Rolling circle amplification may also be performed using multiple primers (multiply primed rolling circle amplification or MPRCA) leading to hyper-branched concatemers. For example, in a doubleprimed RCA, one primer may be complementary, as in the linear RCA, to the circular nucleicAtorney Docket No.: 051385-638001WO acid template, whereas the other may be complementary to the tandem repeat unit nucleic acid sequences of the RCA product. Consequently, the double-primed RCA may proceed as a chain reaction with exponential (geometric) amplification kinetics featuring a ramifying cascade of multiple-hybridization, primer-extension, and strand-displacement events involving both the primers. This often generates a discrete set of concatemeric, double-stranded nucleic acid amplification products. The rolling circle amplification may be performed in-vitro under isothermal conditions using a suitable nucleic acid polymerase such as a polymerase described herein. RCA may be performed by using any of the DNA polymerases described herein.

[0077] A nucleic acid can be amplified by a thermocycling method or by an isothermal amplification method. In some embodiments a rolling circle amplification method is used. In some embodiments amplification takes place on a solid support (e. , within a flow cell) where a nucleic acid, nucleic acid library or portion thereof is immobilized. In certain sequencing methods, a nucleic acid library is added to a flow cell and immobilized by hybridization to anchors under suitable conditions. This type of nucleic acid amplification is often referred to as solid phase amplification. In some embodiments of solid phase amplification, all or a portion of the amplified products are synthesized by an extension initiating from an immobilized primer. Solid phase amplification reactions are analogous to standard solution phase amplifications except that at least one of the amplification oligonucleotides (e.g., primers) is immobilized on a solid support.

[0078] In some embodiments solid phase amplification includes a nucleic acid amplification reaction including only one species of oligonucleotide primer immobilized to a surface or substrate. In certain embodiments solid phase amplification includes a plurality of different immobilized oligonucleotide primer species. In some embodiments solid phase amplification may include a nucleic acid amplification reaction including one species of oligonucleotide primer immobilized on a solid surface and a second different oligonucleotide primer species in solution. Multiple different species of immobilized or solution-based primers can be used.

[0079] As used herein, the terms “cluster” and “colony” are used interchangeably to refer to a discrete site on a solid support that includes a plurality of immobilized polynucleotides and a plurality of immobilized complementary polynucleotides. The term “clustered array” refers to an array formed from such clusters or colonies. In this context the term “array” is not to beAtorney Docket No.: 051385-638001WO understood as requiring an ordered arrangement of clusters. The term “array” is used in accordance with its ordinary meaning in the art, and refers to a population of different molecules that are attached to one or more solid-phase substrates such that the different molecules can be differentiated from each other according to their relative location. An array can include different molecules that are each located at different addressable features on a solid-phase substrate. The molecules of the array can be nucleic acid primers, nucleic acid probes, nucleic acid templates or nucleic acid enzymes such as polymerases or ligases. Arrays useful in the invention can have densities that ranges from about 2 different features to many millions, billions or higher. The density of an array can be from 2 to as many as a billion or more different features per square cm. For example an array can have at least about 100 features / cm2, at least about 1,000 features / cm2, at least about 10,000 features / cm2, at least about 100,000 features / cm2, at least about 10,000,000 features / cm2, at least about 100,000,000 features / cm2, at least about 1,000,000,000 features / cm2, at least about 2,000,000,000 features / cm2or higher. In embodiments, the arrays have features at any of a variety of densities including, for example, at least about 10 features / cm2, 100 features / cm2, 500 features / cm2, 1,000 features / cm2, 5,000 features / cm2, 10,000 features / cm2, 50,000 features / cm2, 100,000 features / cm2, 1,000,000 features / cm2, 5,000,000 features / cm2, or higher.

[0080] Provided herein are methods and compositions for analyzing a sample (e.g., sequencing nucleic acids within a sample). A sample (e.g., a sample including nucleic acid) can be obtained from a suitable subject. A sample can be isolated or obtained directly from a subject or part thereof. In some embodiments, a sample is obtained indirectly from an individual or medical professional. A sample can be any specimen that is isolated or obtained from a subject or part thereof. A sample can be any specimen that is isolated or obtained from multiple subjects. Nonlimiting examples of specimens include fluid or tissue from a subject, including, without limitation, blood or a blood product e.g., serum, plasma, platelets, buffy coats, or the like), umbilical cord blood, chorionic villi, amniotic fluid, cerebrospinal fluid, spinal fluid, lavage fluid (e.g., lung, gastric, peritoneal, ductal, ear, arthroscopic), a biopsy sample, celocentesis sample, cells (blood cells, lymphocytes, placental cells, stem cells, bone marrow derived cells, embryo or fetal cells) or parts thereof (e.g., mitochondrial, nucleus, extracts, or the like), urine, feces, sputum, saliva, nasal mucous, prostate fluid, lavage, semen, lymphatic fluid, bile, tears, sweat, breast milk, breast fluid, the like or combinations thereof. A fluid or tissue sample from whichAtorney Docket No.: 051385-638001WO nucleic acid is extracted may be acellular (e.g., cell-free). Non-limiting examples of tissues include organ tissues (e.g., liver, kidney, lung, thymus, adrenals, skin, bladder, reproductive organs, intestine, colon, spleen, brain, the like or parts thereof), epithelial tissue, hair, hair follicles, ducts, canals, bone, eye, nose, mouth, throat, ear, nails, the like, parts thereof or combinations thereof. A sample may include cells or tissues that are normal, healthy, diseased (e.g., infected), and / or cancerous (e.g., cancer cells). A sample obtained from a subject may include cells or cellular material (e.g., nucleic acids) of multiple organisms (e.g., virus nucleic acid, fetal nucleic acid, bacterial nucleic acid, parasite nucleic acid).

[0081] In some embodiments, a sample includes one or more nucleic acids, or fragments thereof. A sample can include nucleic acids obtained from one or more subjects. In some embodiments a sample includes nucleic acid obtained from a single subject. In some embodiments, a sample includes a mixture of nucleic acids. A mixture of nucleic acids can include two or more nucleic acid species having different nucleotide sequences, different fragment lengths, different origins (e.g., genomic origins, cell or tissue origins, subject origins, the like or combinations thereof), or combinations thereof. Biological samples can include analytes (e.g., protein, RNA, and / or DNA) embedded in a 3D matrix. In some embodiments, amplicons (e.g., rolling circle amplification products) derived from or associated with analytes (e.g., protein, RNA, and / or DNA) can be embedded in a 3D matrix. In some embodiments, a 3D matrix may comprise a network of natural molecules and / or synthetic molecules that are chemically and / or enzymatically linked, e.g., by crosslinking. In some embodiments, a 3D matrix may comprise a synthetic polymer. In some embodiments, a 3D matrix comprises a hydrogel.

[0082] A subject can be any living or non-living organism, including but not limited to a human, non-human animal, plant, bacterium, fungus, virus or protist. A subject may be any age (e.g., an embryo, a fetus, infant, child, adult). A subject can be of any sex (e.g., male, female, or combination thereof). A subject may be pregnant. In some embodiments, a subject is a mammal. In some embodiments, a subject is a human subject. A subject can be a patient (e.g., a human patient). In some embodiments a subject is suspected of having a genetic variation or a disease or condition associated with a genetic variation.

[0083] The methods and kits of the present disclosure may be applied, mutatis mutandis, to the sequencing of RNA, or to determining the identity of a ribonucleotide.Atorney Docket No.: 051385-638001WO

[0084] As used herein, the term “kit” refers to any delivery system for delivering materials. In the context of reaction assays, such delivery systems include systems that allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, enzymes, etc. in the appropriate containers) and / or supporting materials (e.g., buffers, written instructions for performing the assay, etc.) from one location to another. For example, kits include one or more enclosures (e.g., boxes) containing the relevant reaction reagents and / or supporting materials. As used herein, the term “fragmented kit” refers to a delivery system including two or more separate containers that each contain a subportion of the total kit components. The containers may be delivered to the intended recipient together or separately. For example, a first container may contain an enzyme for use in an assay, while a second container contains oligonucleotides. In contrast, a “combined kit” refers to a delivery system containing all of the components of a reaction assay in a single container (e.g., in a single box housing each of the desired components). The term “kit” includes both fragmented and combined kits.II. Compositions & Kits

[0085] Provided herein are compositions including mutant polypeptides (i.e., mutant polymerases) exhibiting increased incorporation of nucleotides and solubility and stability relative to a control (e.g., wildtype polymerase). Mutations in the polymerases described herein variously include one or more changes to amino acid residues present in the polypeptide sequence. Additions, substitutions, or deletions are all examples of mutations that are used to generate mutant polypeptides. Substitutions in some instances include the exchange of one amino acid for an alternative amino acid, and such alternative amino acids differ from the original amino acid with regard to size, shape, conformation, or chemical structure. Mutations in some instances are conservative or non-conservative. Conservative mutations comprise the substitution of an amino acid with an amino acid that possesses similar chemical properties. Additions often comprise the insertion of one or more amino acids at the N-terminal, C-terminal, or internal positions of the polypeptide. In some embodiments, additions include fusion polypeptides, wherein one or more additional polypeptides (i.e., a polypeptide from a different source) is connected (e.g., covalently linked to the N- or C- terminus) to the polymerase as described herein. Such additional polypeptides include domains with additional activity, or sequences with additional function (e.g., improve expression, aid purification, improve solubility, attach to a solid support, or other function).Atorney Docket No.: 051385-638001WO

[0086] Wild type polymerase sequences are typical initial sequences for protein or enzyme engineering to generate mutant polymerases. In some embodiments, a polypeptide differs from a wild-type sequence (naturally occurring) by at least one amino acid. Any number of mutations is introduced into a polypeptide or portion of a polypeptide described herein, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more than 50 mutations. In embodiments, the polymerase differs from a wild-type sequence by at least two amino acids. In embodiments, the polymerase differs from a wild-type sequence by at least three, four, five, or at least six amino acids.

[0087] In an aspect is provided a polymerase including one or more amino acid substitutions as described herein. In embodiments, the polymerase includes an amino acid sequence that is at least 50% identical to SEQ ID NO:1 (e.g., 50% identical, 60% identical, 70% identical, 80% identical, 90% identical, or greater); and includes an amino acid substitution, wherein the amino acid substitution is as described herein. In embodiments, the polymerase includes an amino acid sequence that is at least 50%, 60%, 70%, 80%, 85%, 90%, 95%, or at least 99% identical to SEQ ID NO: 1; and includes an amino acid substitution as described herein. In any of the embodiments herein, the isolated recombinant polymerase can be at least 85%, at least 90%, or at least 95% soluble. In embodiments, the polymerase includes an amino acid sequence that is at least 50%, 60%, 70%, 80%, 85%, 90%, 95%, or at least 99% identical to SEQ ID NO: 1, SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO 26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO 33, SEQ ID NO:34, SEQ TD NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, or SEQ ID NO:52; and includes an amino acid substitution as described herein and / or one or more monovalent polyethylene glycol moieties covalently attached to the polymerase.

[0088] In an aspect is provided an enzyme (e.g., polymerase) including an amino acid sequence that is at least 80% identical to SEQ ID NO: 1. In embodiments, the polymeraseAtorney Docket No.: 051385-638001WO includes a first mutation at amino acid position corresponding to position 8. In embodiments, the first mutation is lysine, alanine, histidine, asparagine, glutamine, or arginine. In embodiments, the polymerase includes a first mutation at amino acid position corresponding to position 270, 309, 93, 178, 470, or 369.

[0089] In embodiments, the polymerase includes the amino acid substitution M8A. In embodiments, the polymerase includes the amino acid substitution M8H. In embodiments, the polymerase includes the amino acid substitution M8K. In embodiments, the polymerase includes the amino acid substitution M8N. In embodiments, the polymerase includes the amino acid substitution M8Q. In embodiments, the polymerase includes the amino acid substitution M8R.

[0090] In embodiments, the polymerase includes a mutation at amino acid position corresponding to position 270, 309, 93, 178, 470, or 369. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 270. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 309. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 93. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 178. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 470. In embodiments, the polymerase includes a mutation at the amino acid position corresponding to position 369.

[0091] In embodiments, the mutation is a mutation at amino acid position corresponding to position 270, wherein the mutation is arginine, lysine, histidine, or glutamine. In embodiments, the mutation at amino acid position 270 is arginine, lysine, histidine, or glutamine. In embodiments, the mutation at amino acid position 270 is arginine. In embodiments, the mutation at amino acid position 270 is lysine. In embodiments, the mutation at amino acid position 270 is histidine. In embodiments, the mutation at amino acid position 270 is glutamine.

[0092] In embodiments, the mutation is a mutation at amino acid position corresponding to position 309, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine. In embodiments, the mutation at amino acid position 309 is aspartic acid, glutamic acid, asparagine, or glutamine. In embodiments, the mutation at amino acid position 309 is aspartic acid. In embodiments, the mutation at amino acid position 309 is glutamic acid. InAtorney Docket No.: 051385-638001WO embodiments, the mutation at amino acid position 309 is asparagine. In embodiments, the mutation at amino acid position 309 is glutamine.

[0093] In embodiments, the mutation is a mutation at amino acid position corresponding to position 93, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine. In embodiments, the mutation at amino acid position 93 is aspartic acid, glutamic acid, asparagine, or glutamine. In embodiments, the mutation at amino acid position 93 is aspartic acid. In embodiments, the mutation at amino acid position 93 is glutamic acid. In embodiments, the mutation at amino acid position 93 is asparagine. In embodiments, the mutation at amino acid position 93 is glutamine.

[0094] In embodiments, the mutation is a mutation at amino acid position corresponding to position 178, wherein the second mutation is arginine, lysine, histidine, or glutamine. In embodiments, the mutation at amino acid position 178 is arginine, lysine, histidine, or glutamine. In embodiments, the mutation at amino acid position 178 is arginine. In embodiments, the mutation at amino acid position 178 is lysine. In embodiments, the mutation at amino acid position 178 is histidine. In embodiments, the mutation at amino acid position 178 is glutamine.

[0095] In embodiments, the mutation is a mutation at amino acid position corresponding to position 470, wherein the second mutation is lysine, arginine, histidine, or glutamine. In embodiments, the mutation at amino acid position 470 is lysine, arginine, histidine, or glutamine. In embodiments, the mutation at amino acid position 470 is lysine. In embodiments, the mutation at amino acid position 470 is arginine. In embodiments, the mutation at amino acid position 470 is histidine. In embodiments, the mutation at amino acid position 470 is glutamine.

[0096] In embodiments, the mutation is a mutation at amino acid position corresponding to position 369, wherein the second mutation is lysine, arginine, histidine, or glutamine. In embodiments, the mutation at amino acid position 369 is lysine, arginine, histidine, or glutamine. In embodiments, the mutation at amino acid position 369 is lysine. In embodiments, the mutation at amino acid position 369 is arginine. In embodiments, the mutation at amino acid position 369 is histidine. In embodiments, the mutation at amino acid position 369 is glutamine.

[0097] In embodiments, the polymerase includes a glutamic acid, aspartic acid, alanine, glycine, or threonine at an amino acid position corresponding to position 536 of SEQ ID NO: 1.Attorney Docket No.: 051385-638001WO

[0098] In embodiments, the polymerase includes a polyethylene glycol moiety covalently attached to the polymerase. In embodiments, the polymerase includes a monovalent polyethylene glycol moiety covalently attached to the polymerase. In embodiments, the polyethylene glycol moiety is attached to the polymerase via a bioconjugate reactive linker. In embodiments, the bioconjugate reactive linker is formed by a reaction between an amine moiety and an N- Hydroxysuccinimide (NHS) ester moiety. In embodiments, the polyethylene glycol moiety has the formula:10, 12, 18, or 24. In embodiments, the polyethylene glycol moiety is attached to the polymerase via a bioconjugate reactive linker formed via the bioconjugation reaction depicted in Scheme 1.

[0099] Scheme 1. Bioconjugation of polyethylene glycol to an amine moiety, where n is as described herein and Rp is the remainder of the protein (e.g., the polymerase).

[0100] In embodiments, the polyethylene glycol moiety has the formula:10, 12, 18, or 24. In embodiments, the polyethylene glycol moiety is attached to the polymerase via a bioconjugate reactive linker formed via the bioconjugation reaction depicted in Scheme 2.Atorney Docket No.: 051385-638001WO

[0101] Scheme 2. Bioconjugation of maleimide-polyethylene glycol to an thiol moiety, where n is as described herein and Rp is the remainder of the protein (e.g., the polymerase).

[0102] In embodiments, two polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, three polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, four polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, five polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 5 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 6 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 7 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 8 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 9 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 10 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 11 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 12 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 13 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 14 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 15 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 16 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 17 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 18 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 19 polyethylene glycol moieties are covalently attached to the polymerase. In embodiments, a range of 2 to 20 polyethylene glycol moieties are covalently attached to the polymerase.

[0103] In embodiments, the polymerase includes M8K and V270R. In embodiments, the polymerase includes M8K and F309D. In embodiments, the polymerase includes M8K and I93E. In embodiments, the polymerase includes M8K and I93E and V270R. In embodiments, the polymerase includes M8K and L178R and F309K. In embodiments, the polymerase includesAtorney Docket No.: 051385-638001WOM8K and L178R and V470K. In embodiments, the polymerase includes M8K and L178R and Y369K. In embodiments, the polymerase includes M8K and I93E and V270R and F309D.

[0104] In another aspect is provided a polymerase including an amino acid sequence that is at least 80% identical to SEQ ID NO: 1 and including one or more monovalent polyethylene glycol moieties covalently attached to the polymerase. In embodiments, the polymerase includes one or more mutations as described herein (e.g., M8K). In embodiments, the polymerase includes an amino acid sequence that is at least 50%, 60%, 70%, 80%, 85%, 90%, 95%, or at least 99% identical to SEQ ID NO: 1; and includes an amino acid substitution as described herein.

[0105] In embodiments, mutations may include substitution of the amino acid in the parent amino acid sequences with an amino acid, which is not the parent amino acid. In embodiments, the mutations may result in conservative amino acid changes. In embodiments, non-polar amino acids may be converted into polar amino acids (threonine, asparagine, glutamine, cysteine, tyrosine, aspartic acid, glutamic acid or histidine) or the parent amino acid may be changed to an alanine.

[0106] In embodiments, the polymerase is a recombinant DNA polymerase. In embodiments, the recombinant DNA polymerase is homologous to a 029 DNA polymerase or mutant thereof, a Taq polymerase, an exonuclease deficient Taq polymerase, a DNA Pol I polymerase, a T7 polymerase, an RB69 polymerase, a T5 polymerase, or a polymerase corresponding to a Klenow fragment of a DNA Pol I polymerase. For example, the recombinant DNA polymerase can be homologous to a wild-type or exonuclease deficient 029 DNA polymerase, e.g., as described in U.S. Pat. Nos. 5,001,050, 5,198,543, or 5,576,204. Similarly, the recombinant DNA polymerase can be homologous to 029, B103, GA-1, PZA, 015, BS32, M2Y, Nf, Gl, Cp-1, PRD1, PZE, SFS, Cp-5, Cp-7, PR4, PR5, PR722, or L17, or the like. For nomenclature, see also, Meijer et al. (2001) “029 Family of Phages” Microbiology and Molecular Biology Reviews, 65(2):261-287, which is incorporated herein by reference for all purposes.

[0107] In an aspect is provided a composition including an enzyme as described herein and a container. In embodiments, the container is a tube, such as a microcentrifuge tube, configured to securely hold and store the enzyme composition for laboratory or diagnostic use. In embodiments, the container is a vial, designed with a snap-cap or crimp seal to provide anAtorney Docket No.: 051385-638001WO airtight environment and prevent contamination. In embodiments, the container is a screw-cap container, equipped with a threaded closure to ensure a secure seal and maintain the stability of the enzyme composition. In embodiments, the container is a cryovial, suitable for low- temperature storage of the enzyme to preserve its activity over extended periods. In embodiments, the container is a dropper bottle, allowing precise dispensing of the enzyme composition for applications requiring controlled volumes. In embodiments, the container is a single-use ampoule, pre-fdled and sealed to ensure sterility and convenience for immediate use. In embodiments, the container is a multi-compartment container, providing separate chambers for the enzyme and other reagents, enabling mixing immediately before use. In embodiments, the container is a syringe-style container, designed for easy and precise delivery of the enzyme composition in liquid form. In embodiments, the container is a pipette-compatible reservoir, facilitating efficient and accurate pipetting of the enzyme solution during experimental workflows. In embodiments, the container is a pressurized container, suitable for dispensing aerosolized forms of the enzyme composition for specialized applications.

[0108] In embodiments, the composition includes a plurality of native DNA nucleotides including a plurality of dATP (2’-deoxyadenosine-5’-triphosphate) nucleotides, dCTP (2’- deoxycytidine-5’ -triphosphate) nucleotides, dTTP (2’-deoxythymidine-5’-triphosphate) nucleotides, and dGTP (2’-deoxyguanosine-5’-triphosphate) nucleotides. In embodiments, the composition includes a plurality of dATP (2’ -deoxyadenosine-5’ -triphosphate) nucleotides, dCTP (2’-deoxycytidine-5’-triphosphate) nucleotides, dTTP (2’-deoxythymidine-5’- triphosphate) nucleotides, and dGTP (2 ’-deoxyguanosine-5’ -triphosphate) nucleotides. In embodiments, the composition includes a plurality of native DNA nucleotides including a plurality of dATP nucleotides, dCTP nucleotides, dTTP nucleotides, or dGTP nucleotides. In embodiments, the composition includes a plurality of dATP nucleotides, dCTP nucleotides, dTTP nucleotides, or dGTP nucleotides. In embodiments, the composition includes a plurality of dATP nucleotides. In embodiments, the composition includes a plurality of dCTP nucleotides. In embodiments, the composition includes a plurality of dTTP nucleotides. In embodiments, the composition includes a plurality of dGTP nucleotides. In embodiments, the composition includes a plurality of dUTP (2’ -deoxy cytidine-5’ -triphosphate) nucleotides. In embodiments, the composition consists of a plurality of dA nucleotides, a plurality of dC nucleotides, a plurality of dT nucleotides, and a plurality of dG nucleotides. In embodiments, the composition consists of aAtorney Docket No.: 051385-638001WO plurality of dA nucleotides, a plurality of dC nucleotides, a plurality of dT nucleotides, a plurality of dU nucleotides, and a plurality of dG nucleotides.

[0109] In embodiments, the composition includes a plurality of native RNA nucleotides (i.e., native ribonucleotides) including a plurality of ATP (adenosine-5’ -triphosphate) nucleotides, CTP (cytidine-5’ -triphosphate) nucleotides, UTP (uridine-5’ -triphosphate) nucleotides, and GTP (guanosine-5 ’-triphosphate) nucleotides. In embodiments, the composition includes a plurality of native RNA nucleotides including a plurality of ATP nucleotides, CTP nucleotides, UTP nucleotides, or GTP nucleotides. In embodiments, the composition includes a plurality of ATP nucleotides. In embodiments, the composition includes a plurality of CTP nucleotides. In embodiments, the composition includes a plurality of UTP nucleotides. In embodiments, the composition includes a plurality of GTP nucleotides. In embodiments, the composition consists of a plurality of A ribonucleotides, a plurality of C ribonucleotides, a plurality of U ribonucleotides, and a plurality of G ribonucleotides.

[0110] In embodiments, the composition includes a plurality of cleavable site nucleotides. The term “cleavable site nucleotide” refers to a nucleotide that allows for controlled cleavage of the polynucleotide strand following contact with a cleaving agent (e.g., uracil DNA glycosylase (UDG)). Additional examples of cleavable site nucleotides include deoxyuracil triphosphates (dUTPs), deoxy-8-oxo-guanine triphosphates (d-8-oxoGs), methylated nucleotides, or ribonucleotides. In embodiments, the cleavable site nucleotide is dUTP and the cleaving agent is UDG. In embodiments, the cleavable site nucleotide is a ribonucleotide and the cleaving agent is RNase. In embodiments, the cleavable site nucleotide is 8-oxo-7,8-dihydroguanine (8oxoG) and the cleaving agent is formamidopyrimidine DNA glycosylase (Fpg). In embodiments, the cleavable site nucleotide is 5 -methyl cytosine and the cleaving agent is McrBC.[OlH] In embodiments, the cleavable site includes one or more deoxyuracil triphosphates (dUTPs), deoxy-8-oxo-guanine triphosphates (d-8-oxoGs), methylated nucleotides, or ribonucleotides. In embodiments, the cleavable site includes one or more deoxyuracil triphosphates (dUTPs). In embodiments, the cleavable site includes one or more deoxy-8-oxo- guanine triphosphates (d-8-oxoGs). In embodiments, the cleavable site includes one or more methylated nucleotides. In embodiments, the cleavable site includes one or more ribonucleotides. The one or more cleavable sites may include a modified nucleotide, ribonucleotide, or aAtorney Docket No.: 051385-638001WO sequence containing a modified or unmodified nucleotide that is specifically recognized by a cleavage agent. The cleavable site(s) may be deoxyuracil triphosphate (dUTP), deoxy-8-Oxo- guanine triphosphate (d-8-oxoG), or other modified nucleotide(s), such as those described, for example, in US 2012 / 0238738, which is incorporated herein by reference for all purposes, and include modified ribonucleotides and deoxyribonucleotides including abasic sugar phosphates, inosine, deoxyinosine, 2,6-diamino-4-hydroxy-5-formamidopyrimidine (foramidopyrimidine- guanine, (fapy)-guanine), 8-oxoadenine, l,N6-ethenoadenine, 3-methyladenine, 4,6-diamino-5- formamidopyrimidine, 5,6-dihydrothymine, 5,6-dihydroxyuracil, 5-formyluracil, 5-hydroxy-5- methylhydanton, 5-hydroxycytosine, 5-hydroxymethylcystosine, 5-hydroxymethyluracil, 5- hydroxyuracil, 6-hydroxy-5,6-dihydrothymine, 6-methyladenine, 7,8-dihydro-8-oxoguanine (8- oxoguanine), 7-methylguanine, aflatoxin Bl -fapy-guanine, fapy-adenine, hypoxanthine, methyl- fapy-guanine, methyltartonylurea and thymine glycol. In embodiments, the cleavable site includes an abasic site, deoxyuracil triphosphate (dUTP), deoxy-8-Oxo-guanine triphosphate (d- 8-oxoG), methylated nucleotide, ribonucleotide, or a sequence containing a modified or unmodified nucleotide that is specifically recognized by a cleaving agent. In embodiments, the cleavable site includes one or more ribonucleotides. In embodiments, the cleavable site includes 2 to 5 ribonucleotides. In embodiments, the cleavable site includes one ribonucleotide. In embodiments, the cleavable sites can be cleaved at or near a modified nucleotide or bond by enzymes or chemical reagents, collectively referred to here and in the claims as “cleaving agents.” Examples of cleaving agents include DNA repair enzymes, glycosylases, DNA cleaving endonucleases, or ribonucleases. For example, cleavage at dUTP may be achieved using uracil DNA glycosylase and endonuclease VIII (USER™, NEB, Ipswich, Mass.), as described in U.S. Pat. No. 7,435,572. In embodiments, when the modified nucleotide is a ribonucleotide, the cleavable site can be cleaved with an endoribonuclease. In embodiments, cleaving an extension product includes contacting the cleavable site with a cleaving agent, wherein the cleaving agent includes a reducing agent, sodium periodate, RNase, formamidopyrimidine DNA glycosylase (Fpg), endonuclease, restriction enzyme, or uracil DNA glycosylase (UDG). In embodiments, the cleaving agent is an endonuclease enzyme such as nuclease Pl, AP endonuclease, T7 endonuclease, T4 endonuclease IV, Bal 31 endonuclease, Endonuclease I (endo I), Micrococcal nuclease, Endonuclease II (endo VI, exo III), nuclease BAL-31 or mung bean nuclease. In embodiments, the cleaving agent includes a restriction endonuclease, including, for example aAtorney Docket No.: 051385-638001WO type IIS restriction endonuclease. In embodiments, the cleaving agent is an exonuclease (e.g., RecBCD), restriction nuclease, endoribonuclease, exoribonuclease, or RNase (e.g., RNAse I, II, or III). In embodiments, the cleaving agent is a restriction enzyme. In embodiments, the cleaving agent includes a glycosylase and one or more suitable endonucleases. In embodiments, cleavage is performed under alkaline (e.g., pH greater than 8) buffer conditions at between 40°C to 80°C.

[0112] In an aspect is provided a kit. In embodiments, the kit includes a polymerase as described herein. In embodiments, the kit includes the reagents and containers useful for performing the methods as described herein. Generally, the kit includes one or more containers providing a composition and one or more additional reagents (e.g., a buffer suitable for polynucleotide extension). The kit may also include a template nucleic acid (DNA and / or RNA), one or more primer polynucleotides, nucleoside triphosphates (including, for example, deoxyribonucleotides, ribonucleotides, and / or modified nucleotides), buffers, salts, and / or labels (e.g., fluorophores).

[0113] In embodiments, the kit includes a solid support (i.e., a substrate), and reagents for sample preparation and purification, amplification, and / or sequencing (e.g., one or more sequencing reaction mixtures). In embodiments, amplification reagents and other reagents may be provided in lyophilized form. In embodiments, amplification reagents and other reagents may be provided in a container which the lyophilized reagent may be reconstituted.

[0114] In embodiments, the kit includes components useful for circularizing template polynucleotides using a ligation enzyme (e.g., CircLigase™ enzyme, Taq DNA Ligase, HiFi Taq DNA Ligase, T4 ligase, SplintR® ligase, or Ampligase® DNA Ligase). For example, such a kit further includes the following components: (a) reaction buffer for controlling pH and providing an optimized salt composition for a ligation enzyme (e.g., CircLigase™enzyme, Taq DNA Ligase, HiFi Taq DNA Ligase, T4 ligase, SplintR® ligase, or Ampligase® DNA Ligase), and (b) ligation enzyme cofactors. In embodiments, the kit further includes instructions for use thereof. CircLigase™ and Ampligase® are trademarks of Epicentre. SplintR® is a registered trademark of NEB.

[0115] In embodiments, kits described herein include a polymerase. In embodiments, the polymerase is a DNA polymerase. In embodiments, the kit includes a strand-displacing polymerase. In embodiments, the polymerase is a DNA polymerase. In embodiments, the DNAAtorney Docket No.: 051385-638001WO polymerase is a thermophilic nucleic acid polymerase. In embodiments, the DNA polymerase is a modified archaeal DNA polymerase. In embodiments, the kit includes a strand-displacing polymerase, such as a polymerase as described herein.

[0116] In embodiments, the kit includes a sequencing solution, hybridization solution, and / or extension solution. In embodiments, the sequencing solution include labeled nucleotides including differently labeled nucleotides, wherein the label (or lack thereof) identifies the type of nucleotide. For example, each adenine nucleotide, or analog thereof; a thymine nucleotide; a cytosine nucleotide, or analog thereof; and a guanine nucleotide, or analog thereof may be labeled with a different fluorescent label.

[0117] In embodiments, the composition and / or kit includes a buffered solution. Typically, the buffered solutions contemplated herein are made from a weak acid and its conjugate base or a weak base and its conjugate acid. For example, sodium acetate and acetic acid are buffer agents that can be used to form an acetate buffer. Other examples of buffer agents that can be used to make buffered solutions include, but are not limited to, Tris, bicine, tricine, HEPES, TES, MOPS, MOPSO and PIPES. Additionally, other buffer agents that can be used in enzyme reactions, hybridization reactions, and detection reactions are known in the art. In embodiments, the buffered solution can include Tris. With respect to the embodiments described herein, the pH of the buffered solution can be modulated to permit any of the described reactions. In some embodiments, the buffered solution can have a pH greater than pH 7.0, greater than pH 7.5, greater than pH 8.0, greater than pH 8.5, greater than pH 9.0, greater than pH 9.5, greater than pH 10, greater than pH 10.5, greater than pH 11.0, or greater than pH 11.5. In other embodiments, the buffered solution can have a pH ranging, for example, from about pH 6 to about pH 9, from about pH 8 to about pH 10, or from about pH 7 to about pH 9. In embodiments, the buffered solution can include one or more divalent cations. Examples of divalent cations can include, but are not limited to, Mg2+, Mn2+, Zn2+, and Ca2+. In embodiments, the buffered solution can contain one or more divalent cations at a concentration sufficient to permit hybridization of a nucleic acid. In embodiments, the buffered solution can contain one or more divalent cations at a concentration sufficient to permit hybridization of a nucleic acid. In some embodiments, a concentration can be more than about 1 pM, more than about 2 pM, more than about 5 pM, more than about 10 pM, more than about 25 pM, more than about 50 pM, more thanAtorney Docket No.: 051385-638001WO about 75 yM. more than about 100 yM, more than about 200 yM, more than about 300 yM, more than about 400 yM, more than about 500 yM, more than about 750 yM, more than about 1 mM, more than about 2 mM, more than about 5 mM, more than about 10 mM, more than about 20 mM, more than about 30 mM, more than about 40 mM, more than about 50 mM, more than about 60 mM, more than about 70 mM, more than about 80 mM, more than about 90 mM, more than about 100 mM, more than about 150 mM, more than about 200 mM, more than about 250 mM, more than about 300 mM, more than about 350 mM, more than about 400 mM, more than about 450 mM, more than about 500 mM, more than about 550 mM, more than about 600 mM, more than about 650 mM, more than about 700 mM, more than about 750 mM, more than about 800 mM, more than about 850 mM, more than about 900 mM, more than about 950 mM or more than about 1 M. In embodiments, the buffered solution includes about 10 mM Tris, about 20 mM Tris, about 30 mM Tris, about 40 mM Tris, or about 50 mM Tris. In embodiments the buffered solution includes about 50 mM NaCl, about 75 mM NaCl, about 100 mM NaCl, about 125 mM NaCl, about 150 mM NaCl, about 200 mM NaCl, about 300 mM NaCl, about 400 mM NaCl, or about 500 mM NaCl. In embodiments, the buffered solution includes about 0.05 mM EDTA, about 0.1 mM EDTA, about 0.25 mM EDTA, about 0.5 mM EDTA, about 1.0 mM EDTA, about 1.5 mM EDTA or about 2.0 mM EDTA. In embodiments, the buffered solution includes about 0.01% Triton™ X-100, about 0.025% Triton™ X-100, about 0.05% Triton™ X-100, about 0.1% Triton™ X-100, or about 0.5% Triton™ X-100. In embodiments, the buffered solution includes 20 mM Tris pH 8.0, 100 mM NaCl, 0.1 mM EDTA, 0.025% Triton™ X-100. In embodiments, the buffered solution includes 20 mM Tris pH 8.0, 150 mM NaCl, 0.1 mM EDTA, 0.025% Triton™ X-100. In embodiments, the buffered solution includes 20 mM Tris pH 8.0, 300 mM NaCl, 0.1 mM EDTA, 0.025% Triton™ X-100. In embodiments, the buffered solution includes 20 mM Tris pH 8.0, 400 mM NaCl, 0.1 mM EDTA, 0.025% Triton™ X-100. In embodiments, the buffered solution includes 20 mM Tris pH 8.0, 500 mM NaCl, 0.1 mM EDTA, 0.025% Triton™ X-100. Triton™ is a registered trademark of Dow Chemical Company.

[0118] In embodiments, the composition has a pH of about 8.5. In some cases, the composition includes between about 5 mM and about 10 mM MgCh. In embodiments, the composition includes between about 0.1 mM and about 0.2 mM dNTPs. In some instances, the composition includes between about 1 mM and about 4 mM DTT. In embodiments, the composition includes between 1 mg / mL and 3 mg / mL BSA (e.g., about 2 mg / mL BSA). In embodiments, theAtorney Docket No.: 051385-638001WO composition includes between 5 mM and 15 mM NH4SO4 (e.g., about 10 mM NH4SO4). In some embodiments the composition includes between about 0.05 pM and about 1 pM of the polymerase. In embodiments, the composition includes between about 0.5 pM and about 1 pM, between about 0.2 pM and about 1 pM, between about 0.8 pM and about 1 pM, or between about 0.05 UM and about 0.5 pM of the polymerase. In embodiments, the composition includes at least 1, 10, 50, or 100 UM of the polymerase. In embodiments, the composition includes between 10 and 50, between 10 and 100, between 50 and 100, between 25 and 50, or between 25 and 100 pM of the polymerase. In embodiments herein, the polymerase may be at least 85%, at least 90%, or at least 95% soluble in the composition. In embodiments herein, the polymerase may be at least 85%, at least 90%, or at least 95% soluble in the composition at 20° C., 30° C., 37° C., 40° C., or 42° C. In embodiments, the composition has a pH of about 8.5 and includes 10 mM MgCL, 10 mM NH4SO4, 4 mM DTT, and 0.2 mg / mL BSA. In embodiments, the composition includes 0.2 mM dNTPs. In embodiments, the composition has a pH of about 8.5 and includes 10 mM MgCL, 10 mM NH4SO4, 1 mM DTT, and 0.2 mg / mL BSA. In embodiments, the composition further includes 0.1 mM dNTPs. In embodiments, the composition has a pH of about 8.5 and includes 5 mM MgC12, 10 mM NH4SO4, 1 mM DTT, and 0.2 mg / mL BSA. In embodiments, the composition further includes 0.2 mM dNTPs.

[0119] In embodiments, the kit includes, without limitation, nucleic acid primers, probes, adapters, enzymes (e.g., an enzyme as described herein) and are each packaged in a container, such as, without limitation, a vial, tube or bottle, in a package suitable for commercial distribution, such as, without limitation, a box, a sealed pouch, a blister pack and a carton. The package typically contains a label or packaging insert indicating the uses of the packaged materials. As used herein, “packaging materials” includes any article used in the packaging for distribution of reagents in a kit, including without limitation containers, vials, tubes, bottles, pouches, blister packaging, labels, tags, instruction sheets and package inserts.

[0120] In addition to the above components, the subject kits may further include instructions for practicing the subject methods. These instructions may be present in the subject kits in a variety of forms, one or more of which may be present in the kit. One form in which these instructions may be present is as printed information on a suitable medium or substrate, e.g., a piece or pieces of paper on which the information is printed, in the packaging of the kit, in aAtorney Docket No.: 051385-638001WO package insert, etc. Yet another means would be a computer readable medium, e g., diskette, CD, digital storage medium, etc., on which the information has been recorded. Yet another means that may be present is a website address which may be used via the Internet to access the information at a removed site. Any convenient means may be present in the kits.

[0121] In another aspect is provided a polynucleotide sequence encoding any of the polymerases described herein. In embodiments, the polynucleotide sequences encoding the polymerase or a portion thereof is codon-optimized for a host cell or a recombinant expression system to generate the polymerase. Codon optimization typically involves selecting codons based on the abundance of corresponding transfer RNAs in the host cell to prevent overloading or limiting resources. Since most amino acids are encoded by multiple codons, optimization aligns codon usage with the host cell’s preferences to improve protein expression and reduce potential immunogenicity. Codon selection aims to maintain the amino acid sequence, often through silent mutations that preserve the encoded protein. In most cases, modifying the third nucleotide of a codon does not alter the resulting amino acid, enabling such optimizations without impacting the polymerase’s structure or function. In embodiments, the recombinant nucleic acid molecule further includes a transcription regulatory sequence operatively linked with the polynucleotide. In embodiments, the transcription regulatory sequence includes a promoter selected from among a bacterial, viral, and mammalian promoter. In embodiments, the promoter is a non-viral promoter or a viral promoter, such as a cytomegalovirus (CMV) promoter, an SV40 promoter, an RSV promoter, and a promoter found in the long-terminal repeat of the murine stem cell virus.TTI. Methods

[0122] In an aspect is provided a method of incorporating a nucleotide into a primed nucleic acid template (e.g., a primer hybridized to a template nucleic acid). In embodiments, the method includes combining in a reaction vessel: (i) a primer hybridized to a nucleic acid template, (ii) a nucleotide solution including a plurality of nucleotides, and (iii) a polymerase, wherein the polymerase is a polymerase as described herein.

[0123] In embodiments, the template polynucleotide includes genomic DNA, complementary DNA (cDNA), cell-free DNA (cfDNA), messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), cell-free RNA (cfRNA), or noncoding RNA (ncRNA). InAtorney Docket No.: 051385-638001WO embodiments, the template polynucleotide includes double-stranded DNA. Tn embodiments, the method of forming the template polynucleotide includes ligating a hairpin adapter to an end of a linear polynucleotide. In embodiments, the method of forming the template polynucleotide includes ligating hairpin adapters to both ends of the linear polynucleotide. In embodiments, the method of forming the template polynucleotide includes ligating a Y-shaped adapter to an end of a linear polynucleotide. In embodiments, the method of forming the template polynucleotide includes ligating a Y-shaped adapter to both ends of a linear polynucleotide. In embodiments, the template polynucleotide is a circular polynucleotide. In embodiments, the circular polynucleotide includes a copy of a gene sequence. In embodiments, the circular polynucleotide includes a copy of an RNA sequence.

[0124] In embodiments, the template polynucleotide is about 100 to 1000 nucleotides in length. In embodiments, the template polynucleotide is about 350 nucleotides in length. In embodiments, the template polynucleotide is about 10, 20, 50, 100, 150, 200, 300, or 500 nucleotides in length. The template polynucleotide molecules can vary length, such as about 100- 300 nucleotides long, about 300-500 nucleotides long, or about 500-1000 nucleotides long. In embodiments, the template polynucleotide molecular is about 100-1000 nucleotides, about 150- 950 nucleotides, about 200-900 nucleotides, about 250-850 nucleotides, about 300-800 nucleotides, about 350-750 nucleotides, about 400-700 nucleotides, or about 450-650 nucleotides. In embodiments, the template polynucleotide molecule is about 150 nucleotides. In embodiments, the template polynucleotide is about 100-1000 nucleotides long. In embodiments, the template polynucleotide is about 100-300 nucleotides long. In embodiments, the template polynucleotide is about 300-500 nucleotides long. In embodiments, the template polynucleotide is about 500-1000 nucleotides long. In embodiments, the template polynucleotide molecule is about 100 nucleotides. In embodiments, the template polynucleotide molecule is about 300 nucleotides. In embodiments, the template polynucleotide molecule is about 500 nucleotides. In embodiments, the template polynucleotide molecule is about 1000 nucleotides.

[0125] In embodiments the template polynucleotide (e.g., genomic template DNA) is first treated to form single-stranded linear fragments (e.g., ranging in length from about 50 to about 600 nucleotides). Treatment typically entails fragmentation, such as by chemical fragmentation, enzymatic fragmentation, or mechanical fragmentation, followed by denaturation to produceAtorney Docket No.: 051385-638001WO single-stranded DNA fragments. In embodiments, the template polynucleotide includes an adapter. The adaptor may have other functional elements including tagging sequences (i.e., a barcode), attachment sequences, palindromic sequences, restriction sites, sequencing primer binding sites, functionalization sequences, and the like. Barcodes can be of any of a variety of lengths. In embodiments, the primer includes a barcode that is 10-50, 20-30, or 4-12 nucleotides in length. In embodiments, the adapter includes a primer binding sequence that is complementary to at least a portion of a primer (e.g., a sequencing primer). Primer binding sites can be of any suitable length. In embodiments, a primer binding site is about or at least about 10, 15, 20, 25, 30, or more nucleotides in length. In embodiments, a primer binding site is 10-50, 15-30, or 20- 25 nucleotides in length.

[0126] In embodiments, the template polynucleotide is single-stranded DNA, double-stranded DNA, single-stranded RNA, or double-stranded RNA. In embodiments, the template is singlestranded DNA or single-stranded RNA and is about 10, 20, 50, 100, 150, 200, 300, 500, or 1000 nucleotides in length. In embodiments, the template polynucleotide is double-stranded DNA or double-stranded RNA and is about 10, 20, 50, 100, 150, 200, 300, 500, or 1000 base pairs in length. In embodiments, the template polynucleotide includes single-stranded circular DNA. In embodiments, the template polynucleotide is single-stranded circular DNA. In embodiments, the template polynucleotide includes double-stranded DNA. In embodiments, the template polynucleotide is double-stranded DNA. In embodiments, the template polynucleotide includes single-stranded RNA. In embodiments, the template polynucleotide is single-stranded RNA. In embodiments, the template polynucleotide includes double-stranded RNA. In embodiments, the template polynucleotide is double-stranded RNA. In embodiments, the template polynucleotide includes primer binding sequences that are complementary to one or more substrate-bound primers. In embodiments, the substrate-bound primers are immobilized to a substrate by a covalent linker. In embodiments, the substrate-bound primers are immobilized to a solid support at the 5' end, preferably via a covalent attachment. In embodiments, the template polynucleotide includes primer binding sequences that are complementary to one or more immobilized primers. In embodiments, the immobilized primers are immobilized to a matrix (e.g., a matrix in a cell) by a covalent linker. In embodiments, the immobilized primers are attached to a matrix at the 5' end, preferably via a covalent attachment. In embodiments, at least some of the substrate-bound primers are phosphorothioated primers. In embodiments, a fraction of the total of the substrate-Atorney Docket No.: 051385-638001WO bound primers are phosphorothioated primers. In embodiments, at least some of the immobilized primers are phosphorothioated primers. In embodiments, a fraction of the total of the immobilized primers are phosphorothioated primers.

[0127] In another aspect is provided a method of amplifying a nucleic acid sequence, the method including hybridizing a nucleic acid template to a primer to form a primer-template hybridization complex; contacting the primer-template hybridization complex with a DNA polymerase and a plurality of nucleotides, wherein the DNA polymerase is the polymerase is as described herein; and subjecting the primer-template hybridization complex to conditions which enable the polymerase to incorporate one or more nucleotides into the primer-template hybridization complex to generate amplification products, thereby amplifying a nucleic acid sequence.

[0128] In embodiments, the nucleic acid template is DNA, RNA, or analogs thereof. In embodiments, the nucleic acid template includes a primer hybridized to the template. In embodiments, the nucleic acid template is a primer. Primers are usually single-stranded for maximum efficiency in amplification, but may alternatively be double- stranded. If doublestranded, the primer is usually first treated to separate its strands before being used to prepare extension products. This denaturation step is typically affected by heat, but may alternatively be carried out using alkaline conditions, followed by neutralization. Thus, a “primer” is complementary to a nucleic acid template, and complexes by hydrogen bonding or hybridization with the template to give a primer / template complex for initiation of synthesis by a polymerase, which is extended by the addition of covalently bonded bases linked at their 3’ end complementary to the template in the process of DNA synthesis. The DNA template for an amplification reaction will typically include a double-stranded region having a free 3’ hydroxyl group which serves as a primer or initiation point for the addition of further nucleotides in the reaction. In embodiments, the primer is hybridized to a polynucleotide in suitable hybridization conditions (e g., saline-sodium citrate (SSC) buffer (pH 7.0), which is commonly used in nucleic acid hybridization techniques at concentrations from 0.1X to 20X). For example, hybridization may occur in the presence of an hybridization solution as described herein. For example, the hybridization solution may include 40% (v / v) formamide, 5* SSC, 5* Denhardt's solution, 0.1% (w / v) SDS, and dextran sulfate. In embodiments, the hybridization solution includes a bufferedAtorney Docket No.: 051385-638001WO solution including salts (e.g., NaCl or KC1), a surfactant (e.g., Triton™ X-100 or Tween®-20), and, optionally, a chelator. In embodiments, the hybridization solution has a pH of about 7.5, 8.0, 8.2, 8.4, 8.6, 8.8, or 9.0. In embodiments, the hybridization solution includes NaCl or KC1, Tris (e.g., pH 8.0), Triton™ X-100, and a chelator (e.g., EDTA). In embodiments, the hybridization solution includes NaCl, Tris (e.g., pH 8.5), Triton™ X-100, and a chelator (e.g., EDTA). In embodiments, the hybridization solution includes NaCl, Tris (e.g., pH 8.8), Triton™ X-100, and a chelator (e.g., EDTA). In embodiments, the hybridization solution includes NaCl, Tris (e.g., pH 8.5), Tween®-20, and a chelator (e.g., EDTA). In embodiments, the hybridization solution includes NaCl, Tris (e.g., pH 8.8), Tween®-20, and a chelator (e.g., EDTA). In embodiments, the hybridization solution includes 3 M NaCl, 0.1 M Tris-HCl (pH 6.8), 0.1 M NaPCE buffer (pH 6.8), and 50 mM EDTA. In embodiments, the hybridization solution includes formamide. In embodiments, the hybridization solution includes dextran sulfate. In embodiments, the hybridization solution includes 140 mM HEPES, pH 8,0, containing 1% SDS, 1.7 M NaCl, 7 x Denhardt’s solution, 0.2 mM EDTA, and 3% PEG. In embodiments, the hybridization solution includes acetonitrile at 25-50% by volume, formamide at 5-10% by volume; 2-(N- morpholino)ethanesulfonic acid (MES); and polyethylene glycol (PEG) at 5-35%. In some embodiments, the hybridization solution further includes betaine.

[0129] In embodiments, extending is performed in the presence of an extension solution. In embodiments, the extension solution includes a buffered solution including salts (e.g., NaCl or KC1), a surfactant (e.g., Triton™ X-100 or Tween®-20), and a chelator. In embodiments, the extension solution includes nucleotides and a polymerase (e.g., a polymerase as described herein). In embodiments, the polymerase is a strand-displacing polymerase as described herein. In embodiments, the extension solution includes about 0.5, about 1, about 2, about 3, about 4, about 5, about 6, about 7, about 8, about 9, about 10, about 11, about 12, about 13, about 14, or about 15 mM Mg2+. In embodiments, the extension solution includes a dNTP mixture including dATP, dCTP, dGTP and dTTP (for DNA amplification) or dATP, dCTP, dGTP and dUTP (for RNA amplification). In embodiments, the extension solution has a pH of about 7.5, 8.0, 8.2, 8.4, 8.6, 8.8, or 9.0. In embodiments, the extension solution includes Tris-HCl (e.g., pH 8.0), salt (e.g, NaCl or KC1), MgSCU, a surfactant (e.g., Tween®-20 or Triton™ X-100), dNTPs, BstLF, betaine (e.g., between about 0 to about 3.5M betaine), and / or DMSO (e g., between about 0% to about 12% DMSO). In embodiments, the extension solution includes bicine (e.g., pH 8.5), saltAtorney Docket No.: 051385-638001WO(e.g., NaCl or KCl), MgSO4, a surfactant (e.g., Tween-20 or Triton X-100), dNTPs, BstLF, (e.g., between about 0 to about 3.5M betaine), and / or DMSO (e.g., between about 0% to about 12% DMSO).

[0130] In embodiments, the method includes incubating the polymerase and the primer (e.g., single stranded oligonucleotide) prior to contacting the template polynucleotide, a process referred to as “pre-binding.” In embodiments, the polymerase is incubated at 25°C with the primer for 10 to 60 minutes. In embodiments, the polymerase is incubated at 35°C with the primer for 10 to 60 minutes. In embodiments, the polymerase is incubated at 40°C with the primer for 10 to 60 minutes. In embodiments, the polymerase is incubated at 42°C with the primer for 10 to 60 minutes. In embodiments, the polymerase is incubated at 42°C with the primer for 10 to 30 minutes. In embodiments, the polymerase is incubated at 42°C with the primer for 10 minutes.

[0131] In embodiments, the hybridization solution and / or the extension solution includes a buffer such as, phosphate buffered saline (PBS), succinate, citrate, histidine, acetate, Tris, TAPS, MOPS, PIPES, HEPES, MES, and the like. The choice of appropriate buffer will generally be dependent on the target pH of the hybridization solution and / or the extension solution. In general, the desired pH of the buffer solution will range from about pH 4 to about pH 8.4. In some embodiments, the buffer pH may be at least 4.0, at least 4.5, at least 5.0, at least 5.5, at least 6.0, at least 6.2, at least 6.4, at least 6.6, at least 6.8, at least 7.0, at least 7.2, at least 7.4, at least 7.6, at least 7.8, at least 8.0, at least 8.2, or at least 8.4. In some embodiments, the buffer pH may be at most 8.4, at most 8.2, at most 8.0, at most 7.8, at most 7.6, at most 7.4, at most 7.2, at most 7.0, at most 6.8, at most 6.6, at most 6.4, at most 6.2, at most 6.0, at most 5.5, at most 5.0, at most 4.5, or at most 4.0. Any of the lower and upper values described in this paragraph may be combined to form a range included within the present disclosure, for example, in some instances, the desired pH may range from about 6.4 to about 7.2. Those of skill in the art will recognize that the buffer pH may have any value within this range, for example, about 7.25.

[0132] Suitable detergents for use in the hybridization solution and / or the extension solution include, but are not limited to, zwitterionic detergents (e.g., l-Dodecanoyl-sn-glycero-3- phosphocholine, 3 -(4-tert-Butyl- 1 -pyridinio)- 1 -propanesulfonate, 3 -(N,N- Dimethylmyristylammonio)propanesulfonate, 3-(N,NDimethylmyristylammonio)Atorney Docket No.: 051385-638001WO propanesulfonate, ASB-C80, C7BzO, CHAPS, CHAPS hydrate, CHAPSO, DDMAB, Dimethylethylammoniumpropane sulfonate, N,N-Dimethyldodecylamine Noxide, N-Dodecyl- N,N-dimethyl-3-ammonio-l -propanesulfonate, or N-Dodecyl-N,N-dimethyl-3-ammonio-l- propanesulfonate) and anionic, cationic, and non-ionic detergents. Examples of nonionic detergents include poly(oxyethylene) ethers and related polymers (e.g. Brij®, TWEEN®, TWEEN®-20, TRITON™, TRITON™ X-100 and IGEPAL® CA-630), bile salts, and glycosidic detergents. In embodiments, the hybridization solution and / or the extension solution include antioxidants and reducing agents, carbohydrates, BSA, polyethylene glycol, dextran sulfate, betaine, other additives.

[0133] In embodiments, the method includes amplifying the polynucleotide via an isothermal amplification process. Isothermal amplification processes include rolling-circle amplification (RCA), strand displacement amplification (SDA), loop-mediated isothermal amplification (LAMP), smart amplification process (SMAP), isothermal and chimeric primer-initiated amplification (ICAN), and simple method for amplifying RNA targets (SMART). In these techniques, the extension reaction proceeds at a constant temperature, for example using strand displacement reactions. Amplification can be completed in a single step, by incubating the mixture of samples, primers, DNA polymerase with strand displacement activity, and substrates at a constant temperature (e g., about 25°C to about 40°C). This reduces the number of steps required, eliminating thermal ramping steps and reducing the total cycle time for each amplification cycle, while simultaneously decreasing the reaction time required for each cycle. In embodiments, the method includes an amplification technique capable of amplifying a circular template polynucleotide. In embodiments, the method includes exponential rolling circle amplification (eRCA). Exponential RCA is similar to the linear process except that it uses a second primer having a sequence that is identical to at least a portion of the circular template (Lizardi et al. Nat. Genet. 19:225 (1998)). This two-primer system achieves isothermal, exponential amplification. Exponential RCA has been applied to the amplification of noncircular DNA through the use of a linear probe that binds at both of its ends to contiguous regions of a target DNA followed by circularization using DNA ligase (Nilsson et al. Science 265(5181):208 5(1994)). In embodiments, the method includes hyberbranched rolling circle amplification (HRCA). Hyperbranched RCA uses a second primer complementary to the first amplification product. This allows products to be replicated by a strand-displacementAtorney Docket No.: 051385-638001WO mechanism, which can yield a drastic amplification within an isothermal reaction (Lage et al., Genome Research 13:294-307 (2003), which is incorporated herein by reference in its entirety). Strand displacement amplification (SDA) relies on a strand displacement DNA polymerase and nicking endonuclease, which are used in repeated nicking and extending steps to generate downstream displacement strands capable of participating in exponential amplification (Walker et al. Nucleic Acids Res. 20: 1691-1696 (1992)). Loop-mediated isothermal amplification (LAMP) utilizes DNA polymerase in the presence of four specially designed primers to recognize six distinct sequences of target DNA, which enables the target DNA to be synthesized and displaced to produce a stem-loop DNA structure (Notomi et al. Nucleic Acids Res. 28:e63 (2000)). Smart amplification process (SMAP) is an isothermal nucleic acid amplification method that uses strand displacing Aac DNA polymerase, asymmetrical primers, and mismatch binding protein Taq MutS to amplify target alleles with minimal background amplification (Mitani et al. Nat. Methods. 4:257-62 (2007)). Isothermal and chimeric primer-initiated amplification (ICAN) relies on a pair of 5'-DNA-RNA-3' chimeric primers, thermostable RNaseH and a strand displacing DNA polymerase to amplify target DNA (Uemori et al. J Biochem. 142:283-92 (2007)). Simple method for amplifying RNA targets (SMART) relies on initially capturing target RNA using a target specific primers immobilized to the surface of magnetic beads, followed by hybridizing target RNA sequences to highly specific amplifiable SMART probes to facilitate the reverse transcription of the target RNA to cDNA and amplification of the target templates (McCalla et al. J Mol Diagn. 14: 328-335 (2012)). In embodiments, the template polynucleotide includes single-stranded circular DNA. In embodiments, the template polynucleotide is singlestranded circular DNA. In embodiments, the template polynucleotide includes double-stranded DNA. In embodiments, the template polynucleotide is double-stranded DNA. In embodiments, the template polynucleotide includes single stranded RNA. In embodiments, the template polynucleotide is single stranded RNA. In embodiments, the template polynucleotide includes double stranded RNA. In embodiments, the template polynucleotide is double stranded RNA. In embodiments, the method further includes forming the template polynucleotide by ligating ends of a linear polynucleotide (e.g., a single stranded polynucleotide) together to form a circular template polynucleotide. In embodiments, the method further includes forming the template polynucleotide by ligating a hairpin adapter to an end of a linear polynucleotide. In embodiments, the method includes forming the template polynucleotide includes ligating hairpinAtorney Docket No.: 051385-638001WO adapters to both ends of the linear polynucleotide thereby forming a circular template polynucleotide. In embodiments, the template polynucleotide is single-stranded circular DNA. Methods for forming circular DNA templates are known in the art, for example linear polynucleotides are circularized in a non-template driven reaction with circularizing ligase, such as CircLigase™, Taq DNA Ligase, HiFi Taq DNA Ligase, T4 ligase, or Ampligase® DNA Ligase. In embodiments, the method of forming the template polynucleotide includes ligating ends of a linear polynucleotide together. In embodiments, the two ends of the template polynucleotide are ligated directly together. In embodiments, the two ends of the template polynucleotide are ligated together with the aid of a bridging oligonucleotide (sometimes referred to as a splint oligonucleotide) that is complementary with the two ends of the template polynucleotide. In embodiments, the bridging oligonucleotide contains the amplification primer.

[0134] Circular polynucleotides of virtually any sequence can be produced using a variety of techniques (see for example U.S. Pat. No. 5,426,180; Dolinnaya et al. Nucleic Acids Research, 21 : 5403-5407 (1993); or Rubin et al. Nucleic Acids Research, 23: 3547-3553 (1995), which are incorporated herein by reference). In embodiments, the template polynucleotide of step (a) is a circular polynucleotide that is about 100 to about 1000 nucleotides in length, about 100 to about 300 nucleotides in length, about 300 to about 500 nucleotides in length, or about 500 to about 1000 nucleotides in length. In embodiments, the circular polynucleotide is about 300 to about 600 nucleotides in length. In embodiments, the circular polynucleotide is about 100-1000 nucleotides, about 150-950 nucleotides, about 200-900 nucleotides, about 250-850 nucleotides, about 300-800 nucleotides, about 350-750 nucleotides, about 400-700 nucleotides, or about 450- 650 nucleotides in length. In embodiments, the circular polynucleotide molecule is about 100- 1000 nucleotides in length. In embodiments, the circular polynucleotide molecule is about 100- 300 nucleotides in length. In embodiments, the circular polynucleotide molecule is about SOO- SOO nucleotides in length. In embodiments, the circular polynucleotide molecule is about 500- 1000 nucleotides in length. In embodiments, the circular polynucleotide molecule is about 100 nucleotides. In embodiments, the initial template polynucleotide molecule is about 300 nucleotides. In embodiments, the circular polynucleotide molecule is about 500 nucleotides. In embodiments, the circular polynucleotide molecule is about 1000 nucleotides. Circular polynucleotides may be conveniently isolated by a conventional purification column, digestion of non-circular DNA by one or more appropriate exonucleases, or both.Atorney Docket No.: 051385-638001WO

[0135] In embodiments, the template polynucleotide is a circular polynucleotide that is about 100 to about 1000 nucleotides in length, about 100 to about 300 nucleotides in length, about 300 to about 500 nucleotides in length, or about 500 to about 1000 nucleotides in length. In embodiments, the circular polynucleotide is about 300 to about 600 nucleotides in length. In embodiments, the circular polynucleotide includes at least one cleavable site. In embodiments, the method includes forming the template polynucleotide. The template polynucleotide can be a circular, dumbbell-shaped, or other closed nucleic acid molecule configuration that does not have a free 3’ or 5’ end. Typical library preparation steps may be performed on a linear template such that it is circularized (e.g., such as the protocols described in Kershaw, C. J., & O'Keefe, R. T. (2012) 941, 257-269). The initial template polynucleotide molecules can vary length, such as about 100-300 nucleotides long, about 300-500 nucleotides long, or about 500-1000 nucleotides long. In embodiments, the initial template polynucleotide molecular is about 100-1000 nucleotides, about 150-950 nucleotides, about 200-900 nucleotides, about 250-850 nucleotides, about 300-800 nucleotides, about 350-750 nucleotides, about 400-700 nucleotides, or about 450- 650 nucleotides. In embodiments, the initial template polynucleotide molecule is about 150 nucleotides. In embodiments, the initial template polynucleotide is about 100-1000 nucleotides long. In embodiments, the initial template polynucleotide is about 100-300 nucleotides long. In embodiments, the initial template polynucleotide is about 300-500 nucleotides long. In embodiments, the initial template polynucleotide is about 500-1000 nucleotides long. In embodiments, the initial template polynucleotide molecule is about 100 nucleotides. In embodiments, the initial template polynucleotide molecule is about 300 nucleotides. In embodiments, the initial template polynucleotide molecule is about 500 nucleotides. In embodiments, the initial template polynucleotide molecule is about 1000 nucleotides.

[0136] In embodiments, the polymerase used in methods that involve using a circular or circularizable construct hybridized to the nucleic acid of interest to generate a circular nucleic acid. In embodiments, the RCA includes a linear RCA. In embodiments, the RCA includes a branched RCA. In embodiments, the RCA includes a dendritic RCA. In embodiments, the RCA includes any combination of the foregoing. In embodiments, the circular nucleic acid is a construct formed using ligation. In embodiments, the circular construct is formed using template primer extension followed by ligation. In embodiments, the circular construct is formed by providing an oligonucleotide between ends to be ligated. In embodiments, the ligation is a DNA-Atorney Docket No.: 051385-638001WODNA templated ligation. In embodiments, the ligation is an RNA-RNA templated ligation. In some embodiments, the ligation is an RNA-DNA templated ligation. In embodiments, a splint is provided as a template for ligation. Following formation of the circular nucleic acid, in some instances, an amplification primer is added. In embodiments, the amplification primer may also be complementary to the target nucleic acid and the circularizable probe (e.g., a SNAIL probe). In embodiments, a washing step is performed to remove any unbound probes or primers. In embodiments, the wash is a stringency wash. Washing steps can be performed at any point during the process to remove non-specifically bound probes.

[0137] In embodiments, extending the amplification primer includes incubation with the polymerase (e.g., a polymerase described herein) in suitable conditions and for a suitable amount of time. In embodiments, the step of extending the amplification primer includes incubation with the polymerase (i) for about 10 seconds to about 30 minutes, and / or (ii) at a temperature of about 20°C to about 50°C. In embodiments, incubation with the polymerase is for about 0.5 minutes to about 16 minutes. In embodiments, incubation with the polymerase is for about 0.5 minutes to about 10 minutes. In embodiments, incubation with the polymerase is for about 1 minutes to about 5 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 10 seconds to about 30 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 30 seconds to about 16 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 30 seconds to about 10 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 30 seconds to about 5 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 1 second to about 5 minutes. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase for about 1 second to about 2 minutes.

[0138] In embodiments, incubation with the polymerase is at a temperature of about 35°C to 42°C. In embodiments, incubation with the polymerase is at a temperature of about 37°C to 40°C. In embodiments, incubation with the thermostable polymerase is at a temperature of aboutAtorney Docket No.: 051385-638001WO40°C to 80°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 20°C to about 50°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 30°C to about 50°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 25°C to about 45°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 35°C to about 45°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 35 °C to about 42°C. In embodiments, the method includes amplifying a template polynucleotide by extending an amplification primer with a polymerase at a temperature of about 37°C to about 40°C. In embodiments, the RCA is performed at a temperature of at or about 5° C., 10° C., 15° C., 20° C., 25° C , 30° C„ 35° C„ 37° C„ 40° ° C , 42° ° C„ 45° C„ 50° ° C or higher.

[0139] In embodiments, the polymerase is contacted with the amplification primer and template polynucleotide in the absence of dNTPs; optionally, excess polymerase is removed; and amplification buffer with dNTPs is added to initiate amplification. In embodiments, the polymerase is contacted with the amplification primer and template polynucleotide in the absence of dNTPs; and optionally, excess polymerase is removed. In embodiments, the amplification primer is hybridized to the template polynucleotide prior to contact with the polymerase. For example, the amplification primer and template polynucleotide form a complex, and the polymerase subsequently binds to this complex. In embodiments, the polymerase is contacted with the amplification primer and template polynucleotide in the absence of dNTPs and / or Mg2+. The polymerization rate of a polymerase is the speed at which the polymerase is able to catalyze nucleotide incorporation into a growing polynucleotide strand during nucleic acid amplification, which can be described as the number of nucleotide bases incorporated per units of time (e.g., bases / second). The polymerization rate is dependent on different reaction conditions including temperature, buffer composition, substrate, substrate concentration, and time. In some embodiments, the provided polymerases described herein exhibits improved polymerization rate relative to a control (e.g., a reference Phi29 polymerase).Atorney Docket No.: 051385-638001WO

[0140] In embodiments, the method includes contacting a biological sample with a circularizable probe. The circularizable probe comprises one or more hybridization regions and optionally one or more barcode regions, where the barcode regions correspond to a sequence of the target nucleic acid, and the hybridization regions bind to or are complementary to the target nucleic acid or a portion thereof. In embodiments, the method includes ligating the ends of the circularizable probe, with ligation templated by the binding of the hybridization regions to the target nucleic acid, thereby generating a circularized probe. The circularized probe is then amplified by RCA, generating an amplification product that contains multiple copies of a sequence complementary to the circularized probe. Detection of the RCA product may involve sequencing the barcode sequences, in situ hybridization, or hybridization with a detection oligonucleotide labeled with a fluorophore, isotope, mass tag, or a combination thereof. Detection can also involve imaging the RCA product. Both amplification and detection may be performed in situ within a fixed or permeabilized biological sample, such as tissue. The target nucleic acid may include deoxyribonucleotide or ribonucleotide residues.

[0141] In embodiments, the polymerase generates a higher density of detected rolling-circle amplification (RCA) products in an RCA compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase generates a higher signal intensity of RCA products in an RCA compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase generates a smaller or more compact RCA product compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase exhibits improved solubility compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase exhibits improved processivity compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase exhibits improved fidelity compared to a control (e.g., reference Phi29 polymerase). In embodiments, the polymerase exhibits improved polymerization rate compared to a control (e.g., reference Phi29 polymerase). In embodiments, the control is the wild type polymerase. In embodiments, the control is MS-65.

[0142] In embodiments, polymerase processivity is the ability of a nucleic acid polymerase to carry out continuous nucleic acid synthesis on a template nucleic acid without frequent dissociation. Processivity can be measured by the average number of nucleotides incorporated by a nucleic acid polymerase on a single association-disassociation event. In embodiments, theAtorney Docket No.: 051385-638001WO polymerase described herein may exhibit an average processivity of at least 10 kb, at least 20 kb, at least 30 kb, at least 40 kb, at least 50 kb, at least 60 kb, or at least 70 kb. In embodiments, the polymerase described herein exhibits improved processivity relative to a reference polymerase, wherein the improved processivity improves polymerase activity. In embodiments, the polymerase exhibits improved processivity compared to a reference polymerase (e.g., a wild-type Phi29 polymerase). In embodiments, the polymerase produces more amplified nucleic acid products compared to a reference polymerase. In embodiments, the polymerase amplifies nucleic acid faster compared to a reference polymerase. In embodiments, the nucleic acid amplification is an RCA reaction. In embodiments, the polymerase generates a higher density of detected RCA products in an RCA reaction compared to a reference polymerase. In embodiments, the polymerase generates a higher product signal intensity in an RCA reaction compared to a reference polymerase.

[0143] In embodiments, the RCA reaction is assessed or evaluated for generated RCA product (RCP) signal intensity, from detection of the produced RCA product. In embodiments, the product signal intensity is calculated as a local signal intensity over a local background mean intensity. In embodiments, the polymerase produces an RCA product signal intensity at or about or at least at or about 1%, 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 75%, 100%, 150%, or 200% higher compared to a reference polymerase (e.g., a wild-type Phi29 polymerase). In some embodiments, the polymerase produce an RCA product signal intensity at least between 10% and 200%, 25% and 175%, 50% and 150%, 75% and 125%, 90% and 110%, or 95% and 105% higher compared to a reference polymerase (e.g., a wild-type Phi29 polymerase).

[0144] In embodiments, the provided polymerase is used to amplify nucleic acids, such as DNA, in a rolling-circle amplification (RCA) reaction. The polymerase generates a higher density of detected RCA products (RCPs) in an RCA reaction compared to a reference polymerase, such as a wild-type Phi29 polymerase. Determining the density of detected RCPs involves the use and detection of a specific fluorescent dye, with the density measured in RCA product counts per pm2sample area and potentially related to the total sample signal. In embodiments, the polymerase produces a density of detected RCPs that is at least 1%, 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 75%, 100%, 150%, or 200% higher compared to a reference polymerase. In embodiments, the density of detected RCPs is within 10% to 200%, 25% toAtorney Docket No.: 051385-638001WO175%, 50% to 150%, 75% to 125%, 90% to 1 10%, or 95% to 105% higher than that of the reference polymerase.

[0145] In embodiments, the polymerase described herein exhibits improved thermostability relative to a control. In embodiments, the improved thermostability includes improved polymerase activity. In embodiments, the improved thermostability includes improved, more rapid, more robust, and / or more accurate polymerase activity, for example at elevated temperatures (e.g., greater than 37°C).

[0146] In an aspect is provided a method of amplifying a template polynucleotide, the method including: contacting a template polynucleotide with an amplification primer, and amplifying the template polynucleotide by extending an amplification primer with a polymerase to generate an extension product including one or more complements of the template polynucleotide; wherein the polymerase is a polymerase as described herein. In embodiments, the method includes amplifying the circular oligonucleotide by extending an amplification primer hybridized to the circular oligonucleotide with a strand-displacing polymerase (e.g., a polymerase as described herein), wherein the amplification primer extension generates an extension product including multiple complements of the circular oligonucleotide. In embodiments, the method further includes sequencing the circular oligonucleotide. A variety of sequencing methodologies can be used such as sequencing-by-synthesis (SBS), pyrosequencing, sequencing by ligation (SBL), or sequencing by hybridization (SBH). Pyrosequencing detects the release of inorganic pyrophosphate (PPi) as particular nucleotides are incorporated into a nascent nucleic acid strand (Ronaghi, et al., Analytical Biochemistry 242(1), 84-9 (1996); Ronaghi, Genome Res. 11(1), 3- 11 (2001); Ronaghi et al. Science 281(5375), 363 (1998); U.S. Pat. Nos. 6,210,891; 6,258,568; and. 6,274,320, each of which is incorporated herein by reference in its entirety). In pyrosequencing, released PPi can be detected by being converted to adenosine triphosphate (ATP) by ATP sulfurylase, and the level of ATP generated can be detected via light produced by luciferase. In this manner, the sequencing reaction can be monitored via a luminescence detection system. In both SBL and SBH methods, target nucleic acids, and amplicons thereof, that are present at features of an array are subjected to repeated cycles of oligonucleotide delivery and detection. SBL methods, include those described in Shendure et al. Science 309: 1728-1732 (2005); U.S. Pat. Nos. 5,599,675; and 5,750,341, each of which is incorporatedAtorney Docket No.: 051385-638001WO herein by reference in its entirety; and the SBH methodologies are as described in Bains et al., Journal of Theoretical Biology 135(3), 303-7 (1988); Drmanac et al., Nature Biotechnology 16, 54-58 (1998); Fodor et al., Science 251(4995), 767-773 (1995); and WO 1989 / 10977, each of which are incorporated herein by reference in their entirety.

[0147] In SBS, extension of a nucleic acid primer along a nucleic acid template is monitored to determine the sequence of nucleotides in the template. The underlying chemical process can be catalyzed by a polymerase, wherein fluorescently labeled nucleotides are added to a primer (thereby extending the primer) in a template dependent fashion such that detection of the order and type of nucleotides added to the primer can be used to determine the sequence of the template. A plurality of different nucleic acid fragments that have been attached at different locations of an array can be subjected to an SBS technique under conditions where events occurring for different templates can be distinguished due to their location in the array. In embodiments, the sequencing step includes annealing and extending a sequencing primer to incorporate a detectable label that indicates the identity of a nucleotide in the target polynucleotide, detecting the detectable label, and repeating the extending and detecting steps. In embodiments, the methods include sequencing one or more bases of a target nucleic acid by extending a sequencing primer hybridized to a target nucleic acid (e.g., an amplification product produced by the amplification methods described herein). In embodiments, the sequencing step may be accomplished by a sequencing-by-synthesis (SBS) process. In embodiments, sequencing includes a sequencing by synthesis process, where individual nucleotides are identified iteratively, as they are polymerized to form a growing complementary strand. In embodiments, nucleotides added to a growing complementary strand include both a label and a reversible chain terminator that prevents further extension, such that the nucleotide may be identified by the label before removing the terminator to add and identify a further nucleotide. Such reversible chain terminators include removable 3’ blocking groups, for example as described in U.S. Pat. Nos. 10,738,072, 7,541,444 and 7,057,026. Once such a modified nucleotide has been incorporated into the growing polynucleotide chain complementary to the region of the template being sequenced, there is no free 3'-OH group available to direct further sequence extension and therefore the polymerase cannot add further nucleotides. Once the identity of the base incorporated into the growing chain has been determined, the 3’ block may be removed to allow addition of the next successive nucleotide. By ordering the products derived using theseAtorney Docket No.: 051385-638001WO modified nucleotides it is possible to deduce the DNA sequence of the DNA template. Nonlimiting examples of suitable labels are described in U.S. Pat. No. 8,178,360, U.S. Pat. No. 5,188,934 (4,7-dichlorofluorscein dyes); U.S. Pat. No. 5,366,860 (spectrally resolvable rhodamine dyes); U.S. Pat. No. 5,847,162 (4,7-dichlororhodamine dyes); U.S. Pat. No. 4,318,846 (ether- substituted fluorescein dyes); U.S. Pat. No. 5,800,996 (energy transfer dyes); U.S. Pat. No. 5,066,580 (xanthene dyes): U.S. Pat. No. 5,688,648 (energy transfer dyes); and the like.

[0148] Sequencing includes, for example, detecting a sequence of signals. Examples of sequencing include, but are not limited to, sequencing by synthesis (SBS) processes in which reversibly terminated nucleotides carrying fluorescent dyes are incorporated into a growing strand, complementary to the target strand being sequenced. In embodiments, the nucleotides are labeled with up to four unique fluorescent dyes. In embodiments, the nucleotides are labeled with at least two unique fluorescent dyes. In embodiments, the readout is accomplished by epifluorescence imaging. A variety of sequencing chemistries are available, non-limiting examples of which are described herein.

[0149] In embodiments, sequencing includes sequencing-by-binding (see, e g., U.S. Pat. Pubs. US2017 / 0022553 and US2019 / 0048404, each of which are incorporated herein by reference in their entirety). As used herein, “sequencing-by-binding” refers to a sequencing technique wherein specific binding of a polymerase and cognate nucleotide to a primed template nucleic acid molecule (e g., blocked primed template nucleic acid molecule) is used for identifying the next correct nucleotide to be incorporated into the primer strand of the primed template nucleic acid molecule. The specific binding interaction need not result in chemical incorporation of the nucleotide into the primer. In some embodiments, the specific binding interaction can precede chemical incorporation of the nucleotide into the primer strand or can precede chemical incorporation of an analogous, next correct nucleotide into the primer. Thus, detection of the next correct nucleotide can take place without incorporation of the next correct nucleotide. As used herein, the “next correct nucleotide” (sometimes referred to as the “cognate” nucleotide) is the nucleotide having a base complementary to the base of the next template nucleotide. The next correct nucleotide will hybridize at the 3 '-end of a primer to complement the next template nucleotide. The next correct nucleotide can be, but need not necessarily be, capable of being incorporated at the 3' end of the primer. For example, the next correct nucleotide can be aAtorney Docket No.: 051385-638001WO member of a ternary complex that will complete an incorporation reaction or, alternatively, the next correct nucleotide can be a member of a stabilized ternary complex that does not catalyze an incorporation reaction. A nucleotide having a base that is not complementary to the next template base is referred to as an “incorrect” (or “non-cognate”) nucleotide. In embodiments, sequencing includes sequencing-by-avidity (SBA). Some embodiments of SBA approaches are described in U.S. Pat. No. 10,768,173 B2, the content of which is herein incorporated by reference in its entirety. In embodiments, SBA comprises detecting a multivalent binding complex formed between a fluorescently-labeled polymer-nucleotide conjugate, and a one or more primed target nucleic acid sequences (e.g., barcode sequences). Fluorescence imaging is used to detect the bound complex and thereby determine the identity of the N+l nucleotide in the target nucleic acid sequence (where the primer extension strand is N nucleotides in length). Following the imaging step, the multivalent binding complex is disrupted and washed away, the correct blocked nucleotide is incorporated into the primer extension strand, and the sequencing cycle is repeated.

[0150] Flow cells provide a convenient format for housing an array of clusters produced by the methods described herein, in particular when subjected to an SBS or other detection technique that involves repeated delivery of reagents in cycles. For example, to initiate a first SBS cycle, one or more labeled nucleotides and a DNA polymerase in a buffer, can be flowed into / through a flow cell that houses an array of clusters. The clusters of an array where primer extension causes a labeled nucleotide to be incorporated can then be detected. Optionally, the nucleotides can further include a reversible termination moiety that temporarily halts further primer extension once a nucleotide has been added to a primer. For example, a nucleotide analog having a reversible terminator moiety can be added to a primer such that subsequent extension cannot occur until a deblocking agent (e.g., a reducing agent) is delivered to remove the moiety. Thus, for embodiments that use reversible termination, a deblocking reagent (e.g., a reducing agent) can be delivered to the flow cell (before, during, or after detection occurs). Washes can be carried out between the various delivery steps as needed. The cycle can then be repeated A times to extend the primer by A nucleotides, thereby detecting a sequence of length A. Example SBS procedures, fluidic systems and detection platforms that can be readily adapted for use with an array produced by the methods of the present disclosure are described, for example, in Bentley et al., Nature 456:53-59 (2008), US Patent Publication 2018 / 0274024, WO 2017 / 205336, USAtorney Docket No.: 051385-638001WOPatent Publication 2018 / 0258472, each of which are incorporated herein in their entirety for all purposes.

[0151] Use of the sequencing method outlined above is a non-limiting example, as essentially any sequencing methodology which relies on successive incorporation of nucleotides into a polynucleotide chain can be used. Suitable alternative techniques include, for example, pyrosequencing methods, FISSEQ (fluorescent in situ sequencing), MPSS (massively parallel signature sequencing), or sequencing by ligation-based methods.

[0152] In embodiments, generating a sequencing read includes determining the identity of the nucleotides in the template polynucleotide (or complement thereof). In embodiments, a sequencing read, e.g., a first sequencing read or a second sequencing read, includes determining the identity of a portion (e.g., 1, 2, 5, 10, 20, 50 nucleotides) of the total template polynucleotide. In embodiments the first sequencing read determines the identity of 5-10 nucleotides and the second sequencing read determines the identity of more than 5-10 nucleotides (e.g., 11 to 200 nucleotides). In embodiments the first sequencing read determines the identity of more than 5-10 nucleotides (e g., 11 to 200 nucleotides) and the second sequencing read determines the identity of 5-10 nucleotides.

[0153] In embodiments, the sequencing method relies on the use of modified nucleotides that can act as reversible reaction terminators. Once the modified nucleotide has been incorporated into the growing polynucleotide chain complementary to the region of the template being sequenced there is no free 3 ’-OH group available to direct further sequence extension and therefore the polymerase cannot add further nucleotides. Once the identity of the base incorporated into the growing chain has been determined, the 3’ reversible terminator may be removed to allow addition of the next successive nucleotide. These such reactions can be done in a single experiment if each of the modified nucleotides has attached a different label, known to correspond to the particular base, to facilitate discrimination between the bases added at each incorporation step. Alternatively, a separate reaction may be carried out containing each of the modified nucleotides separately.

[0154] The modified nucleotides may carry a label (e.g., a fluorescent label) to facilitate their detection. Each nucleotide type may carry a different fluorescent label. However, the detectable label need not be a fluorescent label. Any label can be used which allows the detection of anAtorney Docket No.: 051385-638001WO incorporated nucleotide. One method for detecting fluorescently labeled nucleotides includes using laser light of a wavelength specific for the labeled nucleotides, or the use of other suitable sources of illumination. The fluorescence from the label on the nucleotide may be detected (e.g., by a CCD camera or other suitable detection means).

[0155] In embodiments, fluorescence microscopy is used for detection and imaging of the detection probe. In embodiments, a fluorescence microscope is an optical microscope that uses fluorescence and phosphorescence instead of, or in addition to, reflection and absorption to study properties of organic or inorganic substances. In fluorescence microscopy, a sample is illuminated with light of a wavelength which excites fluorescence in the sample. The fluoresced light, which is usually at a longer wavelength than the illumination, is then imaged through a microscope objective. Two filters may be used in this technique; an illumination (or excitation) filter which ensures the illumination is near monochromatic and at the correct wavelength, and a second emission (or barrier) filter which ensures none of the excitation light source reaches the detector. Alternatively, these functions may both be accomplished by a single dichroic filter. The fluorescence microscope can be any microscope that uses fluorescence to generate an image, whether it is a simpler set up like an epifluorescence microscope, or a more complicated design such as a confocal microscope, which uses optical sectioning to get better resolution of the fluorescent image. Other types of microscopy that can be employed comprise bright field microscopy, oblique illumination microscopy, dark field microscopy, phase contrast, differential interference contrast (DIC) microscopy, interference reflection microscopy (also known as reflected interference contrast, or RIC), single plane illumination microscopy (SPIM), superresolution microscopy, laser microscopy, electron microscopy (EM), Transmission electron microscopy (TEM), Scanning electron microscopy (SEM), reflection electron microscopy (REM), Scanning transmission electron microscopy (STEM) and low-voltage electron microscopy (LVEM), scanning probe microscopy (SPM), atomic force microscopy (ATM), ballistic electron emission microscopy (BEEM), chemical force microscopy (CFM), conductive atomic force microscopy (C-AFM), electrochemical scanning tunneling microscope (ECSTM), electrostatic force microscopy (EFM), fluidic force microscope (FluidFM), force modulation microscopy (FMM), feature-oriented scanning probe microscopy (FOSPM), kelvin probe force microscopy (KPFM), magnetic force microscopy (MFM), magnetic resonance force microscopy (MRFM), near-field scanning optical microscopy (NSOM) (or SNOM, scanning near-fieldAtorney Docket No.: 051385-638001WO optical microscopy, SNOM, Piezoresponse Force Microscopy (PFM), PSTM, photon scanning tunneling microscopy (PSTM), PTMS, photothermal microspectroscopy / microscopy (PTMS), SCM, scanning capacitance microscopy (SCM), SECM, scanning electrochemical microscopy (SECM), SGM, scanning gate microscopy (SGM), SHPM, scanning Hall probe microscopy (SHPM), SICM, scanning ion-conductance microscopy (SICM), SPSM spin polarized scanning tunneling microscopy (SPSM), SSRM, scanning spreading resistance microscopy (SSRM), SThM, scanning thermal microscopy (SThM), STM, scanning tunneling microscopy (STM), STP, scanning tunneling potentiometry (STP), SVM, scanning voltage microscopy (SVM), and synchrotron x-ray scanning tunneling microscopy (SXSTM), and intact tissue expansion microscopy (exM).

[0156] In embodiments, sequences can be analyzed in situ, e.g., by incorporation of a labelled nucleotide (e g., fluorescently labelled mononucleotides or dinucleotides) in a sequential, template-dependent manner or hybridization of a labelled primer (e.g., a labelled random hexamer) to a nucleic acid template such that the identities (e.g., nucleotide sequence) of the incorporated nucleotides or labelled primer extension products can be determined, and consequently, the nucleotide sequence of the corresponding template nucleic acid.

[0157] In embodiments, the methods of sequencing a nucleic acid include extending a complementary polynucleotide (e.g., a primer) that is hybridized to the nucleic acid by incorporating a first nucleotide. In embodiments, the method includes a buffer exchange or wash step. In embodiments, the methods of sequencing a nucleic acid include a sequencing solution. The sequencing solution includes (a) an adenine nucleotide, or analog thereof; (b) (i) a thymine nucleotide, or analog thereof, or (ii) a uracil nucleotide, or analog thereof; (c) a cytosine nucleotide, or analog thereof; and (d) a guanine nucleotide, or analog thereof

[0158] In embodiments, the method includes amplifying the template polynucleotide in a cell. In embodiments, the method includes amplifying the template polynucleotide in a tissue. In embodiments, the method includes amplifying the template polynucleotide one a solid support (e.g., a multiwell container or a flowcell). In embodiments, the amplification primer is immobilized on a solid support.

[0159] In embodiments, the method includes incorporating one or more cleavable sites into the amplification product. For example, in embodiments, the method includes contacting the primedAtorney Docket No.: 051385-638001WODNA template with a composition including a plurality of native DNA nucleotides and cleavable site nucleotides, thereby forming a plurality of amplification products, wherein the amplification products include a cleavable site nucleotide at a different position relative to each other.

[0160] In embodiments, the polymerases described herein have improved polymerase activity (i.e., improved relative to a control). Polymerase activity, in some instances, includes the measurable quantity kcat, kcat / Km, or yields of incorporated nucleotides for a given time period. In embodiments, the polymerases described herein have increased extension activity (i.e., increased relative to a control). Increased extension activity variously refers to an increase in reaction kinetics (increased kcat), increased KD, decreased Km, increased kcat / Kmratio, faster turnover rate, higher turnover number, or other metric that is beneficial to the use of the polypeptide for nucleic acid extension with nucleotides. The polypeptides described herein often incorporate at least 30% more nucleotides than the wild-type polymerase in total or in a given duration of time.

[0161] In embodiments, the polymerases described herein often incorporate at least 10%, 20%, 30%, 50%, 75%, 100%, 125%, 150%, 200%, 500%, more nucleotides than a control (e.g., the wild-type polymerase) for a fixed amount of time and same nucleotide concentration. In embodiments, the polymerases described herein incorporate nucleotides at least 1.5, 2, 2.5, 5, 10, 15, 20, 25, or at least 50 times faster than a control (e.g., the wild-type polymerase) for a fixed amount of time. Such measurements are often measured under conditions such as a set period of time, such as at least, at most, or exactly 1, 2, 3, 5, 8, 10, 15, 20, or more than 20 minutes. Such measurements are often measured under conditions such as a set nucleotide concentration, such as less than 10 pM, 20 pM, 50 pM, 100 pM, 200 pM, 300 pM, 500 pM, or more than 500 pM, or any concentration within the range identified herein.

[0162] In embodiments, the methods described herein further includes detecting the amplification product (e.g., detecting via sequencing method, or for example, by fluorescent detection methods). In any of the embodiments herein, the detecting can include contacting the biological sample with one or more detectably-labeled probes bind to the one or more amplification products, and detecting signals associated with the one or more detectably-labeled probes.

[0163] The methods and compositions provided herein can be used to detect and analyze a wide variety of different analytes. In some aspects, an analyte can include any biologicalAtorney Docket No.: 051385-638001WO substance, structure, moiety, or component to be analyzed. In some aspects, a target disclosed herein may similarly include any analyte of interest. In some examples, a target or analyte can be directly or indirectly detected. Analytes can be derived from a specific type of cell and / or a specific sub-cellular region. For example, analytes can be derived from cytosol, from cell nuclei, from mitochondria, from microsomes, and more generally, from any other compartment, organelle, or portion of a cell. Permeabilizing agents that specifically target certain cell compartments and organelles can be used to selectively release analytes from cells for analysis, and / or allow access of one or more reagents (e.g., probes for analyte detection) to the analytes in the cell or cell compartment or organelle. The analyte may include any biomolecule or chemical compound, including a macromolecule such as a protein or peptide, a lipid or a nucleic acid molecule, or a small molecule, including organic or inorganic molecules. The analyte may be a cell or a microorganism, including a virus, or a fragment or product thereof. An analyte can be any substance or entity for which a specific binding partner (e.g. an affinity binding partner) can be developed. Such a specific binding partner may be a nucleic acid probe (for a nucleic acid analyte) and may lead directly to the generation of a RCA template (e.g. a circularizable probe or circular probe). Alternatively, the specific binding partner may be coupled to a nucleic acid, which may be detected using an RCA strategy, e.g. in an assay which uses or generates a circular nucleic acid molecule which can be the RCA template. Analytes of particular interest may include nucleic acid molecules, such as DNA (e.g. genomic DNA, mitochondrial DNA, plastid DNA, viral DNA, etc.) and RNA (e.g. mRNA, microRNA, rRNA, snRNA, viral RNA, etc ), and synthetic and / or modified nucleic acid molecules, (e.g. including nucleic acid domains comprising or consisting of synthetic or modified nucleotides such as LNA, PNA, morpholino, etc.), proteinaceous molecules such as peptides, polypeptides, proteins or prions or any molecule which includes a protein or polypeptide component, etc., or fragments thereof, or a lipid or carbohydrate molecule, or any molecule which comprise a lipid or carbohydrate component. The analyte may be a single molecule or a complex that contains two or more molecular subunits, e.g. including but not limited to protein-DNA complexes, which may or may not be covalently bound to one another, and which may be the same or different. Thus in addition to cells or microorganisms, such a complex analyte may also be a protein complex or protein interaction. Such a complex or interaction may thus be a homo- or hetero-multimer. Aggregates of molecules, e.g. proteins may also be target analytes, for example aggregates of the same proteinAtorney Docket No.: 051385-638001WO or different proteins. The analyte may also be a complex between proteins or peptides and nucleic acid molecules such as DNA or RNA, e.g. interactions between proteins and nucleic acids, e.g. regulatory factors, such as transcription factors, and DNA or RNA.EXAMPLESExample 1. Development of Polymerase variants

[0164] DNA amplification has many applications in molecular biology research and medical diagnostics. There are two main strategies for amplifying a defined sequence of nucleic acid: polymerase chain reaction (PCR) and isothermal amplification. The polymerase chain reaction relies upon thermal cycling to denature dsDNA templates, followed by annealing primers at specific sites in the denatured template, and extension of the primers by a thermostable DNA polymerase. Isothermal amplification of DNA, as the name implies, typically includes amplification of the dsDNA at a defined temperature. The lack of thermal cycling in isothermal amplification technologies reduces equipment needs and improves the time to answer, especially for point-of-care applications.

[0165] A variety of isothermal amplification methods have been developed, for example, strand displacement amplification (SDA) (Walker, G. T. et al. Nucleic Acids Res 20, 1691-6 (1992); and Walker, G. T., Little, M. C, Nadeau, J. G. & Shank, D. D. PNAS 89, 392-6 (1992)), rolling circle amplification (RCA) (Fire, A. & Xu, S. Q. PNAS 92, 4641-5 (1995)), cross priming amplification (CPA) (Xu, G. et al. Sci. Rep. 2, 246; (2012)) and loop mediated amplification (LAMP) (Notomi, T. et al. Nucleic Acids Res 28, e63 (2000)). While some isothermal amplification mechanisms depend upon multiple enzymes, e.g., nickases, recombinases, and ligases, to achieve continuous replication, RCA and LAMP require only polymerases (e.g., a polymerase described herein) and primers. These methods, like many other isothermal amplification methods, require the use of a DNA polymerase with a strong strand displacement activity to displace downstream DNA, thereby enabling continuous replication without thermal cycling. Thus, a DNA polymerase suitable for these methods must be a stable DNA polymerase with a strong strand displacement activity.

[0166] One such DNA polymerase, Phi29 DNA polymerase, is a monomeric enzyme of 66 kDa, a DNA-dependent enzyme belonging to the eukaryotic-type family of DNA polymerases (family B). Referred to as proofreading, phi29 contains an exonuclease domain that catalyzesAtorney Docket No.: 051385-638001WO3’— >5’ exonucleolysis of mismatched nucleotides, thereby enhancing replication fidelity at least 100-fold. Additionally, wild-type phi29 DNA polymerase reliably binds to single stranded DNA, and performs DNA synthesis without processivity cofactors, accounting for the highest known processivity (>70 kb) among other DNA polymerases. Strong processivity, robust strand displacement activity, and high accuracy allow the enzyme to amplify whole genomes with minimal amplification bias compared to PCR based amplification methods.

[0167] FIG. 1 provides an alignment of seven sequences described herein. A portion of the entire amino acid sequence is shown, aligning the wild type polymerase (SEQ ID NO:1) to BSTP6 (SEQ ID NO: 12), MinWT (SEQ ID NO:2), BSTP4 (SEQ ID NO:8), Whitingl8 (SEQ ID NO: 17), Phage M2 (SEQ ID NO:20), and Beecentumtrevirus (SEQ ID NO:22). The alignment highlights a negative three (-3) frameshift in amino acid positions of MinWT, BSTP4, Whitingl8, Phage M2, and Beecentumtrevirus relative to wild type, wherein the amino acid positions are shifted by three positions. For example, the amino acid position 312 of the wild type sequence corresponds to the amino acid position 309 if BSTP4 (SEQ ID NO:8). Amino acid substitutions relative to the wild type sequence are shaded.

[0168] Structural analysis of DNA polymerases portray the enzymes as analogous to a human right hand, with three domains: a ‘fingers’ domain that interacts with the incoming dNTP and paired template base, and that closes at each nucleotide addition step; a ‘palm’ domain that catalyzes the phosphoryl-transfer reaction; and a ‘thumb’ domain that interacts with duplex DNA. Compared with the structure of other family B DNA polymerases, phi29 DNA polymerase shows a common (right hand) fold containing palm, thumb and fingers subdomains. The main structural difference between phi29 DNA polymerase and other family B DNA polymerases is the presence of two additional subdomains, called TPR1 and TPR2, that are insertions between the fingers and palm subdomains. TPR2 helps to form a narrow tunnel around the downstream DNA, forcing separation of the second strand before its entry into polymerase active site. Additionally the palm, thumb, TPR1 , and TPR2 subdomains form a doughnut-shaped structure around the upstream duplex product, providing maximal DNA-binding stability which enhances processivity in a manner analogous to sliding-clamp proteins. The unique structure provides high processivity and strand displacement activity, which enables phi29 DNA polymerase to be used in isothermal multiple displacement amplification (MDA) or rolling circle amplification (RCA).Atorney Docket No.: 051385-638001WO

[0169] Diagnostics and therapeutic interventions rely on amplification technologies using the phi29 DNA polymerase and have several advantages compared to classical PCR DNA amplification methods. For example, amplification products (e.g., amplicons) synthesized by the phi29 DNA polymerase can be much larger comparing to those obtained by PCR. Additionally, isothermal DNA amplification reactions do not require special laboratory equipment such as thermal cyclers. These advantages make phi29 DNA polymerase suitable for detection and analysis of known and unknown circular viral genomes, replication of pathogenic plasmids, amplification of very small DNA samples, e.g., replication from filter paper samples containing a blood spot, and for in situ transcript and proteomic detection. The ability to perform reactions at increased temperature would be advantageous so that amplification reaction kinetics would be faster. Phi29 DNA polymerase is a typical mesophilic enzyme with an optimal reaction temperature of about 30°C to about 40°C. Elevated reaction temperatures could improve DNA amplification efficiency and decrease formation of template-independent, non-specific reaction products. The relatively low working temperature of phi29 DNA polymerase limits its application; a more thermostable enzyme could be used in many more DNA amplification techniques, generate more product, work faster, and increase amplification reaction sensitivity.

[0170] Attempts to improve phi29 DNA polymerase have been performed by introducing amino acid mutations. For example, exonuclease activity was reduced by introducing an alanine at acid moiety at positions 12, 14, or 16 as described in US Pat. No. 5,198,543, which is incorporated herein by reference. Efforts to reduce the processivity (i.e., increase the amount of time a nucleotide analog resides in the binding pocket of the enzyme) of the polymerase for single molecule applications are described in US Pub. 2008 / 0108082, which is incorporated herein by reference. Previous attempt to evolve a mesophilic polymerase was described in Povilaitis et al. Protein Engineering, Design & Selection, 2016, vol. 29 no. 12, pp. 617-628).

[0171] Enzymes commonly used in diagnostics often form fdms and aggregates, which compromise their activity, stability, and the accuracy of imaging-based detection by obscuring fluorescence signals. Film formation interference significantly reduces the reliability and sensitivity of diagnostic outcomes, particularly in detecting amplification products. Disclosed herein are mutant enzymes that improve enzyme solubility and prevent aggregation, ensuringAtorney Docket No.: 051385-638001WO clear detection and consistent, reliable performance in diagnostic applications. Additional point mutations are explored to further enhance amplification rates.Example 2. Rational Design to Minimize Enzyme Aggregates

[0172] Phi29 DNA polymerase is the replicative polymerase from the Bacillus subtilis phage phi29 belonging to the eukaryotic-type family of proofreading DNA polymerases (Family B). Phi29 DNA polymerase has the highest known processivity (>70 kb), robust strand displacement activity, and high accuracy. Therefore, the enzyme is widely used in applications such as isothermal DNA amplification and whole genome amplification. Despite its exceptional enzymatic properties, phi29 DNA polymerase frequently aggregates during in situ assay experiments and during its purification process, resulting in significant loss of yield and compromised activity and stability. Aggregation not only limits its practical utility in in situ applications but also poses challenges in large-scale production, where maintaining high enzyme solubility and yield are critical.

[0173] To address these issues, a rational design approach was employed to enhance the solubility of phi29 DNA polymerase by targeting aggregation-prone regions (APRs) for mutagenesis. A variant of phi29 DNA polymerase served as the focus of this study. Both Al phaF old-predicted models and published crystal structures of phi29 DNA polymerase were utilized as inputs for computational tools to identify APRs and propose solubility-enhancing mutations.

[0174] To elucidate the aggregation tendencies of phi29, the AlphaFold protein structure prediction tool was used to generate a high-resolution model. This structural model provided critical insights into folding dynamics, revealing potential APRs contributing to aggregation during purification and functional assays. Additionally, crystal structures representing different conformational states of phi29 DNA polymerase were incorporated: 1XHX: Standalone (unbound) structure; 1XHZ: Complex with single-stranded DNA; 2PZS: Complex with primed DNA template; 2PYJ: Ternary complex with primed DNA and nucleotide. The structures were analyzed using Aggrescan 3D (A3D), Aggrescan 4D (A4D), and SolubiS to identify APRs and design solubility-enhancing mutations. See, Conchillo-Sole, O. et al. BMC Bioinformatics 8, 65 (2007) and Joost Van Durme, et al. Protein Engineering, Design and Selection, Volume 29, Issue 8, August 2016, Pages 285-289. SolubiS combines two computational algorithms TANGOAtorney Docket No.: 051385-638001WO and FoldX to propose mutations that disrupt APRs while maintaining protein stability. TANGO identifies regions prone to beta-sheet aggregation by analyzing sequence hydrophobicity and aggregation-driving motifs. FoldX evaluates the impact of mutations on protein stability by calculating the free energy change (ddG). Stabilizing mutations decrease ddG, while destabilizing mutations increase ddG. By integrating TANGO and FoldX, SolubiS suggests mutations that reduce aggregation while preserving or enhancing protein stability. Graphical outputs highlight APRs, solubility scores, and optimal mutation recommendations. A3D predicts APRs in the folded state of proteins by analyzing 3D structures (e g., X-ray, NMR, or AlphaFold models). The tool assigns aggregation propensity scores to residues, considering their structural context. A4D extends A3D by incorporating pH-dependent aggregation properties. It uses a pH- dependent lipophilicity scale and calculates pKa values to evaluate aggregation tendencies across pH ranges (4-9).

[0175] Using the software SolubiS, we identified APRs within the phi29 AlphaF old-predicted model, proposed solubility-enhancing mutations, and ranked them based on their ability to reduce aggregation propensity and maintain protein stability. We identified over 200 variants for consideration.

[0176] Table 1. Aggregation Prone Regions (APR) in phi29.

[0177] The AlphaFold-predicted structure of phi29, color-mapped with TANGO predicted scores, was used to visualize the spatial context of each mutation. This helped to assess whether the proposed substitutions were located in critical functional regions or solvent-exposed surfaces. Mutations clustered in surface-exposed APRs with high TANGO scores were prioritized, as they are more likely to alleviate aggregation without interfering with enzymatic activity. Mutations were prioritized based on their position within the APRs, their contribution to aggregation risk,Atorney Docket No.: 051385-638001WO and their potential impact on solubility, as determined by SolubiS scores. The combined analysis of the SolubiS and the AlphaFold structural visualization provided a robust framework for selecting rational mutations, provided in Table 2. Several optimal mutations were selected for experimental validation, see table below.

[0178] Table 2. Mutations selected for experimental validation based on SolubiS.

[0179] To further refine the design of the phi29 DNA polymerase, Aggrescan 3D (A3D) and Aggrescan 4D (A4D) were employed to identify and analyze aggregation-prone regions (APRs). These tools provided detailed insights into aggregation risks by evaluating both static and dynamic structural properties of the enzyme. The analysis leveraged multiple structural models, and the results were summarized in an Excel sheet that included mutation summaries, residue scores, and APR profiles. The Aggrescan analysis included evaluations of phi29 DNA polymerase in various conformational states, leveraging both crystal structures and dynamic simulations, for example, the crystal structures analyzed with Aggrescan 3D included: Standalone structure (PDB: 1XHX): Represents the unbound enzyme, providing a baseline for identifying APRs; Primed DNA template complex (PDB: 2PZS): Reflects the enzyme interacting with a primed template, simulating its active state during DNA synthesis initiation; Ternary complex (PDB: 2PYJ): Depicts the enzyme bound to both a primed DNA template and nucleotide, representing the fully active form; Single-stranded DNA complex (PDB: 1XHZ): Captures interactions with single-stranded DNA, mimicking another functionally relevant state. Additionally, the 1XHX structure was further analyzed using Aggrescan 4D to incorporate protein flexibility and conformational dynamics. This allowed for the identification of APRs that may arise due to structural fluctuations or dynamic interactions during the enzyme’s functional cycle. Based on the Aggrescan analysis, point mutations were determined for experimental validation described in Table 3.Attorney Docket No.: 051385-638001WO

[0180] Table 3. Point mutations selected for experimental validation based on Aggrescan analysis.Attorney Docket No.: 051385-638001WO

[0181] The variants are referred to internally by an “MS” prefix and a sequential number in which they were identified. MS was chosen as a prefix in honor of Spanish biochemist Margarita Salas and her pioneering efforts to understand phi-29 enzymes (see Salas M. Bacteriophage. 2016 Dec 15;6(4):el271250). See the variants synthesized in Table 4.

[0182] Table 4. The following phi29 variants were synthesized and purified.Atorney Docket No.: 051385-638001WOAttorney Docket No.: 051385-638001WO

[0183] The phi29 variants cDNA were ordered from ATUM. The designed gene of interest were synthesized and cloned by ATUM in pD-451-SR plasmid, which harbored a T7 promoter, strong ribosomal binding site, and has a high plasmid copy number. Subsequently, the cloned pD451-SR plasmid was transformed into E. coli T7 Express (NEB) and incubated at 37°C with shaking in 2.0 L flasks containing 2xYT media until an ODeoo of 1 was reached. Then, isopropyl p- d-1 -thiogalactopyranoside (IPTG) was added to induce specific protein expression. The cells were incubated overnight at reduced temperature and collected by centrifugation. Cell pellets were stored at minus 80°C. Site directed mutagenesis to introduce the mutations described in Tables 1-3 were performed using NEB Q5 site directed mutagenesis kit (NEB cat# E0554S). Mutagenic primers were generated using NEBaseCHanger tool from NEB was used to generate primers and an annealing temperature. PCR, ligation and transformation were performed following the instructions of the Q5 site directed mutagenesis kit. Colonies were sequenced verify the positive clones with the correct mutation. Plasmids with the correct mutation were transformed into T7 Express cells (NEB cat# C2566H) following the manufacturer’s instructions. After 24 hours growth the cells (100 pL of cells) were lysed and run on an SDS- PAGE gel to ensure protein expression. 5xl08cells were harvested and washed three times with 500 pL SN-buffer (a buffer including 0.05 M Tris HC1, pH 7.5) before re-suspending in 500 pL buffer.Atorney Docket No.: 051385-638001WO

[0184] To assess the thermostability and activity of the variants, enzymes were pre-incubated in reaction buffer for variable minutes. The assay is as follows: preincubation of the enzyme in reaction buffer without additives for 0, 10, 20 and 40 minutes at 42°; and RCA at 37°C for 30 min after addition of dNTPs and dye. In this assay, the pre-incubation step was carried out in the absence of additives or substrates to allow for a more accurate assessment of the intrinsic thermostability of the enzyme variants. The pre-binding step and subsequent RCA followed the pre-incubation, ensuring consistent conditions for evaluating enzyme stability and activity.

[0185] Rolling Circle Amplification (RCA) Assays: RCA reactions contained reaction buffer, 0.1 nM single- stranded circular DNA, 4 pM SYTO™ 9 intercalating dye (ThermoFisher) to monitor amplification, primers, and 60-450 nM of the phi29 variant in a total reaction volume of 25 pL. Reactions were monitored using CFX96™ Real-Time System (Bio-Rad) by incubating at 37°C for 2 hours and taking measurements with the SYBR filter every minute followed by 20 minutes at 65°C to heat inactivate protein.

[0186] Each assay included testing the variant in triplicate, and suitable controls (e.g., wildtype and / or a no-template control). Upon binding to a dsDNA amplification product, the nucleic acid stain provides a strong fluorescent signal, the intensity of which is proportional to the amount of amplification products being generated captured as relative fluorescence units (RFU). The fluorescence of the nucleic acid stain is continuously monitored over time to a point of saturation.

[0187] Table 5. Quantifying rolling circle amplification rates of phi29 variants at pH 7.5. The relative amplification rate is calculated as the relative fluorescence units per minute, relative to a control, MS-65. To clearly convey relative amplification performance, results were expressed using a symbolic scoring system based on defined ranges. The control was assigned a baseline score of 1-10, represented by a single plus symbol (+). Amplification rates in the range of 11-20 were represented by two plus symbols (++), values of 21-30 by three plus symbols (+++), and so forth, increasing by one plus symbol for every additional increment of 10.Attorney Docket No.: 051385-638001WO

[0188] As evidenced by the data presented in Table 5, the enzyme variants MS-280, MS-306, MS-307, MS-308, and MS-309, represented by two plus symbols (++), exhibited amplification performance approximately 2-3 fold better than the control. Variants MS-238 and MS-310, represented by three plus symbols (+++), demonstrated performance approximately 3-4 fold better. The variant MS-312, represented by four plus symbols (++++), showed performance approximately 4-5 fold better. Finally, MS-311 and MS-314, with five plus symbols (+++++), achieved performance 5-fold better than the control.

[0189] Additional experimentation was performed to assess the thermal stability of the variants. This time course experiment tested enzyme incubation time at 42°C in reaction buffer at pH 7.5. In particular, the thermal stability assay included an additional enzyme pre-incubation step in reaction buffer at 42°C for 0, 5, 10, 20, or 40 min and RCA at 37°C for 30 minutes-2 hours. Data is presented in FIG. 2A-2B, demonstrating that incubation at 42°C dramatically reduced the amplification rate of MS-65, yet the variants MS-238, MS-254, MS-262, MS-296, MS-311, and MS-312 all demonstrate significant amplification and enhanced thermal stability.Atorney Docket No.: 051385-638001WOMS-311 and MS-312 contains the mutation in MS-238 and an additional mutation. The effect of the mutations seems to be additive, resulting in MS-311 and MS-312 capable of generating significant amplification products following 42°C incubation.Example 3. Modifications to Protein Surfaces to Enhance Solubility

[0190] Modifying the solvent accessible amino acids represents any alternative means for enhancing solubility and minimizing aggregation. For example, methyl-polyetheylene glycol (MS(PEG)n) reagents enable simple and efficient modifications of protein and other molecules with primary amines. Modification results in the addition of PEG spacers (PEGylation) with terminal methyl groups. The PEG spacer is hydrophilic (water-soluble) and this property is transferred to the macromolecule. Consequently, PEGylation of proteins and peptides can significantly increase water solubility and reduce aggregation, often without adversely affecting their biological activities.

[0191] N-Hydroxysuccinimide (NHS) esters are the most popular type of reactive group used for protein modification. In pH 7-9 buffers, NHS-ester reagents react efficiently with primary amino groups (-NH2) by nucleophilic attack, forming amide bonds and releasing the NHS. Proteins typically have many sites for labeling, including the primary amine in the side chain of each lysine (K) residue and the N-terminus of each polypeptide. The MS(PEG)n reagents are readily soluble in water or organic solvents such as DMSO, methylene chloride and DMF.

[0192] PEG reagents, MS(PEG)4, MS(PEG)12 and MS(PEG)24 were dissolved in DMSO to have a 250 mM stock solution. Wild type variants were dialyzed using Slide- A-Lyzer MINI Dialysis Device (Thermo, cat# 69576) in IL PBS + 5% glycerol for 2h. The concentration of the enzyme was measured after dialysis and 2x, 5x and lOx molar excess of MS(PEG)4, MS(PEG)12 or MS(PEG)24 were added to the enzyme in 50 ml-Falcon tube and incubated on ice while stirring using a stir bar (VWR cat# 58949-272). When the PEGylation reaction is completed the enzyme was dialyzed using Slide- A-Lyzer MINI Dialysis Device (Thermo, cat# 69576) in IL enzyme storage buffer for 2h to remove the NHS leaving group, excess of PEG reagent, and PBS.

[0193] An RCA experiment tested variants and PEGylated variants in RCA reaction buffer at pH 7.5 at 42°C. In particular, the enzyme pre-binding step was at 42°C for 30 min followed by RCA reaction at 42°C for 2 hours, and an additional enzyme pre-incubation step in reactionAtorney Docket No.: 051385-638001WO buffer at 42°C for 10 min before the pre-binding step. In the pre-incubation step, enzyme is incubated in reaction buffer without any additives (e.g., BSA) or substrates to help better assess thermostability of variants themselves. The data presented in FIG. 3 shows that the variants are stable and generate significant amplification products when performed RCA at 42°C. The data in FIG. 3 shows the amplification in relative fluorescence units (RFU per minute) at 42°C.

[0194] Under standard assay conditions, padlock probes (PLPs) hybridize with high specificity to target RNA molecules that are preserved within the spatial boundaries of the tissue section. This specificity enables accurate spatial transcriptomic profiling. However, in certain experimental runs, transcript reads were observed beyond the defined tissue boundary. This anomalous off-tissue signal is hypothesized to originate from RNA diffusion that occurs prior to the rolling circle amplification (RCA) step. Potential sources of this diffusion include assay stages such as tissue rehydration, enzymatic digestion, or other pre-amplification treatments, wherein the integrity of cellular or tissue compartmentalization may be transiently compromised. To mitigate such diffusion artifacts, the performance of pegylated enzymes was evaluated. In particular, a polymerase bearing covalently linked polyethylene glycol (PEG) moieties (e.g., MS- 65+PEG24) was tested on formalin-fixed paraffin-embedded (FFPE) tonsil and brain tissues using the G4X sequencing platform with the standard 300 gene immune-oncology transcript panel. To isolate the functional contribution of PEGylation from potential crowding effects attributable to PEG alone, control experiments included: (i) unpegylated enzymes supplemented with free PEG in the buffer system, and (ii) comparative assays lacking PEG altogether. FIGS. 4A-4C present in situ sequencing data demonstrating that off-tissue signal clusters are substantially reduced when using the MS-65+PEG24 enzyme formulation. These findings suggest that PEGylation of the enzyme, rather than the presence of PEG as a solution additive, plays a central role in retaining RNA molecules within the physical boundaries of the tissue. The effect is likely due to steric modulation or altered enzyme diffusion dynamics imparted by the PEG moieties, thereby preserving spatial fidelity during early assay stages.Example 4. Point of Care Applications

[0195] Point-of-care (POC) diagnostics are medical tools, instruments, compositions and / or devices enabling disease diagnosis, typically within in a patient community and outside a traditional hospital setting. The ideal diagnostic test should meet the “ASSURED” criteria:Atorney Docket No.: 051385-638001WOAffordable, Sensitive, Specific, User-friendly, Rapid and robust, Equipment-free and Delivered to those who need it. POC methods are preferably simple and do not require a heat source or stable power supply as these are typically not available at POC. Thus, enzymes and reagents used should work at ambient temperatures.

[0196] Detecting biomolecules, such as nucleic acids, typically require some sort of amplification to produce a sufficient quantity to enable robust detection. PCR technology has a high potential, however it still has strict limitations and requires the use of electrically powered thermal cycling equipment for repeated heating and cooling processes and skilled personnel to run the equipment. Non-PCR based methods, in particular Isothermal Amplification (IA) methods, have emerged as promising approaches. In these methods, nucleic acid amplification takes place at constant temperatures and has no need for high precision temperature cycling and control, or enzymes stable at high temperatures. Isothermal amplification methods are reported to have analytical sensitivities and specificities comparable to PCR as well as a higher tolerance to inhibitory compounds, while allowing shorter time to results and easier use. These features make isothermal amplification methods highly desirable for those developing POC molecular diagnostics platforms and aiming to meet “ASSURED” criteria. A number of different methods have in the last decade been published for isothermal amplification of nucleic acids, such as for example as summarized in Gill P, Ghaemi A. Nucleic acid isothermal amplification technologies: a review. Nucleosides Nucleotides Nucleic Acids. 2008 Mar; 27(3):224-43; and / or Botella JR. Point-of-Care DNA Amplification for Disease Diagnosis and Management. Annu Rev Phytopathol. 2022 Aug 26; 60: 1-20. Isothermal amplification approaches typically rely on the inherent strand displacement activity of the polymerase used in the reaction. The term strand displacement describes the ability of the polymerase to displace downstream DNA encountered during synthesis / extension.

[0197] Provided herein are polymerases and enzymes, capable of being used in point-of-care applications and / or isothermal amplification techniques. For example, a sample including a nucleic acid of interest (e.g., a pathogenic nucleic acid sequence, such as SARS-CoV-2) is combined in a reaction vessel with a circularizable (e.g., a padlock probe) template, with a ligase (e.g., SplintR® ligase), a polymerase as described herein, a plurality of nucleotides, and a colorimetric probe, such as RHthio-CuSC . The colorimetric probe changes color upon toAtorney Docket No.: 051385-638001WO formation of pyrophosphate resulting from isothermal gene amplification if the nucleic acid of interest is present.Example 5. Anti-CRISPR protein detection

[0198] Rolling circle amplification (RCA) is a robust and straightforward method for isothermal enzymatic nucleic acid amplification. By utilizing a circular template and a DNA or RNA primer, RCA enables the detection of nucleic acid targets at low concentrations with exceptional specificity under isothermal conditions. Linear RCA, the fundamental form of RCA- derivative technologies, employs a single primer and a corresponding circular template to initiate amplification.

[0199] The CRISPR-Cas system, originally identified as a prokaryotic adaptive immune mechanism, has transformed molecular biology by enabling the development of precise and highly tunable genome manipulation tools. CR1SPR-Cas9, in particular, has facilitated the precise editing of nearly any genome. Controlling Cas9 function, however, remains a significant challenge, as excessive activity and off-target effects can result in unintended genetic modifications, raising safety concerns. With the increasing approval of gene-editing therapies, these challenges underscore the critical need for precise regulation and control mechanisms within CRISPR-based technologies to mitigate off-target effects and ensure therapeutic safety.

[0200] Anti-CRISPR proteins (Acrs) represent a natural mechanism for regulating CRTSPR systems, serving as inhibitors of Cas9 activity. As a valuable tool for modulating Cas9 function, Acrs enable precise control over genome editing processes. Current methods for identifying and analyzing Acrs, such as gel electrophoresis and cell-based reporter assays, are labor-intensive, semi-quantitative, and lack sufficient sensitivity. The shortcomings of these conventional techniques create an unmet need for the development of more precise, rapid, and sensitive methods to detect and quantify Acrs. RCA, with its simplicity, rapid reaction kinetics, and high sensitivity, provides an attractive platform for addressing these limitations and enhancing Acr detection.

[0201] An isothermal amplification method for the detection of anti-CRISPR proteins (Acrs), triggered by the CRISPR-Cas9 system, is contemplated herein. The assay centers on a padlock probe containing a designed N20 sequence and a cleavable site, specifically engineered for recognition by Cas9. In the absence of an Acr, the padlock probe hybridizes with the primer andAtorney Docket No.: 051385-638001WO is ligated to form a circular DNA template, which serves as the substrate for subsequent cleavage by the Cas9-sgRNA complex (RNP). Under AcrIIA5 inhibition, the circular DNA template evades cleavage by Cas9, triggering rolling circle amplification (RCA) with an enzyme described herein to produce abundant amplification products. Amplification products are then probed by SYBR Green I, a fluorescent dye that binds to DNA, generating a signal proportional to the concentration of Acrs in the sample.SEQUENCES

[0202] WT; SEQ ID NO: 1 :MKHMPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQA DLYFHNLI<FDGAFIINWLERNGFI<WSADGLPNTYNTIISRMGQWYMIDICLGYI<GI<RI<I HTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEA LLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLND RFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFEL KEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFK ATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALG FRLGEEETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDI VDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAG MTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0203] MinWT; SEQ ID NO:2:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYF HNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVI YDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQF KQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKE KEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYI PTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKK LGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKI KKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0204] Salasvirus phi29; SEQ ID NO: 3 :MKHMPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQA DLYFHNLI<FDGAFIINWLERNGFI<WSADGLPNTYNTIISRMGQWYMIDICLGYI<GI<RI<I HTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEA LLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLND RFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFEL KEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFK ATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGAtorney Docket No.: 051385-638001WOFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTETPDVIKDI VDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAG MTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0205] DNA polymerase Bacillus phage phi29.1 : SEQ ID NO:4:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYF HNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQF KQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKE KEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYI PTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKI KKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0206] DNA polymerase, partial Tannerella sp. oral taxon BU063 isolate; SEQ ID NO:5:GGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHI QHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPY LKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGT EIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDI KFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0207] DNA polymerase Tannerella sp. oral taxon BU063 isolate Cell 6 / 7 / 9; SEQ ID NO:6:MYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKS SGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDPVYTPMGVFITAW ARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLR QKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKM KPKPVQVPGGVVLVDDTFTIK

[0208] Escherichia coli HPP; SEQ ID NOY:MASMTGGQQMGRIRMPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDE FMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMID ICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYA YRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPL HIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLY NVEYISGLKFKATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGK VPYLKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHL TGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYAtorney Docket No.: 051385-638001WOTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIKLEHHHHHH

[0209] DNA polymerase Bacillus phage BSTP4; SEQ ID NO:8:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYF HNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIAFEGKYVWDEDYPLHIQHIRCEFELKEGYI PTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDP VYTPMGVFIT AW ARYTTIT AAQ AC YDRIIYCDTD S IHLTGTEIPD VIKDI VDPKK LGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0210] Bacillus phage phi29 UPP.l; SEQ ID NO:9:MKHMPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKI HTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAER LLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLND RFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALG FRLGEEETKDP VYTPMGVFIT AW ARYTTIT AAQACYDRIIYCDTDSIHLTGTEIPDVIKDI VDPKKLGYWAHESTFKRVKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAG MTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0211] Bacillus phage phi29 UPP.2; SEQ ID NOTO:MKHMPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLI<FDGAFIINWLERNGFI<WSADGLPNTYNTIISRMGQWYMIDICLGYI<GI<RI<IHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAERLLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKVTTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALG FRLGEEETKDP VYTPMGVFIT AW ARYTTIT AAQ AC YDRIIYCDTDSIHLTGTEIPD VIKDI VDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0212] Chain A, DNA polymerase Salasvirus phi29; SEQ ID NO: 11 :MKHMPRKMYSCAFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQA DLYFHNLKFAGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIAtorney Docket No.: 051385-638001WOHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEA LLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLND RFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFEL KEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFK ATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALG FRLGEEETKDPVYTPMGVF1TAWARYTT1TAAQACYDR11YCDTDS1HLTGTE1PDV1KD1 VDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAG MTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0213] Primer terminal protein Bacillus phage BSTP6; SEQ ID NO: 12:MKHMPRKMYSCDFETTTKVEDCRVWAYGYMNIENHSEYKIGNSLDEFMAWVLKVQA DLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKI HTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEA LLIQFKQGLDRMTAG SD SLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLND RFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFEL KEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWVSNVDLELMKEHYDLYNVEYISGLKFK ATTGLFKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALG FRLGEEETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDI VDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAG MTDKIKKEVTFDNFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0214] Bacillus velezensis HP; SEQ ID NO: 13:MNIEDHSDYKIGNSLDEFMAWAMKVQADLYFHNLKFDGAFIINWLERNGFKWSADGL PNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDI DYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDSLKGFKDIITTKKF KKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPY GEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADL WLSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTYIKTTSEGAIKQLAKLM LNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITA AQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIY MKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPG GVVLVDDTFTIK

[0215] Bacillus velezensis, HPP.2; SEQ ID NO: 14:MSRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSDYKIGNSLDEFMAWAMKVQADLY FHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVI YDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQF KQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKE KEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYI PTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDPVYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKAtorney Docket No.: 051385-638001WOLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0216] DNA polymerase Bacillus phage Arbol; SEQ ID NO: 15:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYEITPDEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWVSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTHIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACFDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTTIKFSVKCAGMTDKIKKEVTFDNFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0217] H3027_gp06 Bacillus phage PZA; SEQ ID NO: 16:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYEITPDEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDDLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWVSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTHIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACFDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTTIKFSVKCAGMTDKIKKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0218] Bacillus phage Whitingl8; SEQ ID NO:17:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYEITPDEYAYIKNDIQIIAEALLIQFKQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGEGMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWVSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFIDKWTHIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDPVYTPMGVFITAWARYTTITAAQACFDRIIYCDTDSIHLTGTETPDVIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTTIKFSVKCAGMTDKIKKEVTFDNFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0219] DNA polymerase Bacillus phage vB_BveP-Goe6; SEQ ID NO: 18:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIAtorney Docket No.: 051385-638001WOYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQF KQGLDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKE KEIGEGMVFDVNSLYPSQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDPVYTPMGVF1TAWARYTT1TAAQACYDR11YCDTDS1HLTGTEIPDV1KD1VDDNK LGYWAHESTFKRAKYLRQKTYIQDIYMKEVNGKPVPASPDDYTFIKFSVKCAGMTDKI KKEVTFDNFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0220] DNA polymerase protein Bacillus phage TB A3; SEQ ID NO: 19:MPRKMYSCDFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVMKVQADLY FHNLKFDGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLKKLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQF KQGLDRMTAGSDSLKGFKDIITAKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKD KEIGEGMVFDVNSLYPSQMYTRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQIKRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGL FKDFIDKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGE EETKDP VYTPMGVFIT AW ARYTTIT AAQ AC YDRIIYCDTD S IHLTGTEIPD VIKDI VDDNK LGYWAHESTFKRAKYLRQKTYIQDIYMKEVDGKPVPASPDDYTFIKFSVKCAGMTDKI KKEVTFENFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK

[0221] Bacillus phage M2; SEQ ID NO:20:MSRKMFSCDFETTTKLDDCRVWAYGYMEIGNLDNYKIGNSLDEFMQWVMEIQADLYF HNLKFDGAFIVNWLEQHGFKWSNEGLPNTYNTIISKMGQWYMIDICFGYKGKRKLHTVI YDSLKKLPFPVKKIAKDFQLPLLKGDIDYHTERPVGHEITPEEYEYIKNDIEIIARALDIQF KQGLDRMTAGSDSLKGFKDILSTKKFNKVFPKLSLPMDKEIRKAYRGGFTWLNDKYKE KEIGEGMVFDVNSLYPSQMYSRPLPYGAPIVFQGKYEKDEQYPLYIQRIRFEFELKEGYIP TIQIKKNPFFKGNEYLKNSGVEPVELYLTNVDLELIQEHYELYNVEYIDGFKFREKTGLF KDFIDKWTYVKTHEEGAKKQLAKLMLNSLYGKFASNPDVTGKVPYLKDDGSLGFRVG DEEYKDPVYTPMGVFITAWARFTTITAAQACYDRIIYCDTDSIHLTGTEVPEIIKDIVDPKKLGYWAHESTFKRAKYLRQKTYIQDIYVKEVDGKLKECSPDEATTTKFSVKCAGMTDT IKKK VTFDNF AVGF S SMGKPKPVQ VNGGV VL VD S VFTIK

[0222] DNA polymerase Bacillus phage vB BsuP-Goel; SEQ ID NO:21 :MSRKMFSCDFETTTKLDDCRVWAYGYMEIGNLDNYKIGNSLDEFMKWVMEIQADLYF HNLKFDGAFIVNWLEQHGFKWSNEGLPNTYHTIISKMGQWYMIDICFGYRGKRKLHTVI YDSLKKLPFPVKKIAKDFQLPLLKGDIDYHTERPVGHEITPEEYEYIKNDIEIIARALDIQF KQGLDRMTAGSDSLKGFKDILSTKKFNKVFPKLSLPMDKEIRKAYRGGFTWLNDKYKE KEIGEGMVFDVNSLYPSQMYSRPLPYGAPIVFQGKYEKDEQYPLYIQRIRFEFELKEGYIP TIQIKKNPFFKGNEYLKNSGAEPVELYLTNVDLELIQEHYELYNVEYIDGFKFREKTGLF KDFIDKWTYVKTHEEGAKKQLAKLMLNSLYGKFASNPDVTGKVPYLKDDGSLGFRVG DEEYKDPVYTPMGVFITAWARFTTITAAQACYDRIIYCDTDSIHLTGTEVPEIIKDIVDPKAtorney Docket No.: 051385-638001WOKLGYWAHESTFKRAKYLRQKTYIQDIYVKEVDGKLKECSPDEATTTKFSVKCAGMTDTIKKKVTFDNFKVGF S SMGKPKPVQVNGGVVL VD S VFTIK

[0223] DNA polymerase Beecentumtrevirus Nf; SEQ ID NO:22:MSRKMFSCDFETTTKLDDCRVWAYGYMEIGNLDNYKIGNSLDEFMQWVMEIQADLYF HNLKFDGAFIVNWLEQHGFKWSNEGLPNTYNTIISKMGQWYMIDICFGYRGKRKLHTVI YDSLKKLPFPVKKIAKDFQLPLLKGDIDYHTERPVGHEITPEEYEYIKNDIEIIARALDIQF KQGLDRMTAGSDSLKGFKDILSTKKFNKVFPKLSLPMDKEIRKAYRGGFTWLNDKYKE KEIGEGMVFDVNSLYPSQMYSRPLPYGAPIVFQGKYEKDEQYPLYIQRIRFEFELKEGYIP TIQIKKNPFFKGNEYLKNSGVEPVELYLTNVDLELIQEHYELYNVEYIDGFKFREKTGLF KDFIDKWTYVKTHEEGAKKQLAKLMLNSLYGKFASNPDVTGKVPYLKDDGSLGFRVG DEEYKDPVYTPMGVFITAWARFTTITAAQACYDRIIYCDTDSIHLTGTEVPEIIKDIVDPK KLGYWAHESTFKRAKYLRQKTYIQDIYVKEVDGKLKECSPDEATTTKFSVKCAGMTDTIKKKVTFDNF AVGF S SMGKPKPVQVNGGVVL VD S VFTIK

[0224] DNA polymerase Beecentumtrevirus Bl 03; SEQ ID NO:23:MPRKMFSCDFETTTKLDDCRVWAYGYMEIGNLDNYKIGNSLDEFMQWVMEIQADLYF HNLKFDGAFIVNWLEHHGFKWSNEGLPNTYNTIISKMGQWYMIDICFGYKGKRKLHTVI YDSLKKLPFPVKKIAKDFQLPLLKGDIDYHAERPVGHEITPEEYEYIKNDIEIIARALDIQF KQGLDRMTAGSDSLKGFKDILSTKKFNKVFPKLSLPMDKEIRRAYRGGFTWLNDKYKE KEIGEGMVFDVNSLYPSQMYSRPLPYGAPIVFQGKYEKDEQYPLYIQRIRFEFELKEGYIP TIQIKKNPFFKGNEYLKNSGAEPVELYLTNVDLELIQEHYEMYNVEYIDGFKFREKTGLF KEFIDKWTYVKTHEKGAKKQLAKLMFDSLYGKFASNPDVTGKVPYLKEDGSLGFRVG DEEYKDPVYTPMGVFITAWARFTTITAAQACYDRIIYCDTDSIHLTGTEVPEIIKDIVDPK KLGYWAHESTFKRAKYLRQKTYIQDIYAKEVDGKLIEC SPDEATTTKF S VKCAGMTDTI KKKVTFDNFRVGF S S TGKPKP VQ VNGGVVL VD S VFTIK

[0225] DNA polymerase Cytobacillus phage Bfspl; SEQ ID NO:24:MARKQFMCDFETTTDIDDCRVWAYGYMEIGNKQNYKIGNSIEEFMEWAEKSRSDIYFH NLKFDGSFIVNWLLNNGYTWEHMNEKSGKPKTFTTIISNMGQWYMVDICYGRKGKRLL HTKIYDSLKKLPFSVKQIAKAFKLPIMKGDIDYTKPRPVGYEITPDEEEYIYGDLFIVASAL ETQFEQGLTKMTSGSDSLSGFKDILTPKMFDKFFPVLDLRIDSEIRKAYRGGFTWVNDTI QGQTIGEGMVFD VNSL YP SRMYDCDLPYGTPEKFEGEYTYNET YPL YIQ VLKC SFELKE GYIPTIQLKQTARYRDNEYLKSSNHEIETLYVTNVDLELIKEHYDLYDVEYLGGYMFKK KNDLFREFIDYWMHIKITSTGAIKQLAKLMLNSLYGKFASNPVVTGKIPYLKEDGRNGF KLPVKEGEFQEVKGKLIPVIDEEYKEPVYTAMGAFITAWARHYTITTAQKCFDRICYCDTDSIHIKGTEIPEVIKDIIDPDKLGYWNHESTFIRAKFIRQKTYIEDTCFKMVEKNGKLEKVG AGLDDYEFTEIEVKCAGMPENLKKYVTWENFNVFNEDMLLPARDGYWYGKLMPKQVP GGVVLVESDFA1R

[0226] DNA polymerase Bacillus phage vB Bpu PumAl; SEQ ID NO:25:Atorney Docket No.: 051385-638001WOMARKKYSCDFETTTDPLDCRVWAYGYMEIGKDSNYKIGNSLDEFMEWVSKCNADLYF HNLRFDGEFILIWLLQNGFKWSDKRKPEPMTFNGVISRDNAVYRYDICYGYTNSGKKIH TVIYDSYKKLPYPVKVIAKAFNLTQLKGDIDYDAYRPVGHKITKEEYKYIYNDIKIIADA LKIQFEQGLKKMTIGSDSLNGFKSIFGKKQFEKTFPVLDMLTDDFIRLSYKGGFTWLNPK FANIVINKGRVYDVNSMYPAIMYNELLPYGVPVRFKGKYEKDDKYPLYIQQISCIFELKE GK1PM1QVKNEPLKFKGSEYLTSSKGYEVKLTLTNVELELFLENYKLNCVEYLGGYKFR GVRGLFKTFIDKWMNIKMNSEGAIRELAKLMLNNLYGKFATNPDVTGKYPELKEDGSL GFKMKPRELSEPVYTAMGSFITAYGRCMTVRTGQSCYDRFIYADTDSVHVAGNEDIPEI ADKIDSKKLGYWDHEATFETGKYVRSKAYFLNLYAKKVVKDGEEIIKPCGEEEATTRK RKVACAGMPETLRNIVPFEEFKIGYTGTRLAPRHVKGGIVLVDAPYTLKEDIWRYA

[0227] FF38 11041 Lucilia cuprina; SEQ ID NO:26:MARI<RPIRITI<NDRAEYI<RLSKNAI<SI<LNRTVI<NYGIDLSNDVDIPI<LSDFI<TRKEFND WKQKITSFTSRSNQEYQFRKNEYGVVASVKELNEIKRNTKKAQKLAKEKIDKAMKLDFY VEGERQGKVKDRIKLMKKEEVAGVSVPVDFDFDKIQTRKRLEDKAGFMEERATGDYY RKKDIQMKENFISMIEQGFNSDADEVIKKLKKIPPDDFVELTIVTDEIDFRNYGSKNEGGI NDEDKLEELNNTLNDYFNETTTDVNDCRVWAYGWMEIGKTSNYKIGTDFNEFMEWMI HS S SRLYFHNLKFDGSFIVNWLLHNGYTWTKRPSKEGQFSTLISKMGQWYGITIC SGRD GRKKKLTTIHDSLKKLPFPVRKIGKDFKLNVLKGDIDYHKPRPIGYEIDDEEYQYIKNDIQ IIAEALEVQTVQGLTGMTNGKDALDEFVNMSGKLYEKLFPVFSLELNEEIRKAYRGGFT WLNPVYGTKKYVKDGIVFDVNSLYPSQMYDRDLPCGVPIPFEGEYVYDKSHPLYIQKLT FEFELKENYIPTIQLKNSRFGFKGNEYLSSSNGERITISVSSVDWELIREHYHVYDVEFEK GWKFRSTKQAFRQYIDKWMLVKNMSAGAKKAIAKLMLNSLYGKFATNPDITGKRPYL REDGSNGFELMEEEFRDPVYTPVGIFITSWARYTTITSAQKCYDRIIYCDTDSMHLEGLD VPESIKDIVADDVLGYWKKEGQFKQGKFIRQKTYMEEYYAKYVRDENGEIKYDDEKPI KTICDKEESDTTIIEIKCAGMPDNIKKHVTFDNFDIGFTMEGKLKPKQVYGGVVLVEETY TMK

[0228] DNA polymerase Bacillus phage BeachBum; SEQ ID NO: 27:MGNKKRKIYSCDFETTTDVNDCRVWAYGLMEIDGKFENYKEGNNIDEFMEWAEKEQG DLYFHNLRFDGEFIVNWLLHKGYRFNNTRKAGTFNAVISSMGQWYKIDIYYGREGKKV FKTSIYDSLKKLPFPVKTIAKAFKLPIEKGDIDYDAPRPVGHQITPDESKYIKNDVEIIARA LHSQLNTAKLTKMTIGSDALDGFKHSLHKSPKVSKRMYDHHFPVISNAIHEEFKKAYRG GFTWANPKYAGKVIGNGLVFDVNSLYPSVMYDKPLPYGLPVPFSGEYEYDETHPLFIQH IRCGFELKEGHIPTIQIKKNFRFADNEYLHSSEGN1LDLHVTNVDLALIKEHYTLYEEEYL QGYKFKQVTGLFKNYIDYWSDKKINAEDPAIRQMAKLMLNSLYGKFGTSIDVTGKEVF LKEDGSTGFRKGQKEERDPVYMPMGAFITAYARDVTIRTAQKCYDRILYCDTDSIHLVG TEIPEAIKDRIHDKKLGYWAHESTFWRAKFIRQKTYIEDLCMRFEGERVNGEWKFKMVE EKD1TKATARELSVKCAGMPAQVKQYVTFDNFGVDFKHDPNDFTEEE1KRKNIKFKLKP THRKGGQVLVPTPFTIK

[0229] DNA polymerase Bacillus phage Harambe; SEQ ID NO:28:Atorney Docket No.: 051385-638001WOMGNKKRKIYSCDFETTTDVNDCRVWAYGLMEIDGKFENYKEGNNIDEFMEWTEQEQG DLYFHNLRFDGEFIVNWLLHKGYRFNNTRKAGTFNAVISSMGQWYKIDIYYGREGKKV FKTSIYDSLKKLPFPVKTIAKAFKLPIEKGDIDYDAPRPVGHQITPDESKYIKNDVEIIARA LHSQLNTAKLTKMTIGSDALDGFKHSLHKSPKVSKRMYDHHFPVISNAIHEEFKKAYRG GFTWANPKYAGKVIGNGLVFDVNSLYPSVMYDKPLPYGLPVPFSGEYEYDETHPLFIQH 1KCGFELKDGH1PT1Q1KKNFRFADNEYLHSSEGN1LDLHVTNVDLAL1KEHYTLYEEEYL QGYKFKQVTGLFKNYIDYWSDKKINAEDPAIRQMAKLMLNSLYGKFGTSIDVTGKEVFLKEDGSTGFRKGQKEERDPVYMPMGAFITAYARDVTIRTAQKCYDRILYCDTDSIHLVG TEIPEAIKDRIHDKKLGYWAHESTFWRAKFIRQKTYIEDLCMRFEGEKVNGEWKFKMVE EKDITKATARELSVKCAGMPAQVKQYVTFDNFGVDFKHDPNDYTDEEIKRKNIKFKLK PTHRKGGQVLVPTPFTIK

[0230] Enterococcus faecium HP.l; SEQ ID NO:29:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKNDIQ IMAMALKIQFDQGLTRMTRGSDALGDYQDWVKTTYGKSRFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP L YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNEYLD S S VNKLGVDELIDLTLTNVDLELFFEH YDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYCD TDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0231] Enterococcus faecium HP.2; SEQ ID NO:30:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSGKT KKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKNDIQI MAMALKIQFDQGLTRMTRGSDALGDYQDWVKTTYGKSRFKQWFPILSLGFDKDLRKA YKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYPL YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNEYLD S S VNKLGVDELIDLTLTNVDLELFFEHY DILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYCDT DSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMP DRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0232] Enterococcus faecium HP.3; SEQ ID NO:31 :MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWCKEAKEDRTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKND IQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKTTFKQWFPTLSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNEYLD S S VNKLGVDELIDLTLTNVDLELFFEAtorney Docket No.: 051385-638001WOHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTSEGARKANAKGMLNSLYGKFGT NPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYCDTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAG MPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVL VDTMFTIK

[0233] Enterococcus faecium HP.4; SEQ ID NO:32:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWCKEAKEDRTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYQDWVKTTYGKSRFKQWFPILSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S S VNKLGVDELIDLTLTNVDLELFFE HYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTSEGARKANAKGMLNSLYGKFGT NPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYC DTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAG MPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVL VDTMFTIK

[0234] Enterococcus faecium HP.5; SEQ ID NO:33:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYQDWVKTTYGKSRFKQWFPILSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S S VNKLGVDELIDLTLTNVDLELFFE HYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGT NPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYC DTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAG MPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVL VDTMFTIK

[0235] Enterococcus faecium HP.6; SEQ ID NO:34:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP L YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S SVNKLGVDELIDLTLTNVDLELFFEH YDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCD TDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVL VDTMFTIK

[0236] Enterococcus faecium HP.7; SEQ ID NO:35:Atorney Docket No.: 051385-638001WOMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP LY1QN1KVRFRLKEGY1PT1QVKQSSLF1QNEYLDSSVNKLGVDEL1DLTLTNVDLELFFEH YDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCD TDSIHLVGTDAPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0237] Enterococcus faecium HP.8; SEQ ID NO:36:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVNNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP LYIQNIKVRFRLKEGYIPTIQVKQS SLFIQNEYLD SSVNKLGVDELIDLTLTNVDLELFFDH YDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCD TDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0238] Enterococcus faecium HP.9; SEQ ID NO:37:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP L YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNEYLD S SVNKLGVDELIDLTLTNVDLELFFEH YDILEIHYTYGYMFI<ASCDMFKGWIDI<WIEVI<NTTEGARI<ANAKGMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCD TDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0239] Enterococcus faecium HP.10; SEQ ID NO:38:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTTKSGK MKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFF DHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGAtorney Docket No.: 051385-638001WOTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0240] Enterococcus faecium HP.l 1; SEQ ID NO:39:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFFEHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFG TNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0241] Enterococcus faecium HP.12; SEQ ID NO:40:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWSKEAKEDRTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKDKVRTIIYDSLKKYPFPVKQIAEAFDFPIKKGEIDYNKERPIGYNPTDDEWEYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFFEHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFG TNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYCDTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0242] Enterococcus faecium HP.13; SEQ ID NO:41:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWNIKYTTTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFSIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPVLSLGFDKDLRKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQN IK VRF RLKEG YI PTIQ VKQS SLFIQNEYLES S VNKLGVDELIDLTLTNVDLELFF EHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFG TNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLTGTEVPETIEHLVDSKKLGYWKHESTFQRAKFIRQKTYVEEIDGELNVKCAG MPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0243] Enterococcus faecium HP.14; SEQ ID NO:42:Atorney Docket No.: 051385-638001WOMMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVNNITFGLDIDSFFEWCEMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLY1QN1KVRFRLKEGY1PT1QVKQSSLF1QNEYLDSSVNKLGVDEL1DLTLTNVDLELFF DHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFG TNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0244] Enterococcus faecium HP.15; SEQ ID NO:43:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWCKEAKEDRTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKND IQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKTTFKQWFPILSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQ N I K VRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S S VNKLGVDELIDLTLTNVDLELFFE HYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGT NPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYC DTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0245] Enterococcus faecium HP. 16; SEQ ID NO:44:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWCKEAKEDRTFSTLISNMGQWYALEICWEVNYTTTKSG KTKKEKSRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPIGYKPTKDEWEYLKND IQIMAMALKIQFDQGLTRMTRGSDALGDYQDWVKTTYGKSRFKQWFPILSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQ N I K VRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S S VNKLGVDELIDLTLTNVDLELFFE HYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGT NPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQKCFDRIIYC DTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0246] Enterococcus faecium HP.17; SEQ ID NO:45:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNFDNITFGLDIDSFFEWCEMQGSTDIYFH NEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSGK TKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKND IQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLR KAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDY PL YIQ N I K VRFRLKEGYIPTIQ VKQ S SLFIQNE YLD S S VNKLGVDELIDLTLTNVDLELFFD HYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTAtorney Docket No.: 051385-638001WONPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYC DTDSIHLVGTEVPEAIDHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0247] Enterococcus faecium HP.18; SEQ ID NO:46:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWEYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMHVRPLPYGTPLFYEGEYKPNNDYP L YIQNIKVRFRLKEGYIPTIQ VKQ SSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFFEH YDILEIHYTYGYMFI<ASCDMFKGWIDI<WIEVKNTTEGARI<ANAI<GMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCD TDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0248] Enterococcus faecium HP. 19; SEQ ID NO:47:MIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHN EKFDGEFMVSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYTTTKSGKT KKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKLTFKQWFPILSLGFDKDLRK AYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYP L YIQNIKVRFRLKEGYIPTIQ VKQ S SLFIQNEYLD S SVNKLGVDELIDLTLTNVDLELFF AH YDILEIHYTYGYMFKASCGMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTN PDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTVTTAQRCFDRIIYCD TDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCAGM PDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0249] Enterococcus faecium HP.20; SEQ ID NO:48:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLEIDSFFEWCKMQGSTDIYF HNEKFDGEFMLSWLFKNGFKWCKEAKEDRTFSTLISNMGQWYALEICWEVNYATTKS GKTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKD LRKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNNDYPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELF FEHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKF GTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIYCDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKC AGMPDRIKELVTFDNFEVGFSSYGKLLPKRTQGGVVLVDTMFTIK

[0250] Enterococcus faecium HP. 21; SEQ ID NO:49:Atorney Docket No.: 051385-638001WOMMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWEYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMHVRPLPYGTPLFYEGEYKPNND YPLY1QN1KVRFRLKEGY1PT1QVKQSSLF1QNEYLDSSVNKLGVDEL1DLTLTNVDLELFF EHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0251] Enterococcus faecium HP.22; SEQ ID NO:50:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWEVNYATTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFF EHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTNSDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0252] Enterococcus faecium HP.23; SEQ ID NO:51 :MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMDQWYALEICWEVNYATTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKN DIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQNIKVRFRLKEGYIPTIQVKQSSLFIQNEYLDSSVNKLGVDELIDLTLTNVDLELFF EHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIK

[0253] Enterococcus faecium HP.24; SEQ ID NO:52:MMIKKYTGDFETTTDLNDCRVWSWGVCDIDNVDNITFGLDIDSFFEWCEMQGSTDIYFHNEKFDGEFMLSWLFKNGFKWSKETKEERTFSTLISNMGQWYALEICWNINYTTTKSG KTKKEKVRTIIYDSLKKYPFPVKQIAEAFNFPIKKGEIDYTKERPVGYNPTDDEWDYLKNDIQIMAMALKIQFDQGLTRMTRGSDALGDYKDWLKATHGKSTFKQWFPILSLGFDKDL RKAYKGGFTWVNKVFQGKEIGDGIVFDVNSLYPSQMYVRPLPYGTPLFYEGEYKPNND YPLYIQN IK VRF RLKEG YIPTIQ VKQS SLFIQNEYLES S VNKLGVDELIDLTLTNVDLELFF EHYDILEIHYTYGYMFKASCDMFKGWIDKWIEVKNTTEGARKANAKGMLNSLYGKFGAtorney Docket No.: 051385-638001WOTNPDITGKVPYMGEDGIVRLTLGEEELRDPVYVPLASFVTAWGRYTTITTAQRCFDRIIY CDTDSIHLVGTDVPEAIEHLVDPKKLGYWGHESTFQRAKFIRQKTYVEEIDGELNVKCA GMPDRIKELVTFDNFEVGF S S YGKLLPKRTQGGVVLVDTMFTIKEMBODIMENTS

[0254] Embodiment 1. A polymerase comprising an amino acid sequence that is at least 80% identical to SEQ ID NO: 1; comprising a first mutation at amino acid position corresponding to position 8; and a second mutation at amino acid position corresponding to position 270, 309, 93, 178, 470, or 369.

[0255] Embodiment 2. The polymerase of Embodiment 1, wherein the first mutation is a lysine, alanine, histidine, asparagine, glutamine, or arginine.

[0256] Embodiment 3. The polymerase of Embodiment 1, wherein the first mutation is a lysine.

[0257] Embodiment 4. The polymerase of any one of Embodiments 1 to 3, wherein the second mutation is a mutation at amino acid position corresponding to position 270, wherein the second mutation is arginine, lysine, histidine, or glutamine.

[0258] Embodiment 5. The polymerase of any one of Embodiments 1 to 3, wherein the second mutation is arginine.

[0259] Embodiment 6. The polymerase of any one of Embodiments 1 to 5, wherein the second mutation is a mutation at amino acid position corresponding to position 309, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine.

[0260] Embodiment 7. The polymerase of any one of Embodiments 1 to 5, wherein the second mutation is aspartic acid.

[0261] Embodiment 8. The polymerase of any one of Embodiments 1 to 7, wherein the second mutation is a mutation at amino acid position corresponding to position 93, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine.

[0262] Embodiment 9. The polymerase of any one of Embodiments 1 to 8, wherein the second mutation is a mutation at amino acid position corresponding to position 178, wherein the second mutation is arginine, lysine, histidine, or glutamine.Atorney Docket No.: 051385-638001WO

[0263] Embodiment 10. The polymerase of any one of Embodiments 1 to 9, wherein the second mutation is a mutation at amino acid position corresponding to position 470, wherein the second mutation is lysine, arginine, histidine, or glutamine.

[0264] Embodiment 11. The polymerase of any one of Embodiments 1 to 10, wherein the second mutation is a mutation at amino acid position corresponding to position 369, wherein the second mutation is lysine, arginine, histidine, or glutamine.

[0265] Embodiment 12. The polymerase of any one of Embodiments 1 to 11, further comprising a monovalent polyethylene glycol moiety covalently attached to the polymerase.

[0266] Embodiment 13. The polymerase of any one of Embodiments 1 to 12, comprising a glutamic acid, aspartic acid, alanine, glycine, or threonine at an amino acid position corresponding to position 536 of SEQ ID NO: 1.

[0267] Embodiment 14. The polymerase of Embodiment 1, comprising: M8K and V270R; M8K and F309D; M8K and I93E; M8K and I93E and V270R; M8K and I93E and V270R and F309D; M8K and L178R and F309K; M8K and L178R and V470K; or M8K and L178R and Y369K.

[0268] Embodiment 15. A polymerase comprising an amino acid sequence that is at least 80% identical to SEQ ID NO: 1 and comprising one or more monovalent polyethylene glycol moieties covalently attached to the polymerase.

[0269] Embodiment 16. A method of incorporating a nucleotide into a nucleic acid sequence comprising combining in a reaction vessel: (i) a nucleic acid template, (ii) a nucleotide solution comprising a plurality of nucleotides, and (iii) a polymerase, wherein the polymerase is the polymerase of any one of Embodiments 1 to 14.

[0270] Embodiment 17. A method of amplifying a nucleic acid sequence, said method comprising: a) hybridizing a nucleic acid template to a primer to form a primer-template hybridization complex; b) contacting said primer-template hybridization complex with a polymerase and a plurality of nucleotides, wherein the polymerase is the polymerase of any one of Embodiments 1 to 14; c) subjecting the primer-template hybridization complex to conditions which enable the polymerase to incorporate one or more nucleotides into the primer-Attorney Docket No.: 051385-638001WO template hybridization complex to generate amplification products, thereby amplifying a nucleic acid sequence.

[0271] Embodiment 18. A method of amplifying a template polynucleotide, the method comprising: contacting a template polynucleotide with an amplification primer, and amplifying the template polynucleotide by extending an amplification primer with a polymerase to generate an extension product comprising one or more complements of the template polynucleotide; wherein said polymerase is a polymerase of any one of Embodiments 1 to 14.

[0272] Embodiment 19. The method of Embodiment 18, comprising amplifying the template polynucleotide in a cell.

[0273] Embodiment 20. The method of Embodiment 18, comprising amplifying the template polynucleotide in a tissue.

[0274] Embodiment 21. The method of Embodiment 18, wherein said amplification primer is immobilized on a solid support.

Claims

Attorney Docket No.: 051385-638001 WOWHAT IS CLAIMED IS:

1. A polymerase comprising an amino acid sequence that is at least 80% identical to SEQ ID NO: 1 ; comprising a first mutation at amino acid position corresponding to position 8; and a second mutation at amino acid position corresponding to position 270, 309, 93, 178, 470, or 369.

2. The polymerase of claim 1, wherein the first mutation is a lysine, alanine, histidine, asparagine, glutamine, or arginine.

3. The polymerase of claim 1, wherein the first mutation is a lysine.

4. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 270, wherein the second mutation is arginine, lysine, histidine, or glutamine.

5. The polymerase of claim 1, wherein the second mutation is arginine.

6. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 309, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine.

7. The polymerase of claim 5, wherein the second mutation is aspartic acid.

8. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 93, wherein the second mutation is aspartic acid, glutamic acid, asparagine, or glutamine.

9. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 178, wherein the second mutation is arginine, lysine, histidine, or glutamine.

10. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 470, wherein the second mutation is lysine, arginine, histidine, or glutamine.Attorney Docket No.: 051385-638001 WO11. The polymerase of claim 1, wherein the second mutation is a mutation at amino acid position corresponding to position 369, wherein the second mutation is lysine, arginine, histidine, or glutamine.

12. The polymerase of claim 1, further comprising a monovalent polyethylene glycol moiety covalently attached to the polymerase.

13. The polymerase of claim 1, comprising a glutamic acid, aspartic acid, alanine, glycine, or threonine at an amino acid position corresponding to position 536 of SEQ ID NO: 1.

14. The polymerase of claim 1, comprising:M8K and V270R;M8K and F309D;M8K and 193 E;M8K and I93E and V270R;M8K and I93E and V270R and F309D;M8K and L178R and F309K;M8K and L178R and V470K; orM8K and L178R and Y369K.

15. A polymerase comprising an amino acid sequence that is at least 80% identical to SEQ ID NO: 1 and comprising one or more monovalent polyethylene glycol moieties covalently attached to the polymerase.

16. A method of incorporating a nucleotide into a nucleic acid sequence comprising combining in a reaction vessel: (i) a nucleic acid template, (ii) a nucleotide solution comprising a plurality of nucleotides, and (iii) a polymerase, wherein the polymerase is the polymerase of claim 1.

17. A method of amplifying a nucleic acid sequence, said method comprising: a) hybridizing a nucleic acid template to a primer to form a primer-template hybridization complex;Attorney Docket No.: 051385-638001 WO b) contacting said primer-template hybridization complex with a polymerase and a plurality of nucleotides, wherein the polymerase is the polymerase of claim 1; c) subjecting the primer-template hybridization complex to conditions which enable the polymerase to incorporate one or more nucleotides into the primer-template hybridization complex to generate amplification products, thereby amplifying a nucleic acid sequence.

18. A method of amplifying a template polynucleotide, the method comprising: contacting a template polynucleotide with an amplification primer, and amplifying the template polynucleotide by extending an amplification primer with a polymerase to generate an extension product comprising one or more complements of the template polynucleotide; wherein said polymerase is a polymerase of claim 1.

19. The method of claim 18, comprising amplifying the template polynucleotide in a cell.

20. The method of claim 18, comprising amplifying the template polynucleotide in a tissue.

21. The method of claim 18, wherein said amplification primer is immobilized on a solid support.