Methods and compositions for editing a gene associated with galactosemia
By altering the GALT polynucleotide using a base editor system, the method addresses the limitations of dietary treatment for galactosemia, enhancing metabolic function and reducing complications.
Patent Information
- Authority / Receiving Office
- WO · WO
- Patent Type
- Applications
- Current Assignee / Owner
- BEAM THERAPEUTICS INC
- Filing Date
- 2025-12-10
- Publication Date
- 2026-06-18
AI Technical Summary
Current treatments for galactosemia, which involve avoiding galactose and lactose in the diet, are inadequate as they do not prevent the progression of long-term complications due to endogenous galactose production, and there is a need for improved compositions and methods to treat the condition.
A base editor system comprising a programmable DNA binding protein, a nucleobase editor, and a guide RNA is used to modify the galactose-1-phosphate uridylyltransferase (GALT) polynucleotide, specifically altering codon 188 from an R amino acid to a Q amino acid to increase enzyme activity, thereby improving metabolic function.
The method enhances GALT activity, potentially reducing galactose buildup and associated complications in galactosemia patients, offering a more effective treatment than dietary restriction alone.
Smart Images

Figure US2025058927_18062026_PF_FP_ABST
Abstract
Description
[0001] ATTORNEY DOCKET NO. 180802-47402 / PCT
[0002] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0003] METHODS AND COMPOSITIONS FOR EDITING A GENE ASSOCIATED WITH GALACTOSEMIA
[0004] CROSS REFERENCE TO RELATED APPLICATIONS
[0005] The present application claims priority to U. S. Provisional Applications No. 63 / 730,807 filed December 11, 2024, and 63 / 853,947, filed July 30, 2025, the entire contents of each of which are hereby incorporated by reference in their entirety.
[0006] SEQUENCE LISTING
[0007] This application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. The Sequence Listing XML file, created on December 10, 2025, is named 180802-047402PCT_SL.xml and is 1,335,782 bytes in size.
[0008] BACKGROUND
[0009] Galactosemia is characterized by an inability to metabolize galactose that results in build up of galactose and galactose metabolites (e.g., galactitol or galactose- 1-P) in a subject, which may be caused by deficient activity of galactose- 1 -phosphate uridylyltransferase (GALT).
[0010] Galactose is a component of lactose, which is the sugar found in milk. Untreated newborns having galactosemia experience vomiting, lethargyjaundice, cataracts, sepsis, multisystem organ failure, and / or death. In adults, galactosemia results in neurocognitive impairment, cataracts, liver failure, poor muscle tone, premature menopause in women, and motor disorders. Over 2,250 patients in north America suffer from galactosemia, and over 50 new patients are diagnosed with galactosemia each year. To date, the only treatment available for galactosemia is to avoid foods that contain galactose or lactose for life. Many galactosemia patients find it challenging to adhere to such a diet. Galactose is also endogenously produced via de novo synthesis; thus, dietary restriction alone is not sufficient to prevent the progression of long-term complications.
[0011] Therefore, there is a present need for improved compositions and methods for treatment of galactosemia.
[0012] SUMMARY
[0013] As described below, the disclosure features compositions and methods for treating galactosemia by introducing one or more alterations into a galactose- 1 -phosphate ATTORNEY DOCKET NO. 180802-47402 / PCT
[0014] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0015] uridylyltransferase (GALT) polynucleotide in a cell. In particular embodiments, the disclosure provides a base editor system (e.g., a fusion protein or complex comprising a programable DNA binding protein, a nucleobase editor, and gRNA) for modifying a GALT polynucleotide, where the modification is associated with an increase in activity of the GALT polypeptide encoded by the polynucleotide.
[0016] In one aspect, the disclosure provides a method of editing a nucleobase of a galactose- 1-phosphate uridylyltransferase (GALT) polynucleotide. The method involves contacting the GALT polynucleotide with a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor containing a deaminase domain capable deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor. The guide RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid. The alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. Also, a) the deaminase domain contains an amino acid sequence with at least 85% sequence identity to a deaminase selected from one or more of RrA3f, T1.52, and 6b; and / or b) the guide RNA contains a spacer, where the spacer contains at least 19 contiguous nucleotides of a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0017] In another aspect, the disclosure provides a method for producing a cell, the method involves editing a GALT polynucleotide in the cell according to the method of any aspect or embodiment of the disclosure.
[0018] In another aspect, the disclosure provides a method for treating galactosemia in a subject in need thereof. The subject contains a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide encoding a GALT polypeptide having a Q188R mutation. The method involves administering to the subject a base editor system containing a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor containing a deaminase domain capable of deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor. The guide RNA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0019] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0020] targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. Also a) the deaminase domain contains an amino acid sequence with at least 85% sequence identity to a deaminase selected from one or more of RrA3f, T1.52, and 6b; and / or b) the guide RNA contains a spacer, where the spacer contains at least 19 contiguous nucleotides of a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0021] In another aspect, the disclosure provides a base editor system. The base editor system contains a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor containing a deaminase domain capable of deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor. The guide RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of a GALT polynucleotide encoding an R amino acid, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. Also a) the deaminase domain contains an amino acid sequence with at least 85% sequence identity to a deaminase selected from one or more of RrA3f, T1.52, and 6b; and / or b) the guide RNA contains a spacer, where the spacer contains at least 19 contiguous nucleotides of a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0022] In another aspect, the disclosure provides a polynucleotide or set of polynucleotides encoding the base editor system of any aspect or embodiment of the disclosure, or a component thereof. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0023] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0024] In another aspect, the disclosure provides a vector or set of vectors containing the base editor system, or the polynucleotide or set of polynucleotides of any aspect or embodiment of the disclosure.
[0025] In another aspect, the disclosure provides a lipid nanoparticle containing the base editor system, or the polynucleotide or set of polynucleotides of any aspect or embodiment of the disclosure.
[0026] In another aspect, the disclosure provides a cell containing the base editor system of any aspect or embodiment of the disclosure.
[0027] In another aspect, the disclosure provides a pharmaceutical composition containing an effective amount of the base editor system, the polynucleotide or set of polynucleotides, the vector or set of vectors, the lipid nanoparticle, the composition, the guide polynucleotide, or the cell of any aspect or embodiment of the disclosure, and a pharmaceutically acceptable excipient.
[0028] In another aspect, the disclosure provides a kit containing the base editor system of any aspect or embodiment of the disclosure, or a component thereof, the polynucleotide or set of polynucleotides of any aspect or embodiment of the disclosure, the vector or set of vectors of any aspect or embodiment of the disclosure, the lipid nanoparticle of any aspect or embodiment of the disclosure, the composition of any aspect or embodiment of the disclosure, the guide polynucleotide of any aspect or embodiment of the disclosure, or the pharmaceutical composition of any aspect or embodiment of the disclosure, disposed within a container.
[0029] In another aspect, the disclosure provides a guide polynucleotide containing a spacer or gRNA nucleotide sequence listed in Table 1.
[0030] In another aspect, the disclosure provides a method of editing a nucleobase of a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide. The method involves contacting the GALT polynucleotide with a guide RNA and mRNA encoding a base editor. The base editor contains a deaminase domain containing a deaminase selected from one or more of RrA3f, T1.52, and 6b, and a nucleic acid programmable DNA binding protein (napDNAbp) domain. The napDNAbp is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The guide RNA targets (i.e., “directs”) the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid. The alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. The guide RNA contains a spacer where the spacer contains or contains only a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), ATTORNEY DOCKET NO. 180802-47402 / PCT
[0031] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0032] UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0033] In another aspect, the disclosure provides a base editor system containing a guide RNA and mRNA encoding a base editor. The base editor contains a deaminase domain containing a deaminase selected from one or more of RrA3f, T1.52, and 6b, and a nucleic acid programmable DNA binding protein (napDNAbp) domain. The napDNAbp is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The guide RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. The guide RNA contains a spacer, where the spacer contains or contains only a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0034] In another aspect, the disclosure provides a lipid nanoparticle containing the base editor system of any aspect or embodiment of the disclosure.
[0035] In another aspect, the disclosure provides a composition. The composition contains a guide RNA and mRNA encoding a base editor. The base editor contains a deaminase domain containing a deaminase selected from one or more of RrA3f, T1.52, and 6b, and a nucleic acid programmable DNA binding protein (napDNAbp) domain. The napDNAbp is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The guide ATTORNEY DOCKET NO. 180802-47402 / PCT
[0036] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0037] RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. The guide RNA contains a spacer, where the spacer contains or contains only a sequence selected from one or more of:
[0038] CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0039] In another aspect, the disclosure provides a lipid nanoparticle (LNP) composition containing a guide RNA and mRNA encoding a base editor. The base editor contains a deaminase domain contains a deaminase selected from one or more of RrA3f, T1.52, and 6b, and a nucleic acid programmable DNA binding protein (napDNAbp) domain. The napDNAbp is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The guide RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. The guide RNA contains a spacer, where the spacer contains or contains only a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0040] In another aspect, the disclosure provides a method of treatment. The method of treatment involves administering to a subject in need thereof, where the subject contains a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide encoding a GALT polypeptide having a Q188R mutation, a lipid nanoparticle (LNP). The LNP contains a guide RNA and mRNA encoding a base editor. The base editor contains a deaminase domain containing a ATTORNEY DOCKET NO. 180802-47402 / PCT
[0041] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0042] deaminase selected from one or more of RrA3f, T1.52, and 6b, and a nucleic acid programmable DNA binding protein (napDNAbp) domain. The napDNAbp is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The guide RNA targets the base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, where the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid. The guide RNA contains a spacer, where the spacer contains or contains only a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
[0043] In another aspect, the disclosure provides a lipid nanoparticle (LNP) containing a guide RNA, or a polynucleotide encoding the guide RNA, an mRNA molecule, or a polynucleotide encoding the mRNA molecule, and an ionizable lipid, or a pharmaceutically acceptable salt thereof. The guide RNA contains a spacer containing a nucleotide sequence selected from UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), and those listed in Table 1. The mRNA molecule encodes a base editor polypeptide containing a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain. The deaminase domain contains an amino acid sequence with at least 90% sequence identity to a deaminase selected from one or more of 6b, RrA3f, and T1.52. The ionizable lipid has any one of the following formulas:
[0044] i) a compound of Formula (I):
[0045]
[0046] where:
[0047] each L1and L1'is independently -C(O)- or -OC(O)-; ATTORNEY DOCKET NO. 180802-47402 / PCT
[0048] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0049] each L2and L2'is independently an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0050] L3is a covalent bond, -O-, -C(O)O-, -OC(O)-, or -OC(O)O-;
[0051] O-Ra
[0052] i_LCyA_cA rLRa~(
[0053] R1is optionally substituted C1-20 aliphatic, ‘y, or O-Ra;
[0054] LCyAis a covalent bond or an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0055] CyAis an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, 1-adamantyl, 2-adamantyl, sterolyl, and phenyl;
[0056] LRais an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0057] each Raand R1is independently optionally substituted C1-20 aliphatic;
[0058] Y1is -C(O)- or -C(O)O-;
[0059] Y2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-6 hydrocarbon chain;
[0060] Y3is optionally substituted C1-20 aliphatic;
[0061] X1is a covalent bond, -O-, or -NR-;
[0062] X2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain, where 1-3 methylene units are optionally and independently replaced with -O- or -NR-;
[0063] X3is hydrogen or an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; and
[0064] each R is independently hydrogen or optionally substituted C1-6 aliphatic;
[0065] ii) a compound of formula A’:
[0066] A3
[0067] O R
[0068] 1 / Ll
[0069] y c) R
[0070] L2
[0071] R<.bk, / R"
[0072]
[0073] A’
[0074] or its N-oxide, or a pharmaceutically acceptable salt thereof, where
[0075] L1is absent, C1-6 alkylenyl, or C2-6 heteroalkylenyl; ATTORNEY DOCKET NO. 180802-47402 / PCT
[0076] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0077] each L2is independently optionally substituted C2-15 alkylenyl, or optionally substituted C3-15 heteroalkyl enyl;
[0078] L is Ci-io alkylenyl, or C2-10 heteroalkyl enyl;
[0079] X2is -OC(O)-, -C(O)O-, or -OC(O)O-;
[0080] X is absent, -OC(O)-, -C(O)O-, or -OC(O)O-;
[0081] O
[0082] |_3a y^ I o^R6
[0083] ox^Ra%
[0084] Ax3a
[0085] R” is hydrogen,
[0086]
[0087] O R, O R6, or an optionally substituted group selected from C6-20 aliphatic, 3- to 12-membered cycloaliphatic, 7- to 12-membered bridged bicyclic containing 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, 1- adamantyl, 2-adamantyl, sterolyl, and phenyl;
[0088] each of R and Rais independently hydrogen, or an optionally substituted group selected from Ce- 20 aliphatic, 3- to 12-membered cycloaliphatic, 7- to 12-membered bridged bicyclic containing 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, 1- adamantyl, 2-adamantyl, sterolyl, and phenyl
[0089] each of L3and L3ais independently absent, optionally substituted C1-10 alkylenyl, or optionally substituted C2-10 heteroalkylenyl;
[0090] R1is hydrogen, optionally substituted phenyl, optionally substituted 3- to 7-membered cycloaliphatic, optionally substituted 3- to 7-membered heterocyclyl containing 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, optionally substituted 5- to 6-membered monocyclic heteroaryl containing 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, optionally substituted 8- to 10-membered bicyclic heteroaryl containing 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, -OR2, -C(O)OR2, -C(O)SR2, -OC(O)R2, -OC(O)OR2, -CN,
[0091] -N(R2)2, -C(O)N(R2)2, -S(O)2N(R2)2, -NR2C(O)R2, -OC(O)N(R2)2, -N(R2)C(O)OR2, -NR2S(O)2R2, -NR2C(O)N(R2)2, -NR2C(S)N(R2)2, -NR2C(NR2)N(R2)2, -NR2C(CHR2)N(R2)2, -N(OR2)C(O)R2, -N(OR2)S(O)2R2, -N(OR2)C(O)OR2, -N(OR2)C(O)N(R2)2, -N(OR2)C(S)N(R2)2, -N(OR2)C(NR2)N(R2)2, -N(OR2)C(CHR2)N(R2)2, -C(NR2)N(R2)2, -C(NR2)R2, -C(O)N(R2)OR2, -C(R2)N(R2)2C(O)OR2, -CR2(R3)2, -OP(O)(OR2)2, or -P(O)(OR2)2; or ATTORNEY DOCKET NO. 180802-47402 / PCT
[0092] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0093] R2R2
[0094] / \
[0095] R
[0096]
[0097] 1is ° O, or a ring selected from 3- to 7-membered cycloaliphatic and 3- to 7- membered heterocyclyl containing 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, where the cycloaliphatic or heterocyclyl ring is optionally substituted with 1-4 R2or R3groups;
[0098] each R2is independently hydrogen, oxo, -CN, -NO2, -OR4, -S(O)2R4, -S(O)2N(R4)2, -(CH2)n-R4, or an optionally substituted group selected from C1-6 aliphatic, phenyl, 3- to 7-membered cycloaliphatic, 5- to 6-membered monocyclic heteroaryl containing 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, and 3- to 7-membered heterocyclyl containing 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; or two occurrences of R2, taken together with the atom(s) to which they are attached, form optionally substituted 4- to 7-membered heterocyclyl containing 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;
[0099] each R3is independently -(CH2)n-R4; or
[0100] two occurrences of R3, taken together with the atom(s) to which they are attached, form optionally substituted 5- to 6-membered heterocyclyl containing 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;
[0101] each R4is independently hydrogen, -OR5, -N(R5)2, -OC(O)R5, -OC(O)OR5, -CN, -C(O)N(R5)2, -NR5C(O)R5, -OC(O)N(R5)2, -N(R5)C(O)OR5, -NR5S(O)2R5, -NR5C(O)N(R5)2,
[0102] R5R5
[0103] / \
[0104] R5*
[0105] -Z —
[0106] -
[0107]
[0108] NR5C(S)N(R5)2, -NR5C(NR5)N(R5)2, or O O ■
[0109] each R5is independently hydrogen, or optionally substituted C1-6 aliphatic; or
[0110] two occurrences of R5, taken together with the atom(s) to which they are attached, form optionally substituted 4- to 7-membered heterocyclyl containing 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;
[0111] each R6is independently C4-12 aliphatic; and
[0112] each n is independently 0 to 4; or
[0113] x) a compound of Formula I: ATTORNEY DOCKET NO. 180802-47402 / PCT
[0114] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0115]
[0116] or a pharmaceutically acceptable salt thereof, where:
[0117] L1is a covalent bond, -C(O)-, or -OC(O)-;
[0118] L2is a covalent bond, an optionally substituted bivalent saturated or unsaturated, straight or HH-cyA-H
[0119] branched C1-C12 hydrocarbon chain, orm m;
[0120] CyAis an optionally substituted ring selected from phenylene and 3- to 7-membered saturated or partially unsaturated carbocyclene;
[0121] each m is independently 0, 1, or 2;
[0122] L3is a covalent bond, -C(O)-, -C(O)O-, -OC(O)-, -O-, or -OC(O)O-;
[0123] R, is H4p-C', B, an optionally substituted saturated or unsaturated, straight or branched
[0124] C1-C20 hydrocarbon chain where 1-3 methylene units are optionally and independently
[0125] , O-A1
[0126] HL2
[0127] replaced with -O- or -NR-, or O_A;
[0128] CyBis an optionally substituted ring selected from 3- to 12-membered saturated or partially
[0129] unsaturated carbocyclyl, 1-adamantyl, 2-adamantyl,
[0130]
[0131] , sterolyl, and phenyl; p is 0, 1, 2, or 3;
[0132] each L4is independently a bivalent saturated or unsaturated, straight or branched Ci-Ce hydrocarbon chain;
[0133] each A1and A2is independently an optionally substituted C1-C20 aliphatic or -L5-R5;
[0134] or A1and A2, together with their intervening atoms, may form an optionally substituted ring:
[0135] #K2) X.
[0136] where
[0137] x is selected from 1 or 2; and
[0138] # represents the point of attachment to L4;
[0139] each L5is independently a bivalent saturated or unsaturated, straight or branched C1-C20 hydrocarbon chain, where 1-3 methylene units are optionally and independently replaced with -O- or -NR-; ATTORNEY DOCKET NO. 180802-47402 / PCT
[0140] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0141] each R5is independently an optionally substituted group selected from a 5- to 10-membered aryl ring or a 3- to 8-membered carbocyclic ring;
[0142] X1is a covalent bond, -O-, or -NR-;
[0143] X2is a covalent bond or an optionally substituted, bivalent saturated or unsaturated, straight or branched, C1-C12 hydrocarbon chain, where 1-3 methylene units are optionally and independently replaced with -O-, -NR-, or -Cyc-;
[0144] Cycis an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclene, phenylene, 3- to 7-membered saturated or partially unsaturated heterocyclene having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, and 5- to 6-membered heteroarylene having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur;
[0145] X3is hydrogen or an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered saturated or partially unsaturated heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; and
[0146] each R is independently hydrogen or an optionally substituted Ci-Ce aliphatic group;
[0147] FH-cyB
[0148] provided that when L is a covalent bond, then R must bep
[0149] In another aspect, the disclosure provides a lipid nanoparticle (LNP) containing a guide RNA, or a polynucleotide encoding the guide RNA, an mRNA molecule, or a polynucleotide encoding the mRNA molecule, and an ionizable lipid, or a pharmaceutically acceptable salt thereof. The guide RNA is capable of directing a base editor polypeptide to alter a nucleotide in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide. The mRNA molecule encodes a base editor polypeptide containing a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain, where the deaminase domain containing an amino acid sequence with at least 90% sequence identity to a deaminase selected from one or more of 6b, RrA3f, and T1.52. The an ionizable lipid has Formula (I):
[0150]
[0151] where:
[0152] each L1and L1'is independently -C(O)- or -OC(O)-; ATTORNEY DOCKET NO. 180802-47402 / PCT
[0153] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0154] each L2and L2'is independently an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0155] L3is a covalent bond, -O-, -C(O)O-, -OC(O)-, or -OC(O)O-;
[0156] O-Ra
[0157] i_LCyA_cA rLRa~(
[0158] R1is optionally substituted C1-20 aliphatic, ‘y, or O-Ra;
[0159] LCyAis a covalent bond or an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0160] CyAis an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, 1-adamantyl, 2-adamantyl, sterolyl, and phenyl;
[0161] LRais an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;
[0162] each Raand R1is independently optionally substituted C1-20 aliphatic;
[0163] Y1is -C(O)- or -C(O)O-;
[0164] Y2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-6 hydrocarbon chain;
[0165] Y3is optionally substituted C1-20 aliphatic;
[0166] X1is a covalent bond, -O-, or -NR-;
[0167] X2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain, where 1-3 methylene units are optionally and independently replaced with -O- or -NR-;
[0168] X3is hydrogen or an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; and
[0169] each R is independently hydrogen or optionally substituted C1-6 aliphatic.
[0170] In another aspect, the disclosure provides a lipid nanoparticle (LNP) containing a guide RNA, a base editor polypeptide, or a polynucleotide encoding the base editor polypeptide, and an ionizable lipid, or a pharmaceutically acceptable salt thereof. The guide RNA has a nucleotide sequence selected from: mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); and mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm ATTORNEY DOCKET NO. 180802-47402 / PCT
[0171] ELECTRONIC DEPOSIT DATE: December 10, 2025 AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791), where “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS). The base editor polypeptide contains a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain containing a deaminase domain containing a deaminase selected from one or more of 6b, RrA3f, and T1.52. The ionizable lipid has the following structure:
[0172] H
[0173] . CL. N
[0174]
[0175] In another aspect, the disclosure provides a method for modifying a target nucleobase in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide in a cell. The method involves contacting the cell with the lipid nanoparticle, the base editor system, or the polynucleotide or set of polynucleotides of any aspect of the disclosure, or embodiments thereof, thereby modifying the target nucleobase in the GALT polynucleotide.
[0176] In another aspect, the disclosure provides a method of treating a disease in a subject in need thereof. The disease is associated with a pathogenic mutation in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide in the subject. The method involves administering to the subject the lipid nanoparticle, the base editor system, or the polynucleotide or set of polynucleotides of any aspect of the disclosure, or embodiments thereof, thereby altering a target nucleobase in the GALT polynucleotide in the subject.
[0177] In any aspect or embodiment of the disclosure, the guide RNA contains a spacer, where the spacer contains at least 19 contiguous nucleotides of a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719). In any aspect or embodiment of the disclosure, the guide RNA contains a spacer, where the spacer contains only a sequence selected from one or more of: CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: ATTORNEY DOCKET NO. 180802-47402 / PCT
[0178] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0179] 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719). In any aspect or embodiment of the disclosure, the guide RNA contains a spacer, where the spacer contains only a sequence selected from one or more of:
[0180] UACCCGGCAGUGGGGGUGGG(SEQ ID NO: 707), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), and UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705). In any aspect or embodiment of the disclosure, the guide RNA contains a sequence selected from one or more of:
[0181] End-mod spCas9 guide polynucleotide mNsmNsmNsNNNNNNNNNNNNNNNNNGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUsmUsmUsmU (SEQ ID NO: 765); End-mod saCas9 guide polynucleotide mNsmNsmNsNNNNNNNNNNNNNNNNNNGUUUUAGUACUCUGUAAUGAAAAUUACAGAAUCUACU AAAACAAGGCAAAAUGCCGUGUUUAUCUCGUCAACUUGUUGGCGAGAUsmUsmUsmU (SEQ ID NO: 766); and
[0182] LONGEST: mNsmNsmNsNNNNNNNNNNNNNNNNNGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 769), where “N” represents any nucleotide, “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS), and where the number of N nucleotides is between 19 and 25. In any aspect or embodiment of the disclosure, the guide RNA contains a sequence selected from one or more of
[0183] mC smC smC sUUACCCGGCAGUGGGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 728); mC smC smUsUACCCGGCAGUGGGGGUGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 729); mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 730); ATTORNEY DOCKET NO. 180802-47402 / PCT
[0184] ELECTRONIC DEPOSIT DATE: December 10, 2025 mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 731); mUsmAsmC s CCGGCAGUGGGGGUGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 732); mCsmUsmGsACACCCUUACCCGGCAGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 733); mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 734); mGsmAsmCsACCCUUACCCGGCAGUGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 735); mAsmC smAs CCCUUACCCGGCAGUGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 736); mC smC smC sUUACCCGGCAGUGGGGGUGUUUUAGUACUCUGUAAUGAAAAUUACAGAAUCUACU AAAACAAGGCAAAAUGCCGUGUUUAUCUCGUCAACUUGUUGGCGAGAUsmUsmUsmU (SEQ ID NO: 737);
[0185] mAsmC smC s CGGCAGUGGGGGUGGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 738); mA s mC s mA s C C CUUAC C CGGC AGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 739); mCsmCsmCsUUACCCGGCAGUGGGGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 740); mCsmCsmUsUACCCGGCAGUGGGGGUGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 741);
[0186] mC s mU s mG s AC AC C CUUAC C CGGC AGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 742); mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 743); ATTORNEY DOCKET NO. 180802-47402 / PCT
[0187] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0188] mG s mA s mC s C C CUUAC C CGGC AGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 744); mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 745); mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); and mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791);
[0189] where “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS). In any aspect or embodiment of the disclosure, the guide RNA contains a sequence selected from one or more of: mUsmAsmC s CCGGCAGUGGGGGUGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 732); mA s mC s mA s C C CUUAC C CGGC AGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 739); mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 743); mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791); and mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); where “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS).
[0190] In any aspect or embodiment of the disclosure, the deaminase domain is inserted within the napDNAbp domain. In any aspect or embodiment of the disclosure, the deaminase is inserted between amino acid positions 906 and 907 of the napDNAbp referenced to SEQ ID NO: 197. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0191] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0192] In any aspect or embodiment of the disclosure, the napDNAbp has specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NGG, NNGRRT, NRTH, NRN, NG, or RY, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. In any aspect or embodiment of the disclosure, the napDNAbp is a Cas9. In any aspect or embodiment of the disclosure, the Cas9 is spCas9, saCas9, or a variant thereof. In any aspect or embodiment of the disclosure, the napDNAbp is SpRY or SpRYc. In any aspect or embodiment of the disclosure, the napDNAbp is a nickase. In any aspect or embodiment of the disclosure, the nickase contains a D10A amino acid alteration referenced to SEQ ID NO: 197. In any aspect or embodiment of the disclosure, the Cas9 contains an amino acid sequence selected from one or more of:
[0193] spCas9 DKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLK RTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYH EKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYN QLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFD LAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASM IKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG TEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIP YYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAH LFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFK EDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQT TQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRL SDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSK LVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSM PQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGK SKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASA GELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRVI LADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDA TLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 783);
[0194] saCas9 ATTORNEY DOCKET NO. 180802-47402 / PCT
[0195] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0196] KRNY I LGLD I G I TS VGYG I ID YETRDVIDAGVRLFKEANVENNEGRRS KRGARRLKRRRRHR I Q RVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTG NELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQL DQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYN ALNDLNNLVITRDENEKLEYYEKFQI IENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKP EFTNLKVYHDIKDITARKEI IENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISN LKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPV VKRS F I QS I KV I NA 11 KKYGL PND HIE LARE KNS KDAQKM I NEMQKRNRQTNER I E E 11 RTTG KENAKYL I EKI KLHDMQEGKCL YS LEAI PLEDLLNNPFNYE VDH 11 PRS VS FDNS FNNKVLVKQ EENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDF INRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDA LI IANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDY KYSHRVDKKPNRELINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHD PQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDDY PNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAE FIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRI IKTIASKTQ S IKKYSTDILGNLYEVKSKKHPQI IKKG (SEQ ID NO: 784);
[0197] Nureki DKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLK RTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYH EKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYN QLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFD LAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASM IKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG TEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIP YYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAH LFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFK EDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQT TQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRL SDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSK LVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSM ATTORNEY DOCKET NO. 180802-47402 / PCT
[0198] ELECTRONIC DEPOSIT DATE: December 10, 2025 PQVNIVKKTEVQTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFVSPTVAYSVLVVAKVEKGK SKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASA RFLQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRVI LADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPRAFKYFDTTIDRKVYRSTKEVLDA TLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 785);
[0199] SpRYc EKKYSIGLDIGTNSVGWAVITDDYKVPSKKFKVLGNTNRKSIKKNLMGALLFDSGETAEATRLK RTARRRYTRRKNRIRYLQEIFANEMAKLDDSFFQRLEESFLVEEDKKNERHPIFGNLADEVAYH RNYPTIYHLRKKLADSPEKADLRLIYLALAHI IKFRGHFLIEGKLNAENSDVAKLFYQLIQTYN QLFEESPLDEIEVDAKGILSARLSKSKRLEKLIAVFPNEKKNGLFGNI IALALGLTPNFKSNFD LTEDAKLQLSKDTYDDDLDELLGQIGDQYADLFSAAKNLSDAILLSDILRSNSEVTKAPLSASM VKRYDEHHQDLALLKTLVRQQFPEKYAEIFKDDTKNGYAGYVGADKKLRKRSGKLATEEEFYKF IKPILEKMDGAEELLAKLNRDDLLRKQRTFDNGSIPHQIHLKELHAILRRQEEFYPFLKENREK IEKILTFRIPYYVGPLARGNSRFAWLTRKSEEAITPWNFEEVVDKGASAQSFIERMTNFDEQLP NKKVLPKHSLLYEYFTVYNELTKVKYVTERMRKPEFLSGEQKKAIVDLLFKTNRKVTVKQLKED YFKKIECFDSVEI IGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREM IEERLKTYAHLFDDKVMKQLKRRHYTGWGRLSRKMINGIRDKQSGKTILDFLKSDGFSNRNFMQ LIHDDSLTFKEEIEKAQVSGQGDSLHEQIADLAGSPAIKKGILQTVKIVDELVKVMGHKPENIV IEMARENQTTTKGLQQSRERKKRIEEGIKELESQILKENPVENTQLQNEKLYLYYLQNGRDMYV DQELDINRLSDYDVDHIVPQSFIKDDSIDNKVLTRSVENRGKSDNVPSEEVVKKMKNYWRQLLN AKL I TQRKFDNLTKAERGGLS EADKAGF I KRQLVETRQ I TKHVAR I LDSRMNTKRDKNDKP I RE VKVITLKSKLVSDFRKDFQLYKVRDINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVY DVRKMIAKSEQEIGKATAKRFFYSNIMNFFKTEVKLANGEIRKRPLIETNGETGEVVWNKEKDF ATVRKVLAMPQVNIVKKTEVQTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSVL VVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELEN GRKRMLASAKQLQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQ ISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTRLGAPRAFKYFDTTIDPKQY RSTKEVLDATLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 786);
[0200] SpRY DKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAERTRLK RTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYH EKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYN QLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFD LAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASM IKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG ATTORNEY DOCKET NO. 180802-47402 / PCT
[0201] ELECTRONIC DEPOSIT DATE: December 10, 2025 TEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIP YYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAH LFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFK EDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQT TQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRL SDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSK LVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSM PQVNIVKKTEVQTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSVLVVAKVEKGK SKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASA KQLQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRVI LADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTRLGAPRAFKYFDTTIDPKQYRSTKEVLDA TLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 787);
[0202] spCas9 NRRH DKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLK RTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYH EKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYN QLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFD LAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASM VKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG TEELLVKLNREDLLRKQRTFDNGI IPHQIHLGELHAILRRQGDFYPFLKDNREKIEKILTFRIP YYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAH LFDDKVMKQLKRLRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFK EDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGGHKPENIVIEMARENQT TQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRL SDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSK LVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSM PQVNIVKKTEVQTGGFSKESILPKGNSDKLIARKKDWDPKKYGGFNSPTAAYSVLVVAKVEKGK ATTORNEY DOCKET NO. 180802-47402 / PCT
[0203] ELECTRONIC DEPOSIT DATE: December 10, 2025 SKKLKSVKELLGITIMERSSFEKNPIGFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASA GVLHKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRVI LADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGVPAAFKYFDTTIDKKRYTSTKEVLDA TLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 788); and
[0204] spCas9 NRTH DKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLK RTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYH EKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYN QLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFD LAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASM VKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDG TEELLVKLNREDLLRKQRTFDNGI IPHQIHLGELHAILRRQGDFYPFLKDNREKIEKILTFRIP YYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAH LFDDKVMKQLKRLRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFK EDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGGHKPENIVIEMARENQT TQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRL SDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSK LVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSM PQVNIVKKTEVQTGGFSKESILPKGNSDKLIARKKDWDPKKYGGFNSPTVAYSVLVVAKVEKGK SKKLKSVKELLGITIMERSSFEKNPIGFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASA SVLHKGNELALPSKYVNFLYLASHYEKLKGSSEDNKQKQLFVEQHKHYLDEI IEQISEFSKRVI LADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGASAAFKYFDTTIGRKLYTSTKEVLDA TLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 789).
[0205] In any aspect or embodiment of the disclosure, the base editor further contains two uracil glycosylase inhibitor (UGI) domains. In any aspect or embodiment of the disclosure, the base editor further contains a nuclear localization signal (NLS).
[0206] In any aspect or embodiment of the disclosure, the nucleobase in codon 188 corresponds to the lowercase “c” in the following nucleotide sequence:
[0207] CTGACACCCTTACCcGGCAGTGGGGGTGGGGGTTAG (SEQ ID NO: 692).
[0208] In any aspect or embodiment of the disclosure, the deaminase domain contains an amino acid sequence with at least 90% sequence identity to a sequence selected from one or more of: ATTORNEY DOCKET NO. 180802-47402 / PCT
[0209] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0210] T1.52 SEVEYSHEYWMRHALTLAKRARDERHVPVGAVLVLNNRVIGEGWNRAKGLHDPTAHAEIMALRQ GGLVMQNYRLWGATL YTTFE PCVMCAGAM IHSR I GRVVFGVCNAKTHACMS LMDVLGHPGMPHR VEITEGILADECEALLCRFFRMPRRVFNAQKKAQSSTD (SEQ ID NO: 773);
[0211] 6b SSEVEFSHEYWMRHALTLAKRARDEGEAPVGAVLVLNNRVIGEGWVRRIGLHDPTAHAEIMALR QGGLVMQNPRL IDATLYVTFE PCVMCAGAM INSR I GRVVFGVRNS KRGAAGS LMNVLNYPGMNH RVEITEGILADECAALLCDFYRMPRQVFNAQKKAQSSIN (SEQ ID NO: 790); and
[0212] RrA3f
[0213] KPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVDP ETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTIYTARLYY FWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLTRRLREIL
[0214] Q (Positions 2 to 193 of SEQ ID NO: 67).
[0215] In any aspect or embodiment of the disclosure, the deaminase domain contains an amino acid sequence selected from one or more of SEQ ID NOs: 773, 790, and 67.
[0216] In any aspect or embodiment of the disclosure, the base editor contains, in order from N-terminus to C-terminus, the deaminase domain, a peptide linker, and the napDNAbp domain, where the deaminase domain is separated from the napDNAbp domain in the base editor by less than about 15 amino acids. In any aspect or embodiment of the disclosure, the peptide linker contains or contains only an amino acid sequence selected from one or more of:
[0217] EGSSSKEEEEPG (SEQ ID NO: 647), NSISSSNGQK (SEQ ID NO: 648), GEEGEGSGGGEK (SEQ ID NO: 649), EGEGGKESGSSE (SEQ ID NO: 650), GGGGSSKSPGSE (SEQ ID NO: 651), PIGSDQDD (SEQ ID NO: 652), TEKGQVPHGS (SEQ ID NO: 653), ESGEGGGGSEKK (SEQ ID NO: 654), EEGKPKEGEGSG (SEQ ID NO: 655), ASREPKDSS (SEQ ID NO: 656), KQGSEHDE (SEQ ID NO: 657), SESKSEKGSSEK (SEQ ID NO: 658), QYDSGERSDQ (SEQ ID NO: 659), PGANEEIPGQ (SEQ ID NO: 660), NSPTDEK (SEQ ID NO: 661), EGANEEIPGQ (SEQ ID NO: 662), EGEKEKKKSGES (SEQ ID NO: 663), PGRHEEVPGQ (SEQ ID NO: 664), SKHQTEQDDS (SEQ ID NO: 665), ESEDDSSGRK (SEQ ID NO: 666), KESEKKESESKS (SEQ ID NO: 667), KGEGKSSIKD (SEQ ID NO: 668), DRSQKQDQQD (SEQ ID NO: 669), GPSSTSSS (SEQ ID NO: 670), GSSGEKEEGEPS (SEQ ID NO: 671), GEPKSKKSGSGS (SEQ ID NO: 672), SSGEGGKSESGP (SEQ ID NO: 673), SPQPTSSD (SEQ ID NO: 674), EGGSEEEEESGS (SEQ ID NO: 675), KGPKPKKEESEK (SEQ ID NO: 676), SKSQQFVTYE (SEQ ID NO: 677), TGNSKYQTGK (SEQ ID NO: 678), PQPIPHTNPT (SEQ ID NO: 679), ANAHSDISTG (SEQ ID NO: 680), KSQQTEDQSK (SEQ ID NO: 681), QSQDQKQKEH (SEQ ID NO: 682), NQQRPSSD ATTORNEY DOCKET NO. 180802-47402 / PCT
[0218] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0219] (SEQ ID NO: 683), TTKDTSPKPQ (SEQ ID NO: 684), EGKDNQQTGE (SEQ ID NO: 685), and EPQPDSSE (SEQ ID NO: 686).
[0220] In any aspect or embodiment of the disclosure, the base editor contains an amino acid sequence having at least 85% or 90% identity to a sequence selected from one or more of: MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRL KRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAY HEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTY NQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNF DLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSAS MIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMD GTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHS LLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFD SVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYA HLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTF KEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQ TTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINR LSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRK FDNLTKAERGGSSGSETPGTSESATPESSGSEVEYSHEYWMRHALTLAKRARDERHVPVGAVLV LNNRVIGEGWNRAKGLHDPTAHAEIMALRQGGLVMQNYRLWGATLYTTFEPCVMCAGAMIHSRI GRVVFGVCNAKTHACMSLMDVLGHPGMPHRVEITEGILADECEALLCRFFRMPRRVFNAQKKAQ SSTDGSSGSGSSGSGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVI TLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRK MIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVR KVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAK VEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKR MLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEF SKRVILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPAAFKYFDTTIDRKRYTSTK EVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPE EVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGGS GGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDA PEYKPWALVIQDSNGENKIKMLEGADKRTADGSEFESPKKKRKV (SEQ ID NO: 774);
[0221] MKRTADGSEFESPKKKRKVSSEVEFSHEYWMRHALTLAKRARDEGEAPVGAVLVLNNRVIGEGW VRRIGLHDPTAHAEIMALRQGGLVMQNPRLIDATLYVTFEPCVMCAGAMINSRIGRVVFGVRNS KRGAAGSLMNVLNYPGMNHRVEITEGILADECAALLCDFYRMPRQVFNAQKKAQSSINSGGSSG ATTORNEY DOCKET NO. 180802-47402 / PCT
[0222] ELECTRONIC DEPOSIT DATE: December 10, 2025 GSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTD RHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEE SFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGH FLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPG EKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKN LSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGY AGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAIL RRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGAS AQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDL LFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDI LEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTI LDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKV VDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQ NEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVP S EE VVKKMKNYWRQLLNAKL I TQRKFDNLTKAERGGLS ELDKAGF I KRQLVETRQ I TKHVAQ I L DSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIK KYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLI ETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWD PKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVK KDLI IKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQK QLFVEQHKHYLDEI IEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLG APAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGSGGSTNLSD I IEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWAL VIQDSNGENKIKMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDI LVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFESPKKKRK
[0223] V (SEQ ID NO: 775);
[0224] MKRTADGSEFESPKKKRKVMKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVDPETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVA EFLAEHSNVKLTIYTARLYYFWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFK RWKGLKYNFQSLTRRLREILQSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIG TNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRK NRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRK KLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINAS GVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSK DTYDDDLDNLLAQ I GDQ YADLFLAAKNLSDAI LLSD I LRVNTE I TKAPLS ASM I KRYDEHHQDL ATTORNEY DOCKET NO. 180802-47402 / PCT
[0225] ELECTRONIC DEPOSIT DATE: December 10, 2025 TLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNRE DLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNS RFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNEL TKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFN ASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLK RRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQ GDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRER MKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQ SFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGL SELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQF YKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKY FFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEV QTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFVSPTVAYSVLVVAKVEKGKSKKLKSVKELL GITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASARFLQKGNELAL PS KYVNFL YLASHYEKLKGS PEDNEQKQLFVEQHKHYLDE 11 EQ I S E FS KRVI LADANLDKVLS AYNKHRDKPIREQAENIIHLFTLTNLGAPRAFKYFDTTIDRKVYRSTKEVLDATLIHQSITGLY ETRIDLSQLGGDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILV HTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDI IEKETGK QLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGE NKIKMLSGGSKRTADGSEFESPKKKRKV (SEQ ID NO: 776);
[0226] MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVD PETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTI YEARLY YFWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLTRRLREI LQSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKK FKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDD SFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALA HMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLE NLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYA DLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIF FDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIH LGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFE EVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGE QKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFL DNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIR DKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKK ATTORNEY DOCKET NO. 180802-47402 / PCT
[0227] ELECTRONIC DEPOSIT DATE: December 10, 2025 GILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEH PVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKN RGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQI TKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNA VVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANG EIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKL IARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLE AKGYKEVKKDLI IKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKG S PEDNEQKQLFVEQHKHYLDE 11 EQ I S E FS KRVI LADANLDKVLS AYNKHRDKP I REQAENI IH LFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGS GGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDA PEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVI GNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEF ESPKKKRKV (SEQ ID NO: 777);
[0228] MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVD PETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTI YEARLY YFWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLTRRLREI LQSGGSSGGSSGSETPGTSESATPESSGGSSGGSGKRNYILGLAIGITSVGYGI IDYETRDVID AGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVK GLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERL KKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFG WKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQI TEN VFKQKKKPTLKQ TAKE I LVNEED I KGYRVTSTGKPE FTNLKVYHD I KD I TARKE 11 ENAELLDQ IAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELWHTNDNQ IAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIEL AREKNSKDAQKMINEMQKRNRQTNERIEEI IRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIP LEDLLNNPFNYEVDHI IPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKK HILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLD VKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFE EKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGN TLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETG NYLTKYS KKDNGPVI KKI KYYGNKLNAHLD I TDD YPNSRNKVVKLS LKP YRFDVYLDNGVYKFV TVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQAEFIASFYNNDLIKINGELYRVIGVNNDLLN RIEVNMIDITYREYLENMNDKRPPRIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKGG SPKKKRKVSSDYKDHDGDYKDHDIDYKDDDDKSGGSGGSGGSTNLSDIIEKETGKQLVIQESIL ATTORNEY DOCKET NO. 180802-47402 / PCT
[0229] ELECTRONIC DEPOSIT DATE: December 10, 2025 MLPEEVEEVIGNKPESDILVHEAYDESEDENVMLLESDAPEYKPWALVIQDSNGENKIKMLSGG SGGSGGSENLSDIIEKEEGKQLVIQESILMLPEEVEEVIGNKPESDILVHEAYDESEDENVMLL ESDAPEYKPWALVIQDSNGENKIKMLSGGSKREADGSEFESPKKKRKV (SEQ ID NO: 778); MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNEEWLCFEVEI IKQYLPVPWKKGVFRNQVD PEEHCHAEKCFLSWFCNNELSPKKNYQVEWYESWSPCPECAGEVAEFLAEHSNVKLEI YEARLY YFWDEDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLERRLREI LQSGGSSGGSSGSEEPGESESAEPESSGGSSGGSEKKYSIGLDIGENSVGWAVIEDDYKVPSKK FKVLGNENRKSIKKNLMGALLFDSGEEAEAERLKREARRRYERRKNRIRYLQEIFANEMAKLDD SFFQRLEESFLVEEDKKNERHPIFGNLADEVAYHRNYPEIYHLRKKLADSPEKADLRLIYLALA
[0230] H 11 KFRGHFL I EGKLNAENSDVAKLF YQL I QEYNQLFEES PLDE I E VDAKG I LS ARLS KS KRLE KLIAVFPNEKKNGLFGNI IALALGLEPNFKSNFDLEEDAKLQLSKDEYDDDLDELLGQIGDQYA DLFSAAKNLSDAILLSDILRSNSEVEKAPLSASMVKRYDEHHQDLALLKELVRQQFPEKYAEIF KDDEKNGYAGYVGADKKLRKRSGKLAEEEEFYKFIKPILEKMDGAEELLAKLNRDDLLRKQREF DNGSIPHQIHLKELHAILRRQEEFYPFLKENREKIEKILEFRIPYYVGPLARGNSRFAWLERKS EEAIEPWNFEEVVDKGASAQSFIERMENFDEQLPNKKVLPKHSLLYEYFEVYNELEKVKYVEER MRKPEFLSGEQKKAIVDLLFKENRKVEVKQLKEDYFKKIECFDSVEI IGVEDRFNASLGEYHDL LKI IKDKDFLDNEENEDILEDIVLELELFEDREMIEERLKEYAHLFDDKVMKQLKRRHYEGWGR LSRKMINGIRDKQSGKEILDFLKSDGFSNRNFMQLIHDDSLEFKEEIEKAQVSGQGDSLHEQIA DLAGSPAIKKGILQEVKIVDELVKVMGHKPENIVIEMARENQEEEKGLQQSRERKKRIEEGIKE LESQILKENPVENEQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFIKDDSIDN KVLERSVENRGKSDNVPSEEVVKKMKNYWRQLLNAKLIEQRKFDNLEKAERGGLSEADKAGFIK RQLVEERQIEKHVARILDSRMNEKRDKNDKPIREVKVIELKSKLVSDFRKDFQLYKVRDINNYH HAHDAYLNAVVGEALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKAEAKRFFYSNIMNFF KEEVKLANGEIRKRPLIEENGEEGEVVWNKEKDFAEVRKVLAMPQVNIVKKEEVQEGGFSKESI RPKRNSDKLIARKKDWDPKKYGGFLWPEVAYSVLVVAKVEKGKSKKLKSVKELLGIEIMERSSF EKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASAKQLQKGNELALPSKYVNFLYL ASHYEKLKGS PEDNEQKQLFVEQHKHYLDE 11 EQ I S E FS KRVI LADANLDKVLS AYNKHRDKP I REQAENIIHLFTLTRLGAPRAFKYFDTTIDPKQYRSTKEVLDATLIHQSITGLYETRIDLSQLG GDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDE NVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILM LPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGS KRTADGSEFESPKKKRKV (SEQ ID NO: 779);
[0231] MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVD PETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTI YEARLY YFWDEDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLERRLREI ATTORNEY DOCKET NO. 180802-47402 / PCT
[0232] ELECTRONIC DEPOSIT DATE: December 10, 2025 LQSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKK FKVLGNTDRHSIKKNLIGALLFDSGETAERTRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDD SFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALA HMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLE NLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYA DLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIF FDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIH LGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFE EVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGE QKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFL DNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIR DKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKK GILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEH PVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKN RGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQI TKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNA VVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANG EIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESIRPKRNSDKL IARKKDWDPKKYGGFLWPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLE AKGYKEVKKDLI IKLPKYSLFELENGRKRMLASAKQLQKGNELALPSKYVNFLYLASHYEKLKG S PEDNEQKQLFVEQHKHYLDE 11 EQ I S E FS KRVI LADANLDKVLS AYNKHRDKP I REQAENI IH LFTLTRLGAPRAFKYFDTTIDPKQYRSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGS GGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDA PEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVI GNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEF ESPKKKRKV (SEQ ID NO: 780);
[0233] MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVD PETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTI YEARLY YFWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLTRRLREI LQSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKK FKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDD SFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALA HMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLE NLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYA DLFLAAKNLSDAILLSDILRVNTEITKAPLSASMVKRYDEHHQDLTLLKALVRQQLPEKYKEIF ATTORNEY DOCKET NO. 180802-47402 / PCT
[0234] ELECTRONIC DEPOSIT DATE: December 10, 2025 FDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGI IPHQIH LGELHAILRRQGDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFE EVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGE QKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFL DNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRLRYTGWGRLSRKLINGIR DKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKK GILQTVKVVDELVKVMGGHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEH PVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKN RGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQI TKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNA VVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANG EIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKGNSDKL IARKKDWDPKKYGGFNSPTAAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIGFLE AKGYKEVKKDLI IKLPKYSLFELENGRKRMLASAGVLHKGNELALPSKYVNFLYLASHYEKLKG S PEDNEQKQLFVEQHKHYLDE 11 EQ I S E FS KRVI LADANLDKVLS AYNKHRDKP I REQAENI IH LFTLTNLGVPAAFKYFDTTIDKKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGS GGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDA PEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVI GNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEF ESPKKKRKV (SEQ ID NO: 781);
[0235] MKRTADGSEFESPKKKRKVMKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVDPETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVA EFLAEHSNVKLTIYTARLYYFWDTDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFK RWKGLKYNFQSLTRRLREILQSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKYSIGLAIG TNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRK NRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRK KLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINAS GVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSK DTYDDDLDNLLAQ I GDQ YADLFLAAKNLSDAI LLSD I LRVNTE I TKAPLS ASMVKRYDEHHQDL TLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNRE DLLRKQRTFDNGI IPHQIHLGELHAILRRQGDFYPFLKDNREKIEKILTFRIPYYVGPLARGNS RFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNEL TKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFN ASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLK RLRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQ ATTORNEY DOCKET NO. 180802-47402 / PCT
[0236] ELECTRONIC DEPOSIT DATE: December 10, 2025 GDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGGHKPENIVIEMARENQTTQKGQKNSRER MKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQ SFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGL SELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQF YKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKY FFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEV QTGGFSKESILPKGNSDKLIARKKDWDPKKYGGFNSPTVAYSVLVVAKVEKGKSKKLKSVKELL GITIMERSSFEKNPIGFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASASVLHKGNELAL PSKYVNFLYLASHYEKLKGSSEDNKQKQLFVEQHKHYLDEI IEQISEFSKRVILADANLDKVLS AYNKHRDKPIREQAENIIHLFTLTNLGASAAFKYFDTTIGRKLYTSTKEVLDATLIHQSITGLY ETRIDLSQLGGDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILV HTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDI IEKETGK QLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGE NKIKMLSGGSKRTADGSEFESPKKKRKV (SEQ ID NO: 782); and MKRTADGSEFESPKKKRKVssevef sheywmrhaltlakrardegeapvgavlvlnnrvigegw vrriglhdptahaeimalrqgglvmqnprlidatlyvtf epcvmcagaminsrigrvvf gvrns krgaagslmnvlnypgmnhrveitegiladecaallcdfyrmprqvf naqkkaqssinEGGSEE EEESGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETA EATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIV DEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQ LVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPN FKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKA PLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPI LEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKI LTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKV LPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKK IECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEER LKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHD DSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEM ARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQE LDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKL I TQRKFDNLTKAERGGLS ELDKAGF I KRQLVETRQ I TKHVAQ I LDSRMNTKYDENDKL I RE VKV ITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVR KMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATV RKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0237] ELECTRONIC DEPOSIT DATE: December 10, 2025 KVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRK RMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISE FSKRVILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPAAFKYFDTTIDRKRYTST KEVLDATLIHQSITGLYETRIDLSQLGGDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLP EEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGG SGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSD APEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFESPKKKRKV (SEQ ID NO: 792).
[0238] In any aspect or embodiment of the disclosure, the GALT polynucleotide is in a cell. In any aspect or embodiment of the disclosure, the cell is a mammalian cell. In any aspect or embodiment of the disclosure, the subject is a mammal. In any aspect or embodiment of the disclosure, the mammal is a human.
[0239] In any aspect or embodiment of the disclosure, the method results in lowering of the concentration of galactose- 1-P in the blood of the subject relative to an untreated subject. In any aspect or embodiment of the disclosure, the concentration of galactose-l-P in the blood of the subject is lowered to a concentration of less than 5 mg / dL.
[0240] In any aspect or embodiment of the disclosure, the method results in an increase in GALT polypeptide activity in a hepatocyte of the subject.
[0241] In any aspect or embodiment of the disclosure, the method further involves administering to a subject a lipid nanoparticle containing the base editor system.
[0242] In any aspect or embodiment of the disclosure, a target nucleotide is deaminated with an efficiency of about or at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, or 50%.
[0243] In any aspect or embodiment of the disclosure, the deaminase domain is modified to omit an N-terminal methionine.
[0244] In any aspect of the disclosure, or embodiments thereof, the lipid nanoparticle contains an ionizable lipid having the following structure, or a pharmaceutically acceptable salt thereof:
[0245]
[0246] In any aspect of the disclosure, or embodiments thereof, the LNP contains a molar ratio of ionizable nitrogen atoms in an ionizable lipid to the total negative charge in the payload (N: P ratio) of between about 5:1 and 7:1. In any aspect of the disclosure, or embodiments thereof, the LNP contains an N: P ratio of about 6:1. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0247] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0248] In any aspect of the disclosure, or embodiments thereof, the guide RNA contains a scaffold with the following nucleotide sequence:
[0249] GUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUC CGUU AU C AmAmCmU mU mGmAmAmAm AmAmGmU mGmGmCm AmCmCmGm AmGmU mCmGmGmU mGm CmUsmUsmUsmU (SEQ ID NO: 794), where “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS). In any aspect of the disclosure, or embodiments thereof, the guide RNA contains a spacer containing the nucleotide sequence UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705) or UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793). In any aspect of the disclosure, or embodiments thereof, the guide RNA contains a nucleotide sequence selected from those listed in Table 2. In any aspect of the disclosure, or embodiments thereof, the guide RNA contains one of the following nucleotide sequences: mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791);
[0250] where “mN” indicates a 2'-0Me modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS). In any aspect of the disclosure, or embodiments thereof, the guide RNA contains a spacer selected from those listed in Table 1.
[0251] In any aspect of the disclosure, or embodiments thereof, the napDNAbp is an SpCas9, an SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “FT indicates A, C, or T. In any aspect of the disclosure, or embodiments thereof, the napDNAbp is a nickase.
[0252] In any aspect of the disclosure, or embodiments thereof, the base editor polypeptide contains an amino acid sequence with at least about 90% identity to a sequence listed in Table 2.
[0253] In any aspect of the disclosure, or embodiments thereof, the cell is in a subject.
[0254] In any aspect of the disclosure, or embodiments thereof, the disease is galactosemia.
[0255] Definitions
[0256] Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this disclosure belongs. The ATTORNEY DOCKET NO. 180802-47402 / PCT
[0257] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0258] following references provide one of skill with a general definition of many of the terms used in this disclosure: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them below, unless specified otherwise.
[0259] By “6b polypeptide” is meant a deaminase capable of deaminating cytidine in DNA and comprising an amino acid sequence with at least 85% identity to the following amino acid sequence:
[0260] MSSEVEFSHEYWMRHALTLAKRARDEGEAPVGAVLVLNNRVIGEGWVRRIGLHDPTAHAEIMAL RQGGLVMQNPRL IDATL YVTFE PCVMCAGAM INSR I GRVVFGVRNS KRGAAGS LMNVLNYPGMN HRVEITEGILADECAALLCDFYRMPRQVFNAQKKAQSSIN (SEQ ID NO: 771), or a functional fragment thereof. In various embodiments, the 6b polypeptide may be inserted within a Cas9 polypeptide between amino acid positions 906 and 907 referenced to SEQ ID NO: 197 (where a base editor with a deaminase inserted at this position may be referred to as an “IBE34” base editor). In some embodiments, the 6b polypeptide lacks an N-terminal methionine.
[0261] By “6b polynucleotide” is meant a polynucleotide encoding an 6b polypeptide.
[0262] By “galactose- 1 -phosphate uridylyltransferase (GALT) polypeptide” is meant a protein with at least about 85% amino acid sequence identity to GenBank Accession No. AAC83409.1, which is provided below, or a fragment thereof. A functional GALT polypeptide is capable of metabolizing the conversion of galactose-l-P to glucose-l-P (see, e.g., FIG. 1). In some embodiments, GALT contains a pathogenic Q188R mutation that prevents functional protein activity required to break down galactose in a subject, which may be associated with a toxic accumulation of galactose metabolites (e.g., galacitol or Galactose-l-P) in a cell or subject. Amino acid position 188 is underlined in the below sequence. In various embodiment, the methods of the disclosure involve altering a GALT polypeptide to replace an R at amino acid position 188 with a Q.
[0263] > AAC83409.1 galactose- 1 -phosphate uridyl transferase [Homo sapiens] MSRSGTDPQQRQQASEADAAAATFRANDHQHIRYNPLQDEWVLVSAHRMKRPWQGQVEPQLLKT VPRHDPLNPLCPGAIRANGEVNPQYDSTFLFDNDFPALQPDAPSPGPSDHPLFQAKSARGVCKV MCFHPWSDVTLPLMSVPEIRAVVDAWASVTEELGAQYPWVQIFENKGAMMGCSNPHPHCQVWAS SFLPDIAQREERSQQAYKSQHGEPLLMEYSRQELLRKERLVLTSEHWLVLVPFWATWPYQTLLL PVGHVRRLPELTPAERDDLASIMKKLLTKYDNLFETSFPYSMGWHGAPTGSEAGANWNHWQLHA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0264] ELECTRONIC DEPOSIT DATE: December 10, 2025 HYYPPLLRSATVRKFMVGYEMLAQAQRDLTPEQAAERLRALPEVHYHLGQKDRETATIA (SEQ ID NO: 687)
[0265] By “galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide” is meant a nucleic acid molecule encoding a GALT polypeptide, as well as the introns, exons, 3' untranslated regions, 5' untranslated regions, and regulatory sequences associated with its expression, or fragments thereof. In embodiments, a GALT polynucleotide is the genomic sequence, cDNA, mRNA, or gene associated with and / or required for GALT expression. An exemplary GALT polynucleotide sequence is provided at ENSEMBL Gene Accession No. ENSG00000213930, which is provided below. A further exemplary GALT nucleotide sequence from Homo Sapiens is provided below (GenBank Accession No.: M60091.1:29-1168). In various embodiments, the methods of the disclosure involve deaminating a “C” complementary to a “G” at the nucleotide position corresponding to the position of the “A” underlined in the below sequences (i.e., deaminating the C complementary to coding position 563 (c. G563)).
[0266] >chromosome: GRCh38:9:34637533:34651635:l (ENSG000002139300, where GALT exons are shown in bold CTCTGCCCGCTCACCAGCGTACTGCCGCGCCAACTGCGCTATCTCTTCGCGCTGGAAGACGAAG CTCTGCGTACCCAGCCAGAGCCAGACGACCTGGGTCAGCACCGCTGCGACAGCCAGGAGCAGCG CGGCCCACGCCCACCGCCGGCCCACGGCCCACTGCATCCCGGCGGGCGGCCTGGCACGGCGCAG CTCAGGAGGGAGCCGGGGCCTGAGGCTTTGCGCTCACGGCCTCGGAGCCCGCCGGCTGCCCACG AAGCGCTCCCCCGCCACCGCCGTGGTACGGCTCGCCCTGACCAATCGGAGCGGCCCGGGCACCT TGGCACCGCCCTTCGGTTGAACCATTTCCTCGGGGGGAGCACGGAGGAGTGGGGCGGTGCCGCG CAAGGGAGGCAAGGGGGCGGGGCCAAGTCGCCCAGCGGCAGATGTCAGGGTCTTCAGGCGAGGT CTGACCTAGCTAGCGCCACTCTACATACGGATAGGTGTGGGGATAGTGAGACCCAGAATGGACG GGCTCGCCCGAGCTCACTGGCTGGTGAAGCAGACGAGCCCGGAAGTGAAGCCTTCTGGGACTCA ACGATTGTTAAGGCCCGCCTTTACCAGGACTGAAGCGGAGAGGAGGCAGTTCAAGCTGGGAGCC CCAGGGTGGGCATTCCAGAGGCGGTGACACCTGAGCAGTGTCTTTAAGAGCTCGCAGAAGGGGC ATGCCAGCCCACAGGGACCAGCCCCTAGAGCAAAGGCCTCCGAGGAGAGACAGCTTGGGGTCTT CAGGGGCCTACAATTCCTTCTCTGTTCTGCGGCACAAAATGTGAGAGGAGGCCTTGTAAACCAG CTGAGGGTTATCAGCAAGGGAGGAAGGGGATTAGATTTTGCCACCGAGTGGCACAGGACTGTCG AGTGGAGGGGAGCAAGATGAGTCAGGGAGACCATGACTGTGGGCCAGGTAAGAGGGAAGGTCTG TGTCAGTGGAGATGGAAGAAAGAAATGAGCTTAACAACTCTGGAAATGGAGTTGGCAGGGCTTG ATCTCTTCTTCCTGGAATACTCTTTCCCCCTTTATCCTATCCCCACTTTCCCCTTATTTGGCTA ACATCCTTTGCATGTCTAGTGGGCAGTTGAGTGTCCAGGTCTCGAAGCTCAGCTCCAGTCTGGA GATAGAGCTGGGAGTTGTCATTCATTCAGATACTTTGTTGGCACTTGTATGTGCCAAGTTCTGT ATTORNEY DOCKET NO. 180802-47402 / PCT
[0267] ELECTRONIC DEPOSIT DATE: December 10, 2025 GGTGGCCATTAGGGATATGGCTAGAAACAATGGTCCTACTCCCCAGCTTAGAGATCTCACTGGA GAGTCTACAGAAAACCTCTAAGAATGGATGAAATCTCACAGGGAGAGAGAATAGAGTAAGATAA GGCTAAAGACAGAGCCCTGGAGAGCACCAACAGCAAAGGGTAGGCAGAGGAAGAAGAAAGCAAA AAGGAAAAGAAACAAAAGAGAAAAGAGGTAGCATTGTGCAGTGATTAAGAAATAAACTCTGGAG CTGGACTGCCTGAGTTCAAGGCCTGGCTCCTGCTACCTACTAGTTGTCTGATCTTGGGCAGTTT ACTTAAAATCTCTGTAACTTAGTTTTCTTACCTGTAAAATGGAATGGAGATAAGTACCTACCTC ATAGTACATATCTTCATTGTGATGAGATTAGATGGGTTAATATTTATAAAGCAGTTAGTGCCTC ACATGGTGAGAATTTTATGTGATGGTTATATATATAAAACATAAAGGAACTGGAAAAAAGTAGC CTCATACATTTTTGACCTTAAAGAAAGAAACTGAGGCCGGGCGAGGTGGCTCATAACTGTAATC CCAACACTTTGGGAGGCTGAGGCAGACGGAACACTTGAGTCCAGGTATTCGAGACCAGCCTTGG CAACATGGTGAAACCCCATCTCTGCAAAAAGTACAAAAATTAGTCAGGAGTGGGGGCGCACACC TGTAGTCCCAGCTACTCAGGAGGCCGAAGTGGGAGGATCGCTTGAGCCTGGGAAGTGGAGGTTG CAGTTGAGCCAACAGAGGGAGACTCTGCCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGC CAGGCACAGTGGCACACACCTGTAGTCCCAGCACTTTGGGAGGCCAAGGTGGGCAGGCAGATTT CCTGAGCTCAGAAGTTCGAGACCACCCTGGGCAACATGGTGAAACCTCGTCTCTACTAAAATAC AAAAAATTAGCCGGGCGTGGTGGCAGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAAGCAGA AGAATCGCTTGAACCCAGAAGGTGGATGTTGCAATGAGCCAAGATCGTGCCACTGCACTCCAGC CTGGACAACAGAGGGAGACCCTGTCTCTAAAAAAAAAAGAAAAGAAAAAGGGGCCGGGCACTGT GGCTCACACCTGCAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTTGGGAA TTTGAGACCAGCCTAACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAATTAGCTGGG CGTGGTGGCGCATGCTTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACC CAGGAGGCAGAGGTTGCGGTGAGCCGAGATCACGCCATTGCACTGCAGCCTCAGTGTGAAACAG CAAAACTCCGTCTCAAAAAAAACAAAACAGGCTGGGCGGGGTGGCTCACGCCTGTAATCCCAGC GCTTTGGGAGGCCGAGGCAGGTGGATCACGTGGTCAGGAGTTCGAGACCAGCCTGACCAACATG GTGAAACCCCATCTCTACTAAAAAATACAAAAATTAGCCGGGCGCGGTGGCGCGTGCCACCCAC TTGGCCCAGCTACTTGGGGGGCTGAGGCAGGAGAATCGTTTGAACCCGGGAGGTGGAGGTTGCA GTGAGTGGAGACTGCACCATTGCACTCCAGCCTGGGCGACAGAGTGAGACTCTGTCTCAAAAAA GAAAAACACACAAACAAAAACAACAGTGAGGAAAAAAAAAAGCTGAGGCAAAATTAATATAGAG AGTTTATTTGGGCCAAGGTAGAGAACGACTGCTCGGGACACTTTTCCAAGTTGCCTTGGGGAAT GTTGATACAAGCATTTTTTATTTTTATATGTGGTTTTTATTTTTATTTTTTATTTTCTTGGAGA TGGATTCTTACTCTGTTGCCTAGGCTGGAATGCAGTAGTGCAATCTTGGCTCACTGCAACTCCC GCCTCCCAGGTTCAAGCTATCCTCCTCCCTCAGCCTCCCGAGTAGCTGGGACCACAGGTGTGCA CCACCATGCCTGGCTAATTTTTGTAGTTTTAGTGGAGACAGGTTTTTGCCATGTTGGCCAGGCT GGTCTCTAACTCCTGACCTCAGGTGATCCACACTCCTCGGCCTCCCAAAGTGTGGGATTACATG TGTGAGCCACTGCACCTGGCCACTTTATTTTAACTTAATTTAATTTTATTTTTTTTGAGATGGT ATTORNEY DOCKET NO. 180802-47402 / PCT
[0268] ELECTRONIC DEPOSIT DATE: December 10, 2025 GTCTCCACTGTCGCCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCAACCTCTGCCTC CCAGTTCAAGAAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCTGCCACCA CATCCAGCTAATTTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTC TTGAACTCTGACCTCGTGATTTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTATAGGCGTGAG CCACCGCGCCCGGCCTATTTTTATTTTTTGAGACCGGATCTTGCTCTGTTACCTAGGCTGGAGT TCAGTGGCACGATCACAGCTCATTGCAGCCTTGACCTCTTGGGCTCAAGGGATCATCCTGCCTT GGCCTCCCAAAGTGCTGGAATTACAGGTATGAACCATCATGCCCAGCCACAAGCAAGTTTTTAA AGGCAAAAGAGGACAAGGAGTGGGCTGACACAAAATTGTTTAACAGGAATTCTCATTGGCTTAC AGAAACAACATTGATTAGTGATTGGCTATACCTTGTTGAACTATAGGGTATGAGTTATGGTGTC TAGACTATGGCATTTTATGGCTATGTGGTATCAATTTAGAGCCCACATAGCAAGTGGCTTCAAG GGGTAATTATTTGGCTCCATGGGGGTGGGGGAAGGTGAGATATGACTGCTGTTACATGCTGTCACTTTTTTTTTTTTTTTTTTTTGAGACTGAGTCATACTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTGAAGCGATTCTCCTGCCTCCGCCACCC AAGTAGCTGGAATTACAGGCACCTGCCACTATGCTCAGCTTTTTGTATTTTTAGTAGTGACAGG GTTTCACCATAATGGCCAGGCTGGTCTCAAACTCCTGACCTCAAGTGATCCCGCCCGCCTTGGT TTCCCAGTGTGCTGGGATTACAAGTGTGAGCCACTGCACCCGGCCTTTTTTTTTCTTTTTAAAA TTATTATTATTATTATTATTATTATTTTGAGACGGAGTCTCTTGCACTGTCGCCGGGGCTGGAG TGCAGTGGCATGATCCTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCT CAGCCTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCCAACTAATTTTTTGTATTTT TAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCGTGATTCGC CCACCTTGGCCTCCCAAATTGCTGGGATTACAGGCGTGAGCCACCGCGCCAGGCCCATGCTGTC ACATTTTAATGCCTCTCTGGGCCTGATAATTAAAAGAGGCTGGCATTTCTCAGATAAAAAGTTT CTTTTCTTTCTCAAGAGTGATGGAATGGGCCAGGCACGGTGGCTCACACCTGTAATCCCAGCAC TTTGGGAGGCCGAGGTAGGGGAATCACCTGAGGTTGCCAGTTCAAGATCCAGCCTGACCAACAT GGAGAAACCCCATCTCTTCTAAAAATACAAAATTAGCCAGGCGTGGTGGCGCATACCTGTAATC CCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAGGTTATGGTGAG CTGAGAGCACGCCATTGCACTCCAGTTTGGGCAATGAGAGCGAAACTCCATCTAAAAAAATTAA AAAAAGAGTGATGGAATGAAGGGTTAACACAGTGAAATATAATGGTGAGATTCAATGAGATGAT GGGAAATGATCATTAGACTTCTCAGTAGGGGCATGTGGCAGAGACAATGCTATACATTCACAAA TCCATTTCCTTTTGCTCTTGGACAGCTAGATTACATTCCACAGTCTACTCTGGATCATGTGATT GCATTCTGGCCAATGAAATATAGATAGAAATGATGTATGACATTTCTAGGGCTGGCTCCTAAAG TCTCCTCTGAAATTCTTTTTTTTTTTTTTCTCTTTCTCTTTTTAAGAGACAGGATTGTGCTATG TCACTCCCAGACTGGTATCAAACTCTTGGGCTCAAGTGATCTCCCACCCTGGCTTCCCAAGAGC TGGGGGACTACAGGCACATAGTACTATGCCCAGCTTGAAGTTCTTTACATTTTCTTTCTTCCCA TGTCTATTACTGTAGTTATAGAGGATCTCTAGTGGAGTATTCTGAGACCCTAGGACATGGCATA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0269] ELECTRONIC DEPOSIT DATE: December 10, 2025 GCTACTAGATGGATGGCACCTGGATCCCTAAATACTGCTTGGAACAGAGCTCCCTGTTACCCCC AGAATTGACTGTAGTGCTAAGCCATTGAGTTTTGGGAGTGTTTGTTACAGCAATTAAGATTACT TACCCTGACAAATATAGGGCAAAAGATGTGTTACAATATACCATGCAGCTAAGAGTAGAGACCA GAAAGATGGCCTCTGTTTTTAAGAAACTTGGCAGGCCGGGCACGGTGGCTCACGCCTGTAATCC CAATACTTTGGGAGGCAGAGGCAGGCGGATCACCTGAGGTCAGGAATTTGGGACCAGCCTGGCC AACGTGGTGAAACCCTGTCTCTACTACAAATACAAAAATTAGCTGGGTGTGGTGGTGCACGCCT GTAATCCCAGCTACAGGGGAGGCTGAGGCAGGAGAATCACTTGAATCAGGAGGCGGATGTTGCA GTGAGCCTAGATTGCACCACTGTACTCCAGCCTGGGCGACAGAGCGAGACTCCATCTAAAAAAA AAAAAAAAAAAAAAAAAAAAGAAACTTGGCAGAGAAATTAAGGATAAAGAAAGGATTATCTAGA GGAGAATACAGGATTGAGGGAACATGTTTTTGTATTGGTTCTGGCTTCTGTGGCATTCCCGACT CGACAATTCACTAAGAACCCTGGGCAGCAGGGATTAGCCCCCAAATTTGGTCTCTGCTCCCCAT TCCAGGAACAGCTTTCTGGGATCATCTGGGCCTACACCTCCCCTATCTGGTCTTTCAGAAAGCT TTTCCAGGCCTTTCCAGGTGTGGTGACTTATGTCTGTAATCCCAGCACTTTGGGAAGGCCGAGA TGGACAGATTACCTGAGGTCAGGCGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCTGTCTC TAAAAAATACAAAAATTAGCTAGTCATGATGGTGGGTGCCTGTAATTCCAGCTACTCGGGAGGC TGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCGGTGAGCCGAGATCAAGCCACTG CACTCCAGCCTGGGCAACAGAGTGAAACTCCGTCTTAAAAAAAAAAAATTATATATATATATAT ATATATATATATATATATATATATATATATATATATATATATATAAATGGATAACCAGCAGCCC TCAGGGGTGCTCTGTCTATGGAGTGGCCATTTTTTTATTCCTTCACTTTCCTGATAAACTTGCT TTCACTTCACACTATGGACTCACCTTGAATTCTTTCTTGTGTGAGATCCAAGAATCCTCTCTTG GGGTCTGGATTGGGATCTCTTTACTGTAACAACACCTCTGGGATCTGGCAGGGAGGGAGCTGGT TAGGCCCATCTTAGTCATTCTCCTCTTCCTCCAAAAAATGACCTTGGGGAGTCATAGAGGAGAA TTATGGAGATGTTCCTGAAGAGATGTTCCTGATGGGGGTCAGGATTTAATTAGCATGATTCAGG GGGCGGCACTGAAGCACTGGTGTTCTTGGGAAAATGGAGCTGTCTACAAGCTCTAGTCACTCAA ACAAATATAAATGAGAGAATGGAAAAGCAGATCTCCTGTTCCTGTCTCACCTCTTTGGGTAATA TGAACAGCCAGTGGGACAGGTAAAACCCCTGTTTCTCAGTCCTCAGCTCCCTGCCCCCTACCTC TGAAGGTAGCCTCTTCCTATTCCAGTCTGAGGACCCTAGGGCAGAGATCAAGTAAATGACCCAT GACCTAGTTTTTGCAGCTCAATTGCAAAACACCTGGCTTGGCTTGGTCTTGATTGGGGGAAGGG ATGTCCCTGCTATGGCTCAACTTCTAACTCACTGGGTCTCTGTCCTGCATTCCACTGAAGCCTC AGAGGCATCTCACAGCACTGGGAAAAGCCTTGGAAACTTGCATTCAAATCTCAGCTTGGCCACT AATTCTCTGGGTGACCTTGAGTCCTCTGGGCTCCAATGTGAAATGGGGAAGAGGTTTAGACAAT ATTTAGATGGCTCTTTAAAAGTACTGGGGGCCAGCTGTTTGGGAGGCTGAGGCAGGAGGATCAC TTGAAGTCAGGAGTTCAAGACCAGCCTGGCCAACATGTGAAACCCCGTCTCTACTAAAAATACA TAAATTAGCCCAGTGTGGTGGCACACAGCTCTAATTCCAGCTACTTGGAGGCTGAGGCAGGAGA ATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCAAGATCACGCCACTGCACTCCAGCCTG ATTORNEY DOCKET NO. 180802-47402 / PCT
[0270] ELECTRONIC DEPOSIT DATE: December 10, 2025 GGTGGCAGAGTGAGACTCTCTCTCAAACAAACAAACAAACAAAAGTACCAGGGGAGGAATTAAT TTGAATTTTATCCTAGTGTTAGCCAATTGGTCCCATCCAAGGAAAATTTAGAAAAGGGAAGGGG ATGTGTAAAGGAAACACTAGGCCCCACCTAGATGGTGGCTGGAGCTTCTGATAGTCCTGTACTC TCCACATTTTTAGACTTTCTTGTACTTTTTTTTTTTTTTTTGTGACGGAGTCTGCTCTGTCGCC AGGCTAGAGTGCATGGGCACAATCTCGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGTGATT CTCCTGCCTCAGCCTCCCGAGTAGGTGGGACTACAGGTGCACGCCACCAAGACCAGCTAATTTTTTTTTTTTTTTTTTTTACTAGAGACGGGGTTTCACCATGTTGGCCAGGATGGTCTCAATTTCTTGACCTTGTGATCCGCCTGCCTTGGTCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACCGAGC CCGGCCATTTTTAGATTTTTTTTTATAGCCCTTTTACCCCTCCCACTTCTTGTGCTGTCTTGGT GTGTGTATGAAAAAGAGAGAGAGAGAGAGAGGAAGGAGAGCAAGAGTGAGAGAAAAAGAGAATA TGGTTACTTAGCCCTTTTTCCCAAATCTGTTTTTGCTTTTGGGGATAGGATATGGTTATGGTGA AGTACTCCACACAGACCCCACCAAGTTTGCCTCTCCTGCGCTACAGCCACTGAGCAGGTGACTG AGGCGCTGTTGATCCAGGGTCATATCCTTACTCGCCCTAGAATGTAAGCTCAGGCAGGGCAGAG GCCATGCCTGACATGTGCATCCCAATGCTTTACACAGCCTAGGTGCCTAGCACATGCTAGATGC TTGGTAAATATTTGCTGAATGAAAGATCAAATGAATGATTGCAGCAAGCAAGTCCTGTAGGCAT CCTGGAGCCCAAGGATTCTGCAGTAGGCAGCTTTCACAGAGGTTCTTCCAGTGTAGTGGCTCTA GCTCTGGGTGAAGTAGGATCATCAATGTCGGCCCCCAGGGTTCACAGCTGTTCTGAGCCCCGCC CCCAGGTGGCAGGGCAGCCCAGTCAGTCAGTCACGTGCTGGCGGCTGGCCAATCATCGGGGGCG GCGCGGGGAGGGGTGGTGTGGACGGAGAAAGTGAAAGGTGAGGCACGGCCCTGCAGATTTTCCA GCGGATCCCCCGGTGGCCTCATGTCGCGCAGTGGAACCGATCCTCAGCAACGCCAGCAGGCGTC AGAGGCGGACGCCGCAGCAGCAACCTTCCGGGCAAACGGTAACTGCACCGCGGCAGGGACTCGC TGGGGCGCGGAGCCGAGCCCTCCCCTTCCTTAGGAAGCTTTCGTCCCCTCCGAAGGTTGGAACG CTCATCCCGAGCCAGACCGACAAGGCGTACAGTCTGCAGGCCTGTACGAGCAGCAGGCCAATTG GCGCTGGGAAAGTCCAATCCTGGGCCTCTAGCTCCTGAGCGGGACAGGGCCGAGAGGGCGCTCC CGAGCTTGGGCCTGCTGGTGGGTGAGACCCAGGAGAGAGGGAGCTAGAGAGCTCTGAGGACTGA TCTTGACTGTCTGCCCCCAGACCATCAGCATATCCGCTACAACCCGCTGCAGGATGAGTGGGTG CTGGTGTCAGCTCACCGCATGAAGCGGCCCTGGCAGGGTCAAGTGGAGCCCCAGCTTCTGAAGA CAGTGCCCCGCCATGACCCTCTCAACCCTCTGTGTCCTGGGGCCATCCGAGCCAACGGAGAGGT AAGCCTGTAGAGCCCTGCATCTGCAGGCTGGGCCACGGGGAGTAGTTCCCTCTTAGAACTGTCC TCCACCCACAGGGATAGTGAACCTCCTTCTGGGTCATATCCCACCAAGCTTTTTGGTCCCCTAG GGTGGGCCTTCCCTACTCCCTTGTAGCCTGTCCAGTCTTTGAAGCCCACCAGGTAACTGGTGGT ATGGGGCAGTGAGTGCTTCTAGCCTATCCTTGTCGGTAGGTGAATCCCCAGTACGATAGCACCT TCCTGTTTGACAACGACTTCCCAGCTCTGCAGCCTGATGCCCCCAGTCCAGGTAACCTGGCTCC AACTGCTGCTGGGGAGGAGGGTGGCTAGACCTCTTGAGGGACTTCTGCTGCAGAGAGTGATACT CCTTTACCTCAGGACCCAGTGATCATCCCCTTTTCCAAGCAAAGTCTGCTCGAGGAGTCTGGTA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0271] ELECTRONIC DEPOSIT DATE: December 10, 2025 ACTATGGATTTCCCCTCTTACAACTTTCAAACCAGAGTTGGAGACTCAGCATTGGGGTTCGGCC CTGCCCGTAGCACAGCCAAGCCCTACCTCTCGGTTATCTTTTCTCCCGTCACCACCCAGTAAGG TCATGTGCTTCCACCCCTGGTCGGATGTAACGCTGCCACTCATGTCGGTCCCTGAGATCCGGGC TGTTGTTGATGCATGGGCCTCAGTCACAGAGGAGCTGGGTGCCCAGTACCCTTGGGTGCAGGTT TGTGAGGTCGCCCCTTCCCCTGGATGGGCAGGGAGGGGGTGATGAAGCTTTGGTTCTGGGGAGT AACATTTCTGTTTCCACAGGGTGTGGTCAGGAGGGAGTTGACTTGGTGTCTTTTGGCTAACAGA GCTCCGTATCCCTATCTGATAGATCTTTGAAAACAAAGGTGCCATGATGGGCTGTTCTAACCCC CACCCCCACTGCCAGGTAAGGGTGTCAGGGGCTCCAGTGGGTTTCTTGGCTGAGTCTGAGCCAG CACTGTGGACATGGGAACAGGATTAATGGATGGGACAGAGGAAATATGCCAATGATGTGGAGGC TTGGAGGTAAAGGACCTGCCTGTTCTTCTCTGCTTTTGCCCCTTGACAGGTATGGGCCAGCAGT TTCCTGCCAGATATTGCCCAGCGTGAGGAGCGATCTCAGCAGGCCTATAAGAGTCAGCATGGAG AGCCCCTGCTAATGGAGTACAGCCGCCAGGAGCTACTCAGGAAGGTGGGAGAGAGCCAAGCCCT GTGTCCCCAAGGAGTCCCTAACTTTCTTATCCCATGAGAGAGGTGTGTAAAGGAGAAAGCTAGA GGTGAACTAGTAGAGAGAGACTTGCTAGGAGGCCTTAGCAATAATCCAGTAATCTAAAGGAAAG ATGATGGTGACTTAGACTCGGGTGGTTAGTGGTAGAGGTGGTGAGAAGACATCAGATCCTGGGC ACATTCTTTTCTTCTGCTTCCCTTGCCTATTTGCTGACCACACTCCGGCTCCTATGTCACCTTG ATGACTTCCTATCCATTCTGTCTTCCTAGGAACGTCTGGTCCTAACCAGTGAGCACTGGTTAGT ACTGGTCCCCTTCTGGGCAACATGGCCCTACCAGACACTGCTGCTGCCCCGTCGGCATGTGCGG CGGCTACCTGAGCTGACCCCTGCTGAGCGTGATGGTCAGTCTCCCAAGTAGGATCCTGGGGCTA GGCACTGGATGGAGGTTGCTCCCAGTAGGGTCAGCATCTGGACCCCAGGCTGAGAGTCAGGCTC TGATTCCAGATCTAGCCTCCATCATGAAGAAGCTCTTGACCAAGTATGACAACCTCTTTGAGAC GTCCTTTCCCTACTCCATGGGCTGGCATGGTGAGGCTTTTCAAGTACCTATATTTAGCCCCAAC ACCATTTCTGGGCTCCTGGGGCTCAGCCTAGTGAACTGCAACCTCAAAGGAGCAAGCCTTGAAA CAGTTGCTGGGGGAAGTGGCCAGAGTAGAGATGCTGGGACTGAGGGTGGAGCAGCAAACTTGGT GAAACTACATCTCCAATGTGCTTTCTAATCTCCTGCCAGCTCTTCTCAAGCAGGGGATCCTGGG AGATGTAGTTTTCAGATACCTGGTTGGGTTTGGGAGTAGGTGCTAACCTGGATAACTGTAAAAG GGCTCTCTCTCCCCACTGTCTCTCTTCTTTCTGTCAGGGGCTCCCACAGGATCAGAGGCTGGGG CCAACTGGAACCATTGGCAGCTGCACGCTCATTACTACCCTCCGCTCCTGCGCTCTGCCACTGT CCGGAAATTCATGGTTGGCTACGAAATGCTTGCTCAGGCTCAGAGGGACCTCACCCCTGAGCAG GTCAGGACTCAGAACAGTCTGGCGTCTCCAGACTCTCACATGCAGTATGTGCAGGCACCTGATA CTTCTGTTGCCCTTGTGCTCCAGTCATTGCACAAGGCAGAAAACAGCTCTGGCAGGAAGGGACT GCCAAAGTTAGGAGCCCTAGGGCCTGGAAGGAGAGTATGGTCCTCAGATCCCCCTTCTCTCCTG CTTCCTCCAGGGAACCCAACAGTCATGACCCTGATAGTTTCCCATAACAACCTGGGCATTCCTT GGGACTCAGGAGCTGCTAAACTCTTTCATCCCCTGGTGGCTTCAGCAGTCCTTATCACCAGCCT CACAATCCCACAGGCCCACCCCCAGTGGGCCTGTGGCATTCATATTTCATATTCATATTTCAAA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0272] ELECTRONIC DEPOSIT DATE: December 10, 2025 CCACAATATCCAGCAAAATGTCTCCTGAGCACCCAGAACTCCATACCATCGGCCGGGTGTGGTG GCTCATGCCTTAATCCCAGCACTTTGGGAGGTCAAGATGGGAGGATTGCTTGAGCCCAGAAGTT CGAGACTAGCCTGGGAAACATAGGAAGCCCTCGTCTCTACAAAAAAAATTTAAAAAGTTAGCCA GGTATGGTGGCATATACTTGTGGTCCCAGATACTTGGGAGGCTGAGATAGGATCACTTGGGCCC AGGAGTTTGAGGCTGCAGTGAGCCATCATCATGGCATCATTGCATTCCAGCCTGGGCAACAGAG CAAGACCTCGTCTCAAAAAAAAAAAAAAAATGAAGTCCATGCCACCATTCTTGGCAGCCCAGCC CTTATCCTCCTTAATTGCTCCCTGTCCCTTTTCCAGGCTGCAGAGAGACTAAGGGCACTTCCTG AGGTTCATTACCACCTGGGGCAGAAGGACAGGGAGACAGCAACCATCGCCTGACCACGCCGACC ACAGGGCCTTGAATCCTTTTTTGTTTTCAACAGTCTTGCTGAATTAAGCAGAAAGGGCCTTGAA TCCTGGCCTGGAATTTGGGCAGATATAGCATTAATAAAACTGTGCATCTCAAACTTTTATCACA TACTCTAATATCAGAGGAGTGTGAACCTTCAGAGATCTAGGGTTAAAAGCTAAAGGCATAGCTC CAGCTACAACTTTTCTTTGTCTACAGATGTGTAAAATCTTGATGCAAAAATCTACCCCAAATTT CTGAACAAAATTAATATTCAGTATTATATCTAGCCTATAGATTCTCTTACTCTTGGTGTAATGA CAGCATTCCCAGTTTAGGAAACTGTTTTAAAATCTTGTGTCTTGGTTGTGGCTGAGGCTTTGGG AAGGCCAGAGCCCCCTTTGCCACCACCCCTAGGGTGTCAGCTCCAAGCCTGGGCCTGAGGGTTT GATGGGGAGCGGGTGAGCTGGATCCTGATCCCAGTGGTCAGTGGCCAGATCCTGGGCTGCTGGC CAGAACCCTGGCTTTGTGAGTGGCTGCAGCTTGGCTCTGCTCCAGCCCACAGGGGCTAAAGGGG CGGGTAGGGAGTGTGTAGGGAGCGTGTACGTGGCTTTGTGTCCTTGGCTCCTCCAGACTCTCAC ATGTGCGGGCACCATATACTTCTGCTGCCCTTGTGCCCCAATCATTGCACAAACAGAAACAGCT CTGACAGAGAAGGGACTGCTAGAGTTAGGAACCCTAGGGCCTGGAAGGAAGGTATGGCTCTCAG AGTGACCTGGGGTCTATTCAGTCCCCTCTGGACCTGTTTCCCTGGCAACGCTGTGGGAACAACA GTCTATGCAACACAGGAGACCTGATGCCACTGGTGTCCTCCAAATTTTAAGGCGAAGGCAGGCA TTGTGAATGATGCAGGGATGGCCAGAATGGTTTTTGCCTTCGTTCTTTCCCCAGGGGAACCAAG CCTGTGGGTGGGGAAATCACACTAAGGCTGGACAGTGACTGAATCTGGCCCAGGAGAAACTTGG CTGCCTGCCTTCCAGACCAGGGTTTTTCTTCCCTGAGATCCTGGAGGGGAAGTACCTGGAACTT GGGAGTGTTGCCCTGGTCTGGAAGATCTGATAGACATAGTATGCACACATTTCCCTCGAGCTTC CCATGACTGGGGAGTAGATGAGA (SEQ ID NO: 688)
[0273] > M60091.1:29-l 168 Homo sapiens galactose- 1 -phosphate uridyl transferase (GALT) mRNA, complete cds ATGTCGCGCAGTGGAACCGATCCTCAGCAACGCCAGCAGGCGTCAGAGGCGGACGCCGCAGCAG CAACCTTCCGGGCAAACGACCATCAGCATATCCGCTACAACCCGCTGCAGGATGAGTGGGTGCT GGTGTCAGCTCACCGCATGAAGCGGCCCTGGCAGGGTCAAGTGGAGCCCCAGCTTCTGAAGACA GTGCCCCGCCATGACCCTCTCAACCCTCTGTGTCCTGGGGCCATCCGAGCCAACGGAGAGGTGA ATCCCCAGTACGATAGCACCTTCCTGTTTGACAACGACTTCCCAGCTCTGCAGCCTGATGCCCC ATTORNEY DOCKET NO. 180802-47402 / PCT
[0274] ELECTRONIC DEPOSIT DATE: December 10, 2025 CAGTCCAGGACCCAGTGATCATCCCCTTTTCCAAGCAAAGTCTGCTCGAGGAGTCTGTAAGGTC ATGTGCTTCCACCCCTGGTCGGATGTAACGCTGCCACTCATGTCGGTCCCTGAGATCCGGGCTG TTGTTGATGCATGGGCCTCAGTCACAGAGGAGCTGGGTGCCCAGTACCCTTGGGTGCAGATCTT TGAAAACAAAGGTGCCATGATGGGCTGTTCTAACCCCCACCCCCACTGCCAGGTATGGGCCAGC AGTTTCCTGCCAGATATTGCCCAGCGTGAGGAGCGATCTCAGCAGGCCTATAAGAGTCAGCATG GAGAGCCCCTGCTAATGGAGTACAGCCGCCAGGAGCTACTCAGGAAGGAACGTCTGGTCCTAAC CAGTGAGCACTGGTTAGTACTGGTCCCCTTCTGGGCAACATGGCCCTACCAGACACTGCTGCTG CCCGTCGGCCATGTGCGGCGGCTACCTGAGCTGACCCCTGCTGAGCGTGATGATCTAGCCTCCA TCATGAAGAAGCTCTTGACCAAGTATGACAACCTCTTTGAGACGTCCTTTCCCTACTCCATGGG CTGGCATGGGGCTCCCACAGGATCAGAGGCTGGGGCCAACTGGAACCATTGGCAGCTGCACGCT CATTACTACCCTCCGCTCCTGCGCTCTGCCACTGTCCGGAAATTCATGGTTGGCTACGAAATGC TTGCTCAGGCTCAGAGGGACCTCACCCCTGAGCAGGCTGCAGAGAGACTAAGGGCACTTCCTGA GGTTCATTACCACCTGGGGCAGAAGGACAGGGAGACAGCAACCATCGCCTGA (SEQ ID NO: 689)
[0275] By “adenine” or “ 9Z7-Purin-6-amine” is meant a purine nucleobase with the molecular
[0276]
[0277] formula C5H5N5, having the structure, and corresponding to CAS No. 73-24-5.
[0278] By “adenosine” or “ 4-Amino-l-[(2A,3A,45,5A)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2(177)-one” is meant an adenine molecule attached to a NH2
[0279] ribose sugar via a glycosidic bond, having the structure
[0280]
[0281] OH OH corresponding to CAS No. 65-46-3. Its molecular formula is C10H13N5O4.
[0282] By “adenosine deaminase” or “adenine deaminase” is meant a polypeptide or fragment thereof capable of catalyzing the hydrolytic deamination of adenine or adenosine. In some embodiments, the deaminase or deaminase domain is an adenosine deaminase catalyzing the hydrolytic deamination of adenosine to inosine or deoxy adenosine to deoxyinosine. In some ATTORNEY DOCKET NO. 180802-47402 / PCT
[0283] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0284] embodiments, the adenosine deaminase catalyzes the hydrolytic deamination of adenine or adenosine in deoxyribonucleic acid (DNA). The adenosine deaminases (e.g., engineered adenosine deaminases, evolved adenosine deaminases) provided herein may be from any organism (e.g., eukaryotic, prokaryotic), including but not limited to algae, bacteria, fungi, plants, invertebrates (e.g., insects), and vertebrates (e.g., amphibians, mammals). In some embodiments, the adenosine deaminase is an adenosine deaminase variant with one or more alterations and is capable of deaminating both adenine and cytosine in a target polynucleotide (e.g., DNA, RNA) and may be referred to as a “dual deaminase”. Non-limiting examples of dual deaminases include those described in PCT / US22 / 22050. In some embodiments, the target polynucleotide is single or double stranded. In some embodiments, the adenosine deaminase variant is capable of deaminating both adenine and cytosine in DNA. In some embodiments, the adenosine deaminase variant is capable of deaminating both adenine and cytosine in singlestranded DNA. In some embodiments, the adenosine deaminase variant is capable of deaminating both adenine and cytosine in RNA. In embodiments, the adenosine deaminase variant is selected from those described in PCT / US2020 / 018192, PCT / US2020 / 049975, PCT / US2017 / 045381, PCT / US2021 / 016827, PCT / US2022 / 073781, PCT / US24 / 34189, or PCT / US2020 / 028568, the full contents of which are each incorporated herein by reference in their entireties for all purposes. Further non-limiting examples of adenosine deaminases include those disclosed or referenced in Rufflow, etal., “Design of highly functional genome editors by modeling of the universe of CRISPR-Cas Sequences,” bioRxiv, posted April 22, 2024, doi: 10.1101 / 2024.04.22.590591, the disclosure of which is incorporated herein by reference in its entirety for all purposes, which were designed using artificial intelligence. Further exemplary adenosine deaminase amino acid sequences include: TadA-8e (SEQ ID NO: 470), Tadl (SEQ ID NO: 471), Tad2 (SEQ ID NO: 472), Tad3 (SEQ ID NO: 473), Tad4 (SEQ ID NO: 474), Tad6 (SEQ ID NO: 475), Tad6-SR (SEQ ID NO: 476), TadA9 (SEQ ID NO: 477), TadA20 (SEQ ID NO: 478), Staphylococcus aureus Tad A (SEQ ID NO: 479), Bacillus sublilis Tad A (SEQ ID NO: 480), Salmonella typhimurium Tad A (SEQ ID NO: 481), Shewanella putrefaciens (SEQ ID NO: 482), Haemophilus influenzae F3031 Tad A (SEQ ID NO: 483), Caulobacter crescentus TadA (SEQ ID NO: 484), Geobacter sulfiureducens TadA (SEQ ID NO: 485), Streptococcus pyogenes TadA (SEQ ID NO: 486), Aquifex aeolicus TadA (SEQ ID NO: 487), and E. coll TadA deaminase (ecTadA) (SEQ ID NO: 488).
[0285] By “adenosine deaminase activity” is meant catalyzing the deamination of adenine or adenosine to guanine in a polynucleotide. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0286] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0287] By “Adenosine Base Editor (ABE)” is meant a base editor comprising an adenosine deaminase.
[0288] By “Adenosine Base Editor (ABE) polynucleotide” is meant a polynucleotide encoding an ABE.
[0289] “Administering” is referred to herein as providing one or more compositions described herein to a patient or a subject. By way of example and without limitation, composition administration (e.g., injection) can be performed by intravenous (i.v.) injection, sub-cutaneous (s.c.) injection, intradermal (i.d.) injection, intraperitoneal (i.p.) injection, or intramuscular (i.m.) injection. One or more such routes can be employed. Parenteral administration can be, for example, by bolus injection or by gradual perfusion over time. In some embodiments, parenteral administration includes infusing or injecting intravascularly, intravenously, intramuscularly, intraarterially, intrathecally, intratumorally, intradermally, intraperitoneally, transtracheally, subcutaneously, subcuticularly, intraarticularly, subcapsularly, subarachnoidly and intrasternally. Alternatively, or concurrently, administration can be by the oral route.
[0290] By “agent” is meant any small molecule chemical compound, antibody, lipid nanoparticle, nucleic acid molecule, polypeptide, polypeptide complex, or functional fragments thereof. In an embodiment, the agent is a base editor system described herein or a component thereof. In some embodiments, the agent is a lipid nanoparticle of the disclosure. In some embodiments, the lipid nanoparticle contains a base editor system of the disclosure. The base editor system contains a base editor, or a polynucleotide (e.g., an mRNA molecule) encoding the base editor, and a guide RNA, or a polynucleotide encoding the guide RNA.
[0291] By “alteration” is meant a change in the level, structure, or activity of an analyte, gene or polypeptide as detected by standard art known methods such as those described herein. As used herein, an alteration includes a change (e.g., increase or reduction) in expression levels. In embodiments, the increase or reduction in expression levels is by 10%, 25%, 40%, 50% or greater. In some embodiments, an alteration includes an insertion, deletion, or substitution of a nucleobase or amino acid (by, e.g., genetic engineering).
[0292] By “ameliorate” is meant reduce, suppress, attenuate, diminish, arrest, or stabilize the development or progression of a disease. In embodiments, the disease is galactosemia, an autosomal recessive inborn error of metabolism associated with a deficiency of galactose- 1-phosphate uridylyltransferase (GALT). Editing of a GALT gene in a subject having galactosemia restores GALT activity, thereby ameliorating symptoms associated with the disease.
[0293] The term “aliphatic” or “aliphatic group”, as used herein, means a straight-chain (i.e., unbranched) or branched, substituted or unsubstituted hydrocarbon chain that is completely ATTORNEY DOCKET NO. 180802-47402 / PCT
[0294] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0295] saturated or that contains one or more units of unsaturation, or a monocyclic hydrocarbon or bicyclic hydrocarbon that is completely saturated or that contains one or more units of unsaturation, but which is not aromatic (also referred to herein as “carbocycle,” “carbocyclic”, “cycloaliphatic” or “cycloalkyl”), that has a single point of attachment to the rest of the molecule. Unless otherwise specified, aliphatic groups contain 1-6 aliphatic carbon atoms. In some embodiments, aliphatic groups contain 1-5 carbon atoms. In some embodiments, aliphatic groups contain 1-4 carbon atoms. In some embodiments, aliphatic groups contain 1-3 carbon atoms, and in some embodiments, aliphatic groups contain 1-2 carbon atoms. In some embodiments, “carbocyclic” (or “cycloaliphatic” or “carbocycle” or “cycloalkyl”) refers to an optionally substituted monocyclic Cs-Cs hydrocarbon, or an optionally substituted C6-C12 bicyclic hydrocarbon, that is completely saturated or that contains one or more units of unsaturation, but which is not aromatic, that has a single point of attachment to the rest of the molecule. Suitable aliphatic groups include, but are not limited to, linear or branched, substituted or unsubstituted alkyl, alkenyl, alkynyl groups and hybrids thereof such as (cycloalkyl)alkyl, (cycloalkenyl)alkyl or (cycloalkyl)alkenyl.
[0296] As used herein, the term “alkenyl” refers to an alkyl group, as defined herein, having one or more double bonds. In some embodiments, the term “alkenyl”, used alone or as part of a larger moiety, refers to an optionally substituted straight or branched hydrocarbon chain having at least one double bond and having (unless otherwise specified) 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, 2-4, or 2-3 carbon atoms (e.g., C2-20, C2-18, C2-16, C2-14, C2-12, C2-10, C2-8, C2-6, C2-4, or C2-3). Exemplary alkenyl groups include ethenyl, propenyl, butenyl, pentenyl, hexenyl, and heptenyl.
[0297] The term “alkenylene” refers to a bivalent alkenyl group. A substituted alkenylene chain is a polymethylene group containing at least one double bond in which one or more hydrogen atoms are replaced with a substituent. Suitable substituents include those described below for a substituted aliphatic group.
[0298] As used herein, the term “alkyl” is given its ordinary meaning in the art and may include saturated aliphatic groups, including straight-chain alkyl groups, branched-chain alkyl groups, cycloalkyl (alicyclic) groups, alkyl substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. In some embodiments, alkyl has 1-100 carbon atoms. In certain embodiments, a straight chain or branched chain alkyl has about 1-20 carbon atoms in its backbone (e.g., C1-C20 for straight chain, C2-C20 for branched chain), and alternatively, about 1-10. In some embodiments, a cycloalkyl ring has from about 3-10 carbon atoms in their ring structure where such rings are monocyclic or bicyclic, and alternatively about 5, 6 or 7 carbons in the ring ATTORNEY DOCKET NO. 180802-47402 / PCT
[0299] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0300] structure. In some embodiments, an alkyl group may be a lower alkyl group, wherein a lower alkyl group comprises 1-4 carbon atoms (e.g., C1-C4 for straight chain lower alkyls).
[0301] The term “alkylenyl” or “alkylene” refers to a bivalent alkyl group (i.e., a bivalent saturated hydrocarbon chain) that is a straight-chain (i.e., unbranched) or branched, substituted or unsubstituted. Any of the above mentioned monovalent alkyl groups may be an alkylenyl by abstraction of a second hydrogen atom from the alkyl. In some embodiments, an “alkylenyl” is a polymethylene group, i.e., -(CH2)n-, wherein n is a positive integer, preferably from 1 to 10, from 1 to 9, from 1 to 8, from 1 to 7, from 1 to 6, from 1 to 5, from 1 to 4, from 1 to 3, from 1 to 2, from 2 to 5, or from 4 to 8. A substituted alkylenyl is a polymethylene group in which one or more methylene hydrogen atoms are replaced with a substituent. Suitable substituents include those described below for a substituted aliphatic group.
[0302] As used herein, the term “alkynyl” refers to an alkyl group, as defined herein, having one or more triple bonds. In some embodiments, the term “alkynyl”, used alone or as part of a larger moiety, refers to an optionally substituted straight or branched chain hydrocarbon group having at least one triple bond and having (unless otherwise specified) 2-20, 2-18, 2-16, 2-14, 2-12, 2-10, 2-8, 2-6, 2-4, or 2-3 carbon atoms (e.g., C2-20, C2-18, C2-16, C2-14, C2-12, C2-10, C2-8, C2-6, C2-4, or C2-3). Exemplary alkynyl groups include ethynyl, propynyl, butynyl, pentynyl, hexynyl, and heptynyl.
[0303] By “analog” is meant a molecule that is not identical but has analogous functional or structural features. For example, a polypeptide analog retains the biological activity of a corresponding naturally-occurring polypeptide, while having certain biochemical modifications that enhance the analog’s function relative to a naturally occurring polypeptide. Such biochemical modifications could increase the analog’s protease resistance, membrane permeability, or half-life, without altering, for example, ligand binding. An analog may include an unnatural amino acid.
[0304] The term “aryl” refers to monocyclic and bicyclic ring systems having a total of six to fourteen ring members (e.g., C6-14), wherein at least one ring in the system is aromatic and wherein each ring in the system contains three to seven ring members. The term “aryl” may be used interchangeably with the term “aryl ring”. In some embodiments, “aryl” refers to an aromatic ring system which includes, but is not limited to, phenyl, naphthyl, anthracyl and the like, which may bear one or more substituents. Unless otherwise specified, “aryl” groups are hydrocarbons.
[0305] By “base editor (BE),” or “nucleobase editor polypeptide (NBE)” is meant an agent that binds a polynucleotide and has nucleobase modifying activity. In various embodiments, the base ATTORNEY DOCKET NO. 180802-47402 / PCT
[0306] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0307] editor comprises a nucleobase modifying polypeptide (e.g., a deaminase) and a polynucleotide programmable nucleotide binding domain (e.g., Cas9 or Cpfl). Representative nucleic acid and protein sequences of base editors include those sequences having about or at least about 85% sequence identity to any base editor sequence provided in the sequence listing, such as those corresponding to SEQ ID NOs: 2-11.
[0308] By “BE4 cytidine deaminase (BE4) polypeptide,” is meant a base editor comprising a nucleic acid programmable DNA binding protein (napDNAbp) domain, a cytidine deaminase domain, and two uracil glycosylase inhibitor domains (UGIs). In embodiments, the napDNAbp is a Cas9n (DIO A) polypeptide. Non-limiting examples of cytidine deaminase domains include rAPOBEC, ppAPOBEC, RrA3F, AmAPOBECl, and SsAPOBEC3B.
[0309] By “BE4 cytidine deaminase (BE4) polynucleotide,” is meant a polynucleotide encoding a BE4 polypeptide.
[0310] By “base editing activity” is meant acting to chemically alter a base within a polynucleotide. In one embodiment, a first base is converted to a second base. In one embodiment, the base editing activity is cytidine deaminase activity, e.g., converting target OG to T»A. In another embodiment, the base editing activity is adenosine or adenine deaminase activity, e.g., converting A»T to G»C.
[0311] By “base editing efficiency” is meant the total percent of one or more target bases in a sample that have been modified using a base editor. In some cases, the base editing efficiency is calculated as the total percent of target polynucleotides in a sample containing a modified target base. In some instances, the base editing efficiency is calculated as the total percent of target polynucleotides in a sample containing a modification to one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9 or 10) of 2, 3, 4, 5, 6, 7, 8, 9, or 10 target bases. Methods for measuring base editing efficiency for a base editor are known in the art (see, e.g., Gaudelli, etal. Nature 551:464-471 (2017), the disclosure of which is incorporated herein in its entirety for all purposes). In some cases a base editing efficiency is a median base editing efficiency calculated across 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100 or more target sites.
[0312] By “base editing window” for a base editor is meant bases within a target polynucleotide sequence that can be modified using the base editor. In some embodiments, the position of the nucleobases in the target polynucleotide sequence are numbered relative to a protospacer adjacent motif (PAM) for which a nucleic acid programmable DNA binding protein (napDNAbp) domain of the base editor has specificity, where base 1 corresponds to the base immediately adjacent to the PAM. In some embodiments, the position of the nucleobases in the target polynucleotide sequence are numbered relative to the 5' or 3' end of a spacer of a guide ATTORNEY DOCKET NO. 180802-47402 / PCT
[0313] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0314] polynucleotide used to guide a nucleic acid programmable DNA binding protein (napDNAbp) domain of the base editor to a target site, where base 1 corresponds to the 5' or 3' terminal base of the spacer.
[0315] The term “base editor system” refers to an intermolecular complex for editing a nucleobase of a target nucleotide sequence. In various embodiments, the base editor (BE) system comprises (1) a polynucleotide programmable nucleotide binding domain, a deaminase domain (e.g., cytidine deaminase or adenosine deaminase) for deaminating nucleobases in the target nucleotide sequence; and (2) one or more guide polynucleotides (e.g., guide RNA) in conjunction with the polynucleotide programmable nucleotide binding domain. In various embodiments, the base editor (BE) system comprises a nucleobase editor domain selected from an adenosine deaminase or a cytidine deaminase, and a domain having nucleic acid sequence specific binding activity. In some embodiments, the base editor system comprises (1) a base editor (BE) comprising a polynucleotide programmable DNA binding domain and a deaminase domain for deaminating one or more nucleobases in a target nucleotide sequence; and (2) one or more guide RNAs in conjunction with the polynucleotide programmable DNA binding domain. In some embodiments, the polynucleotide programmable nucleotide binding domain is a polynucleotide programmable DNA binding domain. In some embodiments, the base editor is a cytidine base editor (CBE). In some embodiments, the base editor is an adenine or adenosine base editor (ABE). In some embodiments, the base editor is an adenine or adenosine base editor (ABE) or a cytidine or cytosine base editor (CBE). In some embodiments, the base editor system (e.g., a base editor system comprising a cytidine deaminase) comprises a uracil glycosylase inhibitor or other agent or peptide (e.g., a uracil stabilizing protein such as provided in WO2022015969, the disclosure of which is incorporated herein by reference in its entirety for all purposes) that inhibits the inosine base excision repair system.
[0316] As used herein, the term “bivalent” refers to a chemical moiety with two points of attachment. For example, a “bivalent Ci-s (or Ci-e) saturated or unsaturated, straight or branched, hydrocarbon chain”, refers to bivalent alkylene, alkenylene, and alkynylene chains that are straight or branched as defined herein.
[0317] As used herein, the term “bridged bicyclic” refers to any bicyclic ring system, i.e. carbocyclic or heterocyclic, saturated or partially unsaturated, having at least one bridge. As defined by IUPAC, a “bridge” is an unbranched chain of atoms or an atom or a valence bond connecting two bridgeheads, where a “bridgehead” is any skeletal atom of the ring system which is bonded to three or more skeletal atoms (excluding hydrogen). In some embodiments, a bridged bicyclic group has 7-12 ring members and 0-4 heteroatoms independently selected from ATTORNEY DOCKET NO. 180802-47402 / PCT
[0318] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0319] nitrogen, oxygen, or sulfur. Such bridged bicyclic groups are well known in the art and include those groups set forth below where each group is attached to the rest of the molecule at any substitutable carbon or nitrogen atom. Unless otherwise specified, a bridged bicyclic group is optionally substituted with one or more substituents as set forth for aliphatic groups.
[0320] Additionally or alternatively, any substitutable nitrogen of a bridged bicyclic group is optionally substituted. Exemplary bridged bicyclics include but are not limited to:
[0321]
[0322] The terms “carbocyclyl,” “carbocycle,” and “carbocyclic ring” as used herein, refer to saturated or partially unsaturated cyclic aliphatic monocyclic, bicyclic, or polycyclic ring systems, as described herein, having from 3 to 14 members, wherein the aliphatic ring system is optionally substituted as described herein. Carbocyclic groups include, without limitation, cyclopropyl, cyclobutyl, cyclopentyl, cyclopentenyl, cyclohexyl, cyclohexenyl, cycloheptyl, cycloheptenyl, cyclooctyl, cyclooctenyl, norbornyl, adamantyl, and cyclooctadienyl. In some embodiments, “carbocyclyl” (or “cycloaliphatic”) refers to an optionally substituted monocyclic Cs-Cs hydrocarbon, or an optionally substituted C6-C12 bicyclic hydrocarbon that is completely saturated or that contains one or more units of unsaturation, but which is not aromatic, that has a single point of attachment to the rest of the molecule. The term “cycloalkyl” refers to an optionally substituted saturated ring system of about 3 to about 10 ring carbon atoms. In some embodiments, cycloalkyl groups have 3-6 carbons. Exemplary monocyclic cycloalkyl rings include cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, and cycloheptyl. The term “cycloalkenyl” refers to an optionally substituted non-aromatic monocyclic or multicyclic ring ATTORNEY DOCKET NO. 180802-47402 / PCT
[0323] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0324] system containing at least one carbon-carbon double bond and having about 3 to about 10 carbon atoms. Exemplary monocyclic cycloalkenyl rings include cyclopentenyl, cyclohexenyl, and cycloheptenyl.
[0325] The term “Cas9” or “Cas9 domain” refers to an RNA guided nuclease comprising a Cas9 protein, or a fragment thereof (e.g., a protein comprising an active, inactive, or partially active DNA cleavage domain of Cas9, and / or the gRNA binding domain of Cas9). A Cas9 nuclease is also referred to sometimes as a casnl nuclease or a CRISPR (clustered regularly interspaced short palindromic repeat) associated nuclease.
[0326] The term “conservative amino acid substitution” or “conservative mutation” refers to the replacement of one amino acid by another amino acid with a common property. A functional way to define common properties between individual amino acids is to analyze the normalized frequencies of amino acid changes between corresponding proteins of homologous organisms (Schulz, G. E. and Schirmer, R. EL, Principles of Protein Structure, Springer-Verlag, New York (1979)). According to such analyses, groups of amino acids can be defined where amino acids within a group exchange preferentially with each other, and therefore resemble each other most in their impact on the overall protein structure (Schulz, G. E. and Schirmer, R. EL, supra). Nonlimiting examples of conservative mutations include amino acid substitutions of amino acids, for example, lysine for arginine and vice versa such that a positive charge can be maintained; glutamic acid for aspartic acid and vice versa such that a negative charge can be maintained; serine for threonine such that a free -OH can be maintained; and glutamine for asparagine such that a free -NH2 can be maintained.
[0327] Amino acids generally can be grouped into classes according to the following common side- chain properties:
[0328] (1) hydrophobic: Norleucine, Met, Ala, Vai, Leu, He;
[0329] (2) neutral hydrophilic: Cys, Ser, Thr, Asn, Gin;
[0330] (3) acidic: Asp, Glu;
[0331] (4) basic: His, Lys, Arg;
[0332] (5) residues that influence chain orientation: Gly, Pro;
[0333] (6) aromatic: Trp, Tyr, Phe.
[0334] In some embodiments, conservative substitutions can involve the exchange of a member of one of these classes for another member of the same class. In some embodiments, nonconservative amino acid substitutions can involve exchanging a member of one of these classes for another class. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0335] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0336] The term “coding sequence” or “protein coding sequence” as used interchangeably herein refers to a segment of a polynucleotide that codes for a protein. Coding sequences can also be referred to as open reading frames. The region or sequence is bounded nearer the 5' end by a start codon and nearer the 3' end with a stop codon. Stop codons useful with the base editors described herein include the following: TAG, TAA, and TGA.
[0337] By “complex” is meant a combination of two or more molecules whose interaction relies on inter-molecular forces. Non-limiting examples of inter-molecular forces include covalent and non-covalent interactions. Non-limiting examples of non-covalent interactions include hydrogen bonding, ionic bonding, halogen bonding, hydrophobic bonding, van der Waals interactions (e.g., dipole-dipole interactions, dipole-induced dipole interactions, and London dispersion forces), and 7t-effects. In an embodiment, a complex comprises polypeptides, polynucleotides, or a combination of one or more polypeptides and one or more polynucleotides. In one embodiment, a complex comprises one or more polypeptides that associate to form a base editor (e.g., base editor comprising a nucleic acid programmable DNA binding protein, such as Cas9, and a deaminase) and a polynucleotide (e.g., a guide RNA). In an embodiment, the complex is held together by hydrogen bonds. It should be appreciated that one or more components of a base editor (e.g., a deaminase, or a nucleic acid programmable DNA binding protein) may associate covalently or non-covalently. As one example, a base editor may include a deaminase covalently linked to a nucleic acid programmable DNA binding protein (e.g., by a peptide bond).
[0338] Alternatively, a base editor may include a deaminase and a nucleic acid programmable DNA binding protein that associate noncovalently (e.g., where one or more components of the base editor are supplied in trans and associate directly or via another molecule such as a protein or nucleic acid). In an embodiment, one or more components of the complex are held together by hydrogen bonds.
[0339] By “cytosine” or “4-Aminopyrimidin-2(U7)-one” is meant a purine nucleobase with the o
[0340] N^NH
[0341] molecular formula C4H5N3O, having the structure
[0342]
[0343] 2,anc[ corresponding to CAS No. 71-30-7. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0344] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0345] By “cytidine” is meant a cytosine molecule attached to a ribose sugar via a glycosidic
[0346]
[0347] bond, having the structure, and corresponding to CAS No. 65-46-3. Its molecular formula is C9H13N3O5.
[0348] By “Cytidine Base Editor (CBE)” is meant a base editor comprising a cytidine deaminase. Non-limiting examples of cytidine deaminase base editor amino acid sequences include amino acid sequences for BE4max (SEQ ID NO: 553), YE1-BE4 (SEQ ID NO: 554), YE2-BE4 (SEQ ID NO: 555), YEE-BE4 (SEQ ID NO: 556), EE-BE4 (SEQ ID NO: 557), R33A-BE4 (SEQ ID NO: 558), R33A+K34A-BE4 (SEQ ID NO: 559), APOBEC3A (A3A)-BE4 (SEQ ID NO: 560), APOBEC3B (A3B)-BE4 (SEQ ID NO: 561), APOBEC3G (A3G)-BE4 (SEQ ID NO: 562), AID-BE4 (SEQ ID NO: 563), CDA-BE4 (SEQ ID NO: 564), FERNY-BE4 (SEQ ID NO: 565), evolved APOBEC3A (eA3A)-BE4 (SEQ ID NO: 566), AALN-BE4 (SEQ ID NO: 567), BE4max modified with SpCas9-NG (SEQ ID NO: 568), YEl-SpCas9-NG (YE1-NG) (SEQ ID NO: 569), YE2-SpCas9-NG (SEQ ID NO: 570), YEE-SpCas9-NG (SEQ ID NO: 571), EE-SpCas9-NG (SEQ ID NO: 572), R33A+K34A-SpCas9-NG (SEQ ID NO: 573), YE1-CP1028 (YE1-BE4-CP1028, or YE1-CP) (SEQ ID NO: 574), YE2-CP1028 (YE2-BE4-CP1028) (SEQ ID NO: 575), YEE-CP1028 (YEE-BE4-CP1028) (SEQ ID NO: 576), EE-CP1028 (EE-BE4-CP1028) (SEQ ID NO: 577), R33A+K34A-CP1028 (R33A+K34A-BE4-CP1028) (SEQ ID NO: 578), BE4max (with nickase) (SEQ ID NO: 597), BE4 (SEQ ID NO: 598), BE4 with His tag (SEQ ID NO: 599), BE4max (SEQ ID NO: 600), AncBE4max 689 (SEQ ID NO: 601), and AncBE4max 687 (SEQ ID NO: 602). Exemplary CBE sequences are also provided at SEQ ID NOs: 774 to 782, and 792.
[0349] By “Cytidine Base Editor (CBE) polynucleotide” is meant a polynucleotide encoding a CBE. Non-limiting examples of polynucleotide sequences encoding cytidine deaminase base editors include those encoding BE4max (SEQ ID NO: 616), AncBE4max689 (SEQ ID NO: 617), and AncBE4max687 (SEQ ID NO: 618).
[0350] By “cytidine deaminase” or “cytosine deaminase” is meant a polypeptide or fragment thereof capable of deaminating cytidine or cytosine. In embodiments, the cytidine or cytosine is present in a polynucleotide. In one embodiment, the cytidine deaminase converts cytosine to ATTORNEY DOCKET NO. 180802-47402 / PCT
[0351] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0352] uracil or 5-methylcytosine to thymine. In some embodiments, a cytidine deaminase is 6b, RrA3f, or T1.52. The terms “cytidine deaminase” and “cytosine deaminase” are used interchangeably throughout the application. Petromyzon marinus cytosine deaminase 1 (PmCDAl) (SEQ ID NO: 12-13), Activation-induced cytidine deaminase (AICDA) (SEQ ID NOs: 14-20), and APOBEC (SEQ ID NOs: 21-61) are exemplary cytidine deaminases. Further exemplary cytidine deaminase (CD A) sequences are provided in the Sequence Listing as SEQ ID NOs: 62-66 and SEQ ID NOs: 67-189. Non-limiting examples of cytidine deaminases include those described in PCT / US20 / 16288, PCT / US2018 / 021878, 180802-021804 / PCT, PCT / US2018 / 048969, PCT / US2016 / 058344, PCT / US2020 / 062428, and PCT / US2019 / 033848, the disclosures of which are incorporated herein by reference in their entireties for all purposes. Non-limiting examples of cytidine deaminase amino acid sequences include amino acid sequences for Rat APOBEC 1 (SEQ ID NO: 579), Human APOBEC1 (SEQ ID NO: 580), Human APOBEC3 (SEQ ID NO: 581), Human APOBEC3B (SEQ ID NO: 582), Human APOBEC3G (SEQ ID NO: 583), evoAPOBEC3A(eA3A) (SEQ ID NO: 584), evoCDA (SEQ ID NO: 585), evoAPOBECl (SEQ ID NO: 586), YE1 (SEQ ID NO: 587), YE2 (SEQ ID NO: 588), YEE (SEQ ID NO: 589), EE (SEQ ID NO: 590), R33A (SEQ ID NO: 591), R33A+K34A (SEQ ID NO: 592), AALN (SEQ ID NO: 593), FERNY (SEQ ID NO: 594), evoFERNY (SEQ ID NO: 595), APOBEC (SEQ ID NO: 619), Anc686 APOBEC (SEQ ID NO: 620), Human APOBEC-3GD316R D317R (SEQ ID NO: 621), Human APOBEC-3G chain A (SEQ ID NO: 622), Human APOBEC3-G chain A D120R D121R (SEQ ID NO: 623), Mouse APOBEC3 (SEQ ID NO: 624), Rat APOBEC3 (SEQ ID NO: 625), Rhesus macaque APOBEC-3G (SEQ ID NO: 626), Chimpanzee APOBEC-3G (SEQ ID NO: 627), Green Monkey APOBEC-3G (SEQ ID NO: 628), Human APOBEC-3G (SEQ ID NO: 629), Human APOBEC-3F (SEQ ID NO: 630), Human APOBEC-3B (SEQ ID NO: 631), Rat APOBEC-3B (SEQ ID NO: 632), Bovine APOBEC-3B (SEQ ID NO: 633), Chimpanzee APOBEC-3B (SEQ ID NO: 634), Gorilla APOBEC-3C (SEQ ID NO: 635), Human APOBEC-3A (SEQ ID NO: 636), Rhesus macaque APOBEC-3A (SEQ ID NO: 637), Bovine APOBEC-3A (SEQ ID NO: 638), Human APOBEC-3H (SEQ ID NO: 639), Human APOBEC-3D (SEQ ID NO: 640), Rat ABOPEC1 (SEQ ID NO: 641), Anc689 APOBEC (SEQ ID NO: 642), Anc687 APOBEC (SEQ ID NO: 643), Anc686 APOBEC (SEQ ID NO: 644), Anc655 APOBEC (SEQ ID NO: 645), and Anc733 APOBEC (SEQ ID NO: 646).
[0353] By “cytidine deaminase polynucleotide” is meant a polynucleotide encoding a cytidine deaminase. Non-limiting examples of polynucleotide sequences encoding cytidine deaminase domains include those encoding Rat APOBEC 1 (SEQ ID NO: 604), Anc689 APOBEC (SEQ ID NO: 605), Anc687 APOBEC (SEQ ID NO: 606), Anc686 APOBEC (SEQ ID NO: 607), Anc655 ATTORNEY DOCKET NO. 180802-47402 / PCT
[0354] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0355] APOBEC (SEQ ID NO: 608), Anc733 APOBEC (SEQ ID NO: 609), Rat APOBEC1 (SEQ ID NO: 610), Anc689 APOBEC (SEQ ID NO: 611), Anc687 APOBEC (SEQ ID NO: 612), Anc686 APOBEC (SEQ ID NO: 613), Anc655 APOBEC (SEQ ID NO: 614), and Anc733 APOBEC (SEQ ID NO: 615).
[0356] By “cytosine deaminase activity” is meant catalyzing the deamination of cytosine or cytidine. In one embodiment, a polypeptide having cytosine deaminase activity converts an amino group to a carbonyl group. In an embodiment, a cytosine deaminase converts cytosine to uracil ( / .<., C to U) or 5-methylcytosine to thymine ( / .<., 5mC to T). In some embodiments, a cytosine deaminase as provided herein has increased cytosine deaminase activity (e.g., at least 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold or more) relative to a reference cytosine deaminase.
[0357] The term “deaminase” or “deaminase domain,” as used herein, refers to a protein or fragment thereof that catalyzes a deamination reaction.
[0358] The term “detect” refers to identifying the presence, absence or amount of the analyte to be detected. In one embodiment, a sequence alteration in a polynucleotide or polypeptide is detected. In another embodiment, the presence of indels is detected. In one embodiment, galactose- 1 -phosphate uridylyltransferase (GALT) activity is detected. In another embodiment, levels of galactose and / or galactose metabolites (e.g., galactitol or galactose- 1-P) in a biological sample (e.g., blood, serum, plasma, tissue) are detected.
[0359] By “disease” is meant any condition or disorder that damages or interferes with the normal function of a cell, tissue, or organ. Exemplary diseases include galactosemia.
[0360] By “effective amount” is meant the amount of an agent (e.g., a base editor, cell) as described herein, that is required to ameliorate the symptoms of a disease relative to an untreated patient or an individual without disease, i.e., a healthy individual, or is the amount of the agent sufficient to elicit a desired biological response. The effective amount of active compound(s) used to practice embodiments of the present disclosure for therapeutic treatment of a disease varies depending upon the manner of administration, the age, body weight, and general health of the subject. Ultimately, the attending physician or veterinarian will decide the appropriate amount and dosage regimen. Such amount is referred to as an “effective” amount. In one embodiment, an effective amount is the amount of a base editor of the disclosure sufficient to introduce an alteration in a gene of interest in a cell (e.g., a cell in vitro or in vivo). In one embodiment, an effective amount is the amount of a base editor required to achieve a therapeutic effect. Such therapeutic effect need not be sufficient to alter a pathogenic gene in all cells of a subject, tissue or organ, but only to alter the pathogenic gene in about 1%, 5%, 10%, 25%, 50%, ATTORNEY DOCKET NO. 180802-47402 / PCT
[0361] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0362] 75% or more of the cells present in a subject, tissue or organ. In one embodiment, an effective amount is sufficient to ameliorate one or more symptoms of a disease. For example, an effective amount of a base editor described herein is delivered to a tissue of a patient where it increases the level of expression of functional galactose- 1 -phosphate uridylyltransferase (GALT), thereby reducing the accumulation of galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) in the serum of the patient. In some cases, an effective amount of a base editor described herein is delivered to a tissue of a patient where it increases the level of expression galactose- 1-phosphate uridylyltransferase (GALT) activity, thereby reducing the accumulation of galactose and / or galactose metabolites (e.g., galactitol or galactose- 1-P) in the serum of the patient.
[0363] The term “encapsulated” is used herein to refer to substances that are completely surrounded by another material. In an embodiment, an mRNA molecule encoding a base editor and / or a guide polynucleotide is encapsulated by a lipid nanoparticle of the disclosure.
[0364] By “fragment” is meant a portion of a polypeptide or nucleic acid molecule. This portion contains, at least about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the entire length of the reference nucleic acid molecule or polypeptide. A fragment may contain 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 nucleotides or amino acids. In some embodiments, the fragment is a functional fragment.
[0365] By “glycome” is meant the complete set of carbohydrates contained in an organism, cell, tissue, or organ).
[0366] By “guide polynucleotide” is meant a polynucleotide or polynucleotide complex which is specific for a target sequence and can form a complex with a polynucleotide programmable nucleotide binding domain protein (e.g., Cas9 or Cpfl). In an embodiment, the guide polynucleotide is a guide RNA (gRNA). gRNAs can exist as a complex of two or more RNAs, or as a single RNA molecule.
[0367] By “guide polynucleotide” is meant a polynucleotide or polynucleotide complex which is specific for a target sequence and can form a complex with a polynucleotide programmable nucleotide binding domain protein (e.g., Cas9 or Cpfl). In an embodiment, the guide polynucleotide is a guide RNA (gRNA). gRNAs can exist as a complex of two or more RNAs, or as a single RNA molecule.
[0368] The term “haloaliphatic” refers to an aliphatic group substituted by one or more halogen atoms (e.g., one, two, three, four, five, six, or seven halo, such as fluoro, iodo, bromo, or chloro). In some embodiments, haloaliphatic groups contain 1-7 halogen atoms. In some embodiments, haloaliphatic groups contain 1-5 halogen atoms. In some embodiments, haloaliphatic groups contain 1-3 halogen atoms. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0369] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0370] The term “haloalkyl” refers to an alkyl group substituted by one or more halogen atoms (e.g., one, two, three, four, five, six, or seven halo, such as fluoro, iodo, bromo, or chloro). In some embodiments, haloalkyl groups contain 1-7 halogen atoms. In some embodiments, haloalkyl groups contain 1-5 halogen atoms. In some embodiments, haloalkyl groups contain 1-3 halogen atoms.
[0371] The term “heteroalkylenyl” or “heteroalkylene”, as used herein, denotes an optionally substituted straight-chain ( / .<?., unbranched), or branched bivalent alkyl group (i.e., bivalent saturated hydrocarbon chain) having, in addition to carbon atoms, from one to five heteroatoms. The term “heteroatom” is described below. In some embodiments, heteroalkylenyl groups contain 2-10 carbon atoms wherein 1-3 carbon atoms are optionally and independently replaced with heteroatoms selected from oxygen, nitrogen, and sulfur. In some embodiments, heteroalkylenyl groups contain 2-8 carbon atoms wherein 1-3 carbon atoms are optionally and independently replaced with heteroatoms selected from oxygen, nitrogen, and sulfur. In some embodiments, heteroalkylenyl groups contain 4-8 carbon atoms, wherein 1-3 carbon atoms are optionally and independently replaced with heteroatoms selected from oxygen, nitrogen, and sulfur. In some embodiments, heteroalkylenyl groups contain 2-5 carbon atoms, wherein 1-2 carbon atoms are optionally and independently replaced with heteroatoms selected from oxygen, nitrogen, and sulfur. In yet other embodiments, heteroalkylenyl groups contain 1-3 carbon atoms, wherein 1 carbon atom is optionally and independently replaced with a heteroatom selected from oxygen, nitrogen, and sulfur. Suitable heteroalkylenyl groups include, but are not limited to -CH2O-, -(CH2)2O-, -CH2OCH2-, -O(CH2)2-, -(CH2)3O-, -(CH2)2OCH2-, -CH2O(CH2)2-, -O(CH2)3-, -(CH2)4O-, -(CH2)3OCH2-, -CH2O(CH2)3-, -(CH2)2O(CH2)2-, -O(CH2)4-. Unless otherwise specified, Cx heteroalkylenyl refers to heteroalkylenyl having x number of carbon atoms prior to replacement with heteroatoms.
[0372] The terms “heteroaryl” and “heteroar-”, used alone or as part of a larger moiety, e.g., “heteroaralkyl”, or “heteroaralkoxy”, refer to monocyclic or bicyclic ring groups having 5 to 10 ring atoms (e.g., 5- to 6-membered monocyclic heteroaryl or 9- to 10-membered bicyclic heteroaryl); having 6, 10, or 147t electrons shared in a cyclic array; and having, in addition to carbon atoms, from one to five heteroatoms. Exemplary heteroaryl groups include, without limitation, thienyl, furanyl, pyrrolyl, imidazolyl, pyrazolyl, triazolyl, tetrazolyl, oxazolyl, isoxazolyl, oxadiazolyl, thiazolyl, isothiazolyl, thiadiazolyl, pyridyl, pyridonyl, pyridazinyl, pyrimidinyl, pyrazinyl, indolizinyl, purinyl, naphthyridinyl, pteridinyl, imidazo[l,2-a]pyrimidinyl, imidazo[l,2-a]pyridinyl, thienopyrimidinyl, triazol opyridinyl, and benzoisoxazolyl. The terms “heteroaryl” and “heteroar-”, as used herein, also include groups in ATTORNEY DOCKET NO. 180802-47402 / PCT
[0373] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0374] which a heteroaromatic ring is fused to one or more aryl, cycloaliphatic, or heterocyclyl rings, where the radical or point of attachment is on the heteroaromatic ring (i.e., a bicyclic heteroaryl ring having 1 to 3 heteroatoms). Nonlimiting examples include indolyl, isoindolyl, benzothienyl, benzofuranyl, dibenzofuranyl, indazolyl, benzimidazolyl, benzothiazolyl, benzothiadiazolyl, benzoxazolyl, quinolyl, isoquinolyl, cinnolinyl, phthalazinyl, quinazolinyl, quinoxalinyl, H quinolizinyl, carbazolyl, acridinyl, phenazinyl, phenothiazinyl, phenoxazinyl, tetrahydroquinolinyl, tetrahydroisoquinolinyl, pyrido[2,3-b]-l,4-oxazin-3(4H)-one, and benzoisoxazolyl. The term “heteroaryl” may be used interchangeably with the terms “heteroaryl ring”, “heteroaryl group”, or “heteroaromatic”, any of which terms include rings that are optionally substituted.
[0375] The term “heteroatom” means one or more of oxygen, sulfur, nitrogen, phosphorus, or silicon (including, any oxidized form of nitrogen, sulfur, phosphorus, or silicon; the quatemized form of any basic nitrogen or; a substitutable nitrogen of a heterocyclic ring, for example N (as in 3,4-dihydro-2H-pyrrolyl), NH (as in pyrrolidinyl) or NR+(as in N-substituted pyrrolidinyl)).
[0376] The terms “heterocycle”, “heterocyclyl”, “heterocyclic radical”, and “heterocyclic ring” are used interchangeably herein, and refer to a stable 3- to 8-membered monocyclic, a 7- to 12-membered bicyclic, or a 10- to 16-membered polycyclic heterocyclic moiety that is either saturated or partially unsaturated, and having, in addition to carbon atoms, one or more, such as one to four, heteroatoms, as defined above. When used in reference to a ring atom of a heterocycle, the term “nitrogen” includes a substituted nitrogen. As an example, in a saturated or partially unsaturated ring having 0-3 heteroatoms selected from oxygen, sulfur or nitrogen, the nitrogen may be N (as in 3,4-dihydro-2H-pyrrolyl), NH (as in pyrrolidinyl), or NR+(as in N-substituted pyrrolidinyl). A heterocyclic ring can be attached to its pendant group at any heteroatom or carbon atom that results in a stable structure and any of the ring atoms can be optionally substituted. Examples of such saturated or partially unsaturated heterocyclic radicals include, without limitation, azetidinyl, oxetanyl, tetrahydrofuranyl, tetrahydrothienyl, pyrrolidinyl, piperidinyl, decahydroquinolinyl, oxazolidinyl, piperazinyl, tetrahydropyranyl, dioxanyl, dioxolanyl, diazepinyl, oxazepinyl, thiazepinyl, morpholinyl, thiamorpholinyl, and
[0377] . A heterocyclyl group may be mono-, bi-, tri-, or polycyclic, preferably mono-, bi-, or tricyclic, more preferably mono- or bicyclic. The term “heterocyclylalkyl” refers to an alkyl group substituted by a heterocyclyl, wherein the alkyl and heterocyclyl portions independently are optionally substituted. A bicyclic heterocyclic ring also includes groups in which the heterocyclic ring is fused to one or more aryl, heteroaryl, or cycloaliphatic rings. Exemplary ATTORNEY DOCKET NO. 180802-47402 / PCT
[0378] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0379] bicyclic heterocyclic groups include indolinyl, isoindolinyl, benzodi oxolyl, 1,3-dihydroisobenzofuranyl, 2,3-dihydrobenzofuranyl, and tetrahydroquinolinyl. A bicyclic heterocyclic ring can also be a spirocyclic ring system (e.g., 7- to 11-membered spirocyclic fused heterocyclic ring having, in addition to carbon atoms, one or more heteroatoms as defined above (e.g., one, two, three or four heteroatoms)). A bicyclic heterocyclic ring can also be a bridged ring system (e.g., 7- to 11-membered bridged heterocyclic ring having one, two, or three bridging atoms.
[0380] The term “hairpin” as used herein describes a duplex of nucleic acids that is created when a nucleic acid strand folds and forms base pairs with another section of the same strand. A hairpin may form a structure that comprises a loop or a U-shape. In some embodiments, a hairpin may be comprised of an RNA loop. Hairpins can be formed with two complementary sequences in a single nucleic acid molecule bind together, with a folding or wrinkling of the molecule. In some embodiments, hairpins comprise stem or stem loop structures. In the context of modified gRNAs described herein, a “hairpin region” refers to hairpin 1 and hairpin 2 from the 5' end to the 3’ end of the gRNA. A conserved portion of a gRNA locates between hairpin 1 and hairpin 2 of the gRNA.
[0381] “Hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleobases. For example, adenine and thymine are complementary nucleobases that pair through the formation of hydrogen bonds.
[0382] By “increase” is meant a positive alteration of at least 10%, 25%, 50%, 75%, or 100%, or about 1.5 fold, about 2 fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 15-fold, about 20-fold, about 25-fold, about 30-fold, about 35-fold, about 40-fold, about 45-fold, about 50-fold, or about 100-fold. In embodiments, expression of a base editor described herein increases expression of functional galactose- 1 -phosphate uridylyltransferase (GALT) in a tissue of a subject. In embodiments, expression of a base editor described herein increases galactose- 1 -phosphate uridylyltransferase (GALT) activity in a tissue of a subject (e.g., in the liver).
[0383] The terms “inhibitor of base repair”, “base repair inhibitor”, “IBR” or their grammatical equivalents refer to a protein that is capable in inhibiting the activity of a nucleic acid repair enzyme, for example a base excision repair enzyme.
[0384] The terms “isolated,” “purified,” or “biologically pure” refer to material that is free to varying degrees from components which normally accompany it as found in its native state. “Isolate” denotes a degree of separation from original source or surroundings. “Purify” denotes a ATTORNEY DOCKET NO. 180802-47402 / PCT
[0385] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0386] degree of separation that is higher than isolation. A “purified” or “biologically pure” protein is sufficiently free of other materials such that any impurities do not materially affect the biological properties of the protein or cause other adverse consequences. That is, a nucleic acid or peptide of this disclosure is purified if it is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized. Purity and homogeneity are typically determined using analytical chemistry techniques, for example, polyacrylamide gel electrophoresis or high performance liquid chromatography. The term “purified” can denote that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. For a protein that can be subjected to modifications, for example, phosphorylation or glycosylation, different modifications may give rise to different isolated proteins, which can be separately purified.
[0387] By “isolated polynucleotide” is meant a nucleic acid molecule that is free of the genes which, in the naturally-occurring genome of the organism from which the nucleic acid molecule of the disclosure is derived, flank the gene. The term therefore includes, for example, a recombinant DNA that is incorporated into a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or that exists as a separate molecule (for example, a cDNA or a genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) independent of other sequences. In addition, the term includes an RNA molecule that is transcribed from a DNA molecule, as well as a recombinant DNA that is part of a hybrid gene encoding additional polypeptide sequence.
[0388] By an “isolated polypeptide” is meant a polypeptide of the disclosure that has been separated from components that naturally accompany it. Typically, the polypeptide is isolated when it is at least 60%, by weight, free from the proteins and naturally-occurring organic molecules with which it is naturally associated. In embodiments, the preparation is at least 75%, at least 90%, or at least 99%, by weight, a polypeptide of the disclosure. An isolated polypeptide of the disclosure may be obtained, for example, by extraction from a natural source, by expression of a recombinant nucleic acid encoding such a polypeptide; or by chemically synthesizing the protein. Purity can be measured by any appropriate method, for example, column chromatography, polyacrylamide gel electrophoresis, or by HPLC analysis.
[0389] The term “linker”, as used herein, refers to a molecule that links two moieties. In one embodiment, the term “linker” refers to a covalent linker (e.g., covalent bond) or a non-covalent linker. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0390] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0391] As used herein, a “lipid nanoparticle (LNP) composition” is a nanoparticle composition comprising one or more ionizable lipids. Exemplary LNP compositions include, but are not limited to, phospholipids, conjugate-linker lipids (e.g., PEG-lipids), sterols, and ionizable lipids.
[0392] The term “loop” as used herein describes a region of nucleotides that do not base pair (i.e., are not complementary) that may cap a stem. For example, a “tetraloop” describes a loop of 4 nucleotides. In some embodiments, the upper stem of a modified gRNA may comprise a tetraloop. In some embodiments, GAAA is the GNR. A tetraloop that is used to lock the structure of the hairpin. In some embodiments, the tetraloop is ANYA, CUYG, GNRA, UNAC, or UNCG.
[0393] By “marker” is meant any protein or polynucleotide having an alteration in expression, level, structure, or activity that is associated with a disease or disorder. In an embodiment, the level of galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) in a sample (e.g., blood, serum, plasma, tissue) is a marker of galactosemia. In some embodiments, an aberrant glycome relative to a healthy subject or cell is a marker of galactosemia.
[0394] The term “mutation,” as used herein, refers to a substitution of a residue within a sequence, e.g., a nucleic acid or amino acid sequence, with another residue, or a deletion or insertion of one or more residues within a sequence. Mutations are typically described herein by identifying the original residue followed by the position of the residue within the sequence and by the identity of the newly substituted residue. Various methods for making the amino acid substitutions (mutations) provided herein are well known in the art, and are provided by, for example, Green and Sambrook, Molecular Cloning: A Laboratory Manual (4thed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y. (2012)).
[0395] As used herein, “N: P ratio” is the molar ratio of ionizable nitrogen atoms in a lipid(s) to phosphate groups in a nucleic acid molecular entity(ies). In some embodiments, the lipids are lipids of a lipid nanoparticle and the nucleic acid molecular entity(ies) is a payload of the lipid nanoparticle. In various embodiments the N: P ratio is about or at least about 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1, or 10:1. In various embodiments the N: P ratio less than about 2:1, 3:1, 4:1, 5:1, 6:1, 7:1, 8:1, 9:1, or 10:1. In some embodiments, the nitrogen atoms are ionizable within a physiological pH range. In some embodiments, the molecular entities in a nanoparticle composition include a lipid component and an RNA. Ionizable nitrogen atoms can include, for example, nitrogen atoms that can be protonated at about pH 1, about pH 2, about pH 3, about pH 4, about pH 4.5, about pH 5, about pH 5.5, about pH 6, about pH 6.5, about pH 7, about pH 7.5, or about pH 8 or higher. The physiological pH range can include, for example, the pH range of different cellular compartments (such as organs, tissues, and cells) and bodily fluids (such as blood, CSF, gastric juice, milk, bile, saliva, tears, and urine). In certain specific embodiments, ATTORNEY DOCKET NO. 180802-47402 / PCT
[0396] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0397] the physiological pH range refers to the pH range of blood in a mammal, for example, from about 7.35 to about 7.45. Similarly, for phosphate charge neutralizers that have one or more ionizable nitrogen atoms, the N / P ratio can refer to a molar ratio of ionizable nitrogen atoms in the phosphate charge neutralizer to the phosphate groups in a nucleic acid. In some embodiments, ionizable nitrogen atoms refer to those nitrogen atoms that are ionizable within a pH range between 5 and 14.
[0398] For a payload that does not contain a phosphate group, the N: P ratio can refer to a molar ratio of ionizable nitrogen atoms in a lipid to the total negative charge in the payload. For example, the N: P ratio of an LNP composition can refer to a molar ratio of the total ionizable nitrogen atoms in the LNP composition to the total negative charge in the payload that is present in the composition.
[0399] As used herein, the term “nanoparticle” refers to a particle having a diameter of less than 1000 nanometers (nm). In some embodiments, a nanoparticle has a diameter of less than 300 nm, as defined by the National Science Foundation. In some embodiments, a nanoparticle has a diameter of less than 100 nm as defined by the National Institutes of Health. In some embodiments, nanoparticles are micelles in that they comprise an enclosed compartment, separated from the bulk solution by a micellar membrane, typically comprised of amphiphilic entities which surround and enclose a space or compartment (e.g., to define a lumen). In some embodiments, a micellar membrane is comprised of at least one polymer, such as for example a biocompatible and / or biodegradable polymer. In some embodiments, lipid nanoparticles described herein can have an average hydrodynamic diameter from about 30 to about 170 nm. In some embodiments, lipid nanoparticles described herein can have an average hydrodynamic diameter that is about 30 nm, 35 nm, 40 nm, 45 nm, 50 nm, 55 nm, 60 nm, 65 nm, 70 nm, 75 nm, 80 nm, 85 nm, 90 nm, 95 nm, 100 nm, 105 nm, 110 nm, 115 nm, 120 nm, 125 nm, 130 nm, 135 nm, 140 nm, 145 nm, 150 nm, 155 nm, 160 nm, 165 nm, 170 nm, 170 nm, 175 nm, 180 nm, 185 nm, 190 nm, 195 nm, 200 nm, 205 nm, 210 nm, 215 nm, 220 nm, or any range having endpoints defined by any two of the aforementioned values. For example, in some embodiments, lipid nanoparticles described herein have an average hydrodynamic diameter from between 50 nm to 100 nm. For example, in some embodiments, compositions, preparations, nanoparticles, and / or nanomaterials described herein have an average hydrodynamic diameter from between 50 nm to 90 nm. In some embodiments, lipid nanoparticles described herein have an average hydrodynamic diameter from about 60 to about 80 nm. For example, in some embodiments, lipid nanoparticles described herein have an average hydrodynamic diameter from between 50 nm to 70 nm. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0400] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0401] The terms “nucleic acid” and “nucleic acid molecule,” as used herein, refer to a compound comprising a nucleobase and an acidic moiety, e.g., a nucleoside, a nucleotide, or a polymer of nucleotides. Typically, polymeric nucleic acids, e.g., nucleic acid molecules comprising three or more nucleotides are linear molecules, in which adjacent nucleotides are linked to each other via a phosphodiester linkage. In some embodiments, “nucleic acid” refers to individual nucleic acid residues (e.g., nucleotides and / or nucleosides). In some embodiments, “nucleic acid” refers to an oligonucleotide chain comprising three or more individual nucleotide residues. As used herein, the terms “oligonucleotide” and “polynucleotide” can be used interchangeably to refer to a polymer of nucleotides (e.g, a string of at least three nucleotides). In some embodiments, “nucleic acid” encompasses RNA as well as single and / or doublestranded DNA. Nucleic acids may be naturally occurring, for example, in the context of a genome, a transcript, an mRNA, tRNA, rRNA, siRNA, snRNA, a plasmid, cosmid, chromosome, chromatid, or other naturally occurring nucleic acid molecule. On the other hand, a nucleic acid molecule may be a non-naturally occurring molecule, e.g, a recombinant DNA or RNA, an artificial chromosome, an engineered genome, or fragment thereof, or a synthetic DNA, RNA, DNA / RNA hybrid, or including non-naturally occurring nucleotides or nucleosides.
[0402] Furthermore, the terms “nucleic acid,” “DNA,” “RNA,” and / or similar terms include nucleic acid analogs, e.g., analogs having other than a phosphodiester backbone. Nucleic acids can be purified from natural sources, produced using recombinant expression systems and optionally purified, chemically synthesized, etc. Where appropriate, e.g., in the case of chemically synthesized molecules, nucleic acids comprise nucleoside analogs such as analogs having chemically modified bases or sugars, and backbone modifications. A nucleic acid sequence is presented in the 5' to 3' direction unless otherwise indicated. In some embodiments, a nucleic acid is or comprises natural nucleosides (e.g. adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine); nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-methyl adenosine, 5-methylcytidine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deazaadenosine, 7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, O(6)-methylguanine, and 2 -thiocytidine); chemically modified bases; biologically modified bases (e.g., methylated bases); intercalated bases; modified sugars (e.g., 2'-fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose); and / or modified phosphate groups (e.g., phosphorothioates and 5'-A-phosphoramidite linkages). In some cases, a guide RNA of the present disclosure includes a modification selected from 2'-O-methyl modified nucleotides (e.g., 2'-O-methyl 3'-phosphorothioate (MS) nucleotide, ATTORNEY DOCKET NO. 180802-47402 / PCT
[0403] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0404] 2'-O-methyl 3 '-thioPACE (MSP) nucleotide), 2’-F modified nucleotides, locked nucleic acid, MOE (methoxy ethyl), DNA nucleotides functionalized for conjugation, and a combination thereof.
[0405] The term “nuclear localization sequence,” “nuclear localization signal,” or “NLS” refers to an amino acid sequence that promotes import of a protein into the cell nucleus. Nuclear localization sequences are known in the art and described, for example, in Plank et al., International PCT application, PCTZEP2000 / 011690, filed November 23, 2000, published as W 0 / 2001 / 038547 on May 31, 2001, the contents of which are incorporated herein by reference for their disclosure of exemplary nuclear localization sequences. In other embodiments, the NLS is an optimized NLS described, for example, by Koblan etal., Nature Biotech. 2018 doi:10.1038 / nbt.4172. In some embodiments, an NLS comprises the amino acid sequence KRTADGSEFESPKKKRKV (SEQ ID NO: 190), KRPAATKKAGQAKKKK (SEQ ID NO: 191), KKTELQTTNAENKTKKL (SEQ ID NO: 192), KRGINDRNFWRGENGRKTR (SEQ ID NO: 193), RKSGKIAAIVVKRPRK (SEQ ID NO: 194), PKKKRKV (SEQ ID NO: 195), MDSLLMNRRKFLYQFKNVRWAKGRRETYLC (SEQ ID NO: 196), PKKKRKVEGADKRTADGSEFESPKKKRKV (SEQ ID NO: 328), or RKSGKIAAIVVKRPRKPKKKRKV (SEQ ID NO: 329).
[0406] The term “nucleobase,” “nitrogenous base,” or “base,” used interchangeably herein, refers to a nitrogen-containing biological compound that forms a nucleoside, which in turn is a component of a nucleotide. The ability of nucleobases to form base pairs and to stack one upon another leads directly to long-chain helical structures such as ribonucleic acid (RNA) and deoxyribonucleic acid (DNA). Five nucleobases - adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U) - are called primary or canonical. Adenine and guanine are derived from purine, and cytosine, uracil, and thymine are derived from pyrimidine. DNA and RNA can also contain other (non-primary) bases that are modified. Non-limiting exemplary modified nucleobases can include hypoxanthine, xanthine, 7-methylguanine, 5,6-dihydrouracil, 5-methylcytosine (m5C), and 5-hydromethylcytosine. Hypoxanthine and xanthine can be created through mutagen presence, both of them through deamination (replacement of the amine group with a carbonyl group). Hypoxanthine can be modified from adenine. Xanthine can be modified from guanine. Uracil can result from deamination of cytosine. A “nucleoside” consists of a nucleobase and a five carbon sugar (either ribose or deoxyribose). Examples of a nucleoside include adenosine, guanosine, uridine, cytidine, 5-methyluridine (m5U), deoxyadenosine, deoxyguanosine, thymidine, deoxyuridine, and deoxycytidine. Examples of a nucleoside with a modified nucleobase includes inosine (I), xanthosine (X), 7-methylguanosine (m7G), ATTORNEY DOCKET NO. 180802-47402 / PCT
[0407] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0408] dihydrouridine (D), 5-methylcytidine (m5C), and pseudouridine (T). A “nucleotide” consists of a nucleobase, a five carbon sugar (either ribose or deoxyribose), and at least one phosphate group. Non-limiting examples of modified nucleobases and / or chemical modifications that a modified nucleobase may include are the following: pseudo-uridine, 5-Methyl-cytosine, 2'-O-methyl-3'-phosphonoacetate, 2'-O-methyl thioPACE (MSP), 2'-O-methyl-PACE (MP), 2'-fluoro RNA (2'-F-RNA), constrained ethyl (S-cEt), 2'-O-methyl (‘M’), 2'-O-methyl-3'-phosphorothioate (‘MS’), 2'-O-methyl-3'-thiophosphonoacetate (‘MSP’), 5-methoxyuridine, phosphorothioate, and N1 -Methylpseudouridine.
[0409] The term “nucleic acid programmable DNA binding protein” or “napDNAbp” may be used interchangeably with “polynucleotide programmable nucleotide binding domain” to refer to a protein that associates with a nucleic acid (e.g., DNA or RNA), such as a guide nucleic acid or guide polynucleotide (e.g., gRNA), that guides the napDNAbp to a specific nucleic acid sequence. In some embodiments, the polynucleotide programmable nucleotide binding domain is a polynucleotide programmable DNA binding domain. In some embodiments, the polynucleotide programmable nucleotide binding domain is a polynucleotide programmable RNA binding domain. In some embodiments, the polynucleotide programmable nucleotide binding domain is a Cas9 protein. In some embodiments, the polynucleotide programmable nucleotide binding domain is an SpRY, an SpRYc, an saCas9, an spCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from one or more of: NG, NRRH, and NRTH, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. A Cas9 protein can associate with a guide RNA that guides the Cas9 protein to a specific DNA sequence that is complementary to the guide RNA. In some embodiments, the napDNAbp is a Cas9 domain, for example a nuclease active Cas9, a Cas9 nickase (nCas9), or a nuclease inactive Cas9 (dCas9). Non-limiting examples of nucleic acid programmable DNA binding proteins include, Cas9 (e.g., dCas9 and nCas9), Casl2a / Cpfl, Casl2b / C2cl, Casl2c / C2c3, Casl2d / CasY, Casl2e / CasX, Casl2g, Casl2h, Casl2i, and Casl2j / Cas (Casl2j / Casphi). Non-limiting examples of Cas enzymes include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas5d, Cas5t, Cas5h, Cas5a, Cas6, Cas7, Cas8, Cas8a, Cas8b, Cas8c, Cas9 (also known as Csnl or Csxl2), CaslO, CaslOd, Casl2a / Cpfl, Casl2b / C2cl, Casl2c / C2c3, Casl2d / CasY, Casl2e / CasX, Casl2g, Casl2h, Casl2i, Casl2j / Cas, Cpfl, Csyl, Csy2, Csy3, Csy4, Csel, Cse2, Cse3, Cse4, Cse5e, Cscl, Csc2, Csa5, Csnl, Csn2, Csml, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, CsxlS, Csxll, Csfl, Csf2, CsO, Csf4, Csdl, Csd2, Cstl, Cst2, Cshl, Csh2, Csal, Csa2, Csa3, Csa4, Csa5, Type II Cas effector proteins, Type V Cas ATTORNEY DOCKET NO. 180802-47402 / PCT
[0410] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0411] effector proteins, Type VI Cas effector proteins, CARF, DinG, homologues thereof, or modified or engineered versions thereof. Other nucleic acid programmable DNA binding proteins are also within the scope of this disclosure, although they may not be specifically listed in this disclosure. See, e.g., Makarova et al. “Classification and Nomenclature of CRISPR-Cas Systems: Where from Here?” CRISPRJ. 2018 Oct; 1:325-336. doi: 10.1089 / crispr.2018.0033; Yan eta / ., “Functionally diverse type V CRISPR-Cas systems” Science. 2019 Jan 4;363(6422):88-91. doi: 10.1126 / science.aav7271, the entire contents of each are hereby incorporated by reference.
[0412] Exemplary nucleic acid programmable DNA binding proteins and nucleic acid sequences encoding nucleic acid programmable DNA binding proteins are provided in the Sequence Listing as SEQ ID NOs: 197-231, 232-245, 254-257, 260, and 378. In some embodiments, the napDNAbp is the (CRISPR-associated system) Cas9 endonuclease, for example, Cas9 (Csnl) from Streptococcus pyogenes (e.g., SEQ ID NO: 197), Cas9 from Neisseria meningitidis (NmeCas9; SEQ ID NO: 208), Nme2Cas9 (SEQ ID NO: 209), Streptococcus constellatus (ScoCas9), or derivatives thereof (e.g., a sequence with at least about 85% sequence identity to a Cas9, such as Nme2Cas9 or spCas9). Further non-limiting examples of nucleic acid programmable DNA binding proteins include those disclosed or referenced in Rufflow, et al., “Design of highly functional genome editors by modeling of the universe of CRISPR-Cas Sequences,” bioRxiv, posted April 22, 2024, doi: 10.1101 / 2024.04.22.590591, the disclosure of which is incorporated herein by reference in its entirety for all purposes, which were designed using artificial intelligence. In some embodiments, the napDNAbp is OpenCRISPR-1, or a variant thereof (e.g., a variant comprising a D10A amino acid alteration and / or lacking an N-terminal methionine). Further non-limiting examples of nucleic acid programmable DNA binding proteins include those disclosed in International Patent Application No.
[0413] PCT / US2019 / 047996.
[0414] “Patient in need thereof’ or “subject in need thereof’ is referred to herein as a patient diagnosed with, at risk or having, predetermined to have, or suspected of having a disease or disorder.
[0415] The terms “pathogenic mutation”, “pathogenic variant”, “disease causing mutation”, “disease causing variant”, “deleterious mutation”, or “predisposing mutation” refers to a genetic alteration or mutation that is associated with a disease or disorder or that increases an individual’s susceptibility or predisposition to a certain disease or disorder. In some embodiments, the pathogenic mutation comprises at least one wild-type amino acid substituted by at least one pathogenic amino acid in a protein encoded by a gene. In some embodiments, the pathogenic mutation is in a terminating region (e.g., stop codon). In some embodiments, the ATTORNEY DOCKET NO. 180802-47402 / PCT
[0416] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0417] pathogenic mutation is in a non-coding region (e.g., intron, promoter, etc.). In some embodiments, the pathogenic mutation is a Q188R amino acid alteration in a GALT polypeptide.
[0418] The term “pharmaceutically acceptable salt”, as used herein, refers to salts of such compounds that are appropriate for use in pharmaceutical contexts, / .<., salts which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of humans and lower animals without undue toxicity, irritation, allergic response and the like, and are commensurate with a reasonable benefit / risk ratio. Pharmaceutically acceptable salts are well known in the art. For example, S. M. Berge, et al. describes pharmaceutically acceptable salts in detail in J. Pharmaceutical Sciences, 66: 1-19 (1977).
[0419] As used herein, a “phospholipid” can refer to a lipid that includes a phosphate moiety and one or more carbon chains, such as unsaturated fatty acid chains. A phospholipid may include one or more multiple (e.g., double or triple) bonds. In some embodiments, a phospholipid may facilitate fusion to a membrane. For example, a cationic phospholipid may interact with one or more negatively charged phospholipids of a membrane (e.g., a cellular or intracellular membrane). Fusion of a phospholipid to a membrane may allow one or more elements of an LNP to pass through the membrane, i.e., delivery of the one or more elements to a cell.
[0420] The terms “protein”, “peptide”, “polypeptide”, and their grammatical equivalents are used interchangeably herein, and refer to a polymer of amino acid residues linked together by peptide (amide) bonds. A protein, peptide, or polypeptide can be naturally occurring, recombinant, or synthetic, or any combination thereof.
[0421] The term “fusion protein” as used herein refers to a hybrid polypeptide which comprises protein domains from at least two different proteins.
[0422] The term “recombinant” as used herein in the context of proteins or nucleic acids refers to proteins or nucleic acids that do not occur in nature but are the product of human engineering. For example, in some embodiments, a recombinant protein or nucleic acid molecule comprises an amino acid or nucleotide sequence that comprises at least one, at least two, at least three, at least four, at least five, at least six, or at least seven mutations as compared to any naturally occurring sequence.
[0423] By “reduces” is meant a negative alteration of at least 10%, 25%, 50%, 75%, or 100%. In embodiments, a base editor system of the invention reduces and / or normalizes levels of galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) in a biological sample of the subject.
[0424] By “reference” is meant a standard or control condition. In one embodiment, the reference is the level of galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) ATTORNEY DOCKET NO. 180802-47402 / PCT
[0425] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0426] present in the blood of a subject not having galactosemia. In another embodiment, the reference is the level of galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) present in the blood of a subject having galactosemia prior to treatment. In another embodiment, the reference is a number that describes an ideal range for galactose or galactose metabolites (e.g., galactitol or galactose- 1-P) levels in blood.. In other embodiments and without limitation, a reference is an untreated cell that is not subjected to a test condition, or is subjected to placebo or normal saline, medium, buffer, and / or a control vector that does not harbor a polynucleotide of interest. In embodiments, a reference is a base editor system containing a base editor polypeptide and / or guide polynucleotide that differs from that of a base editor system of interest.
[0427] A “reference sequence” is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset of or the entirety of a specified sequence; for example, a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence. For polypeptides, the length of the reference polypeptide sequence will generally be at least about 16 amino acids, at least about 20 amino acids, at least about 25 amino acids, about 35 amino acids, about 50 amino acids, or about 100 amino acids. For nucleic acids, the length of the reference nucleic acid sequence will generally be at least about 50 nucleotides, at least about 60 nucleotides, at least about 75 nucleotides, about 100 nucleotides or about 300 nucleotides or any integer thereabout or therebetween. In some embodiments, a reference sequence is a wild-type sequence of a protein of interest. In other embodiments, a reference sequence is a polynucleotide sequence encoding a wild-type protein.
[0428] By “RrA3f polypeptide” is meant a deaminase capable of deaminating cytidine in DNA and comprising an amino acid sequence with at least 85% identity to the following amino acid sequence:
[0429] MKPQIRDHRPNPMEAMYPHIFYFHFENLEKAYGRNETWLCFTVEI IKQYLPVPWKKGVFRNQVD PETHCHAEKCFLSWFCNNTLSPKKNYQVTWYTSWSPCPECAGEVAEFLAEHSNVKLTI YEARLY YFWDEDYQEGLRSLSEEGASVEIMDYEDFQYCWENFVYDDGEPFKRWKGLKYNFQSLERRLREI LQ (SEQ ID NO: 67), or a functional fragment thereof. In various embodiments, the RrA3f polypeptide may be inserted within a Cas9 polypeptide between amino acid positions 906 and 907 referenced to SEQ ID NO: 197 (where a base editor with a deaminase inserted at this position may be referred to as an “IBE34” base editor). In some embodiments, the RrA3f polypeptide lacks an N-terminal methionine.
[0430] By “RrA3f polynucleotide” is meant a polynucleotide encoding an RrA3f polypeptide. By “ Streptococcus pyogenes Cas9 (spCas9) polypeptide” is meant a nucleic acid programmable DNA binding domain (napDNAbp) comprising an amino acid sequence with at ATTORNEY DOCKET NO. 180802-47402 / PCT
[0431] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0432] least 85% identity to the following amino acid sequence:
[0433] MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRL KRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAY HEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTY NQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNF DLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSAS MIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMD GTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHS LLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFD SVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYA HLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTF KEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQ TTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINR LSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRK FDNLTKAERGGLS ELDKAGF I KRQLVETRQ I TKHVAQ I LDSRMNTKYDENDKL I RE VKVI TLKS KLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAK SEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLS MPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKG KSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLAS AGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRV ILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLD ATLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 197), or a functional fragment thereof. In some embodiments, the spCas9 polypeptide lacks an N-terminal methionine. In some embodiments, spCas9 polypeptide comprises alterations, as described herein. In some embodiments, spCas9 contains a D10A amino acid alteration. In some embodiments, the SpCas9 polypeptide further comprises the amino acid alterations LI 111R, DI 135V, G1218R, E1219F, A1322R, R1335V, and T1337R referenced to SEQ ID NO: 197 (i.e., is a “Nureki” SpCas9 variant) and has protospacer-adjacent motif (PAM) specificity for a PAM containing the nucleotide sequence NG, where “N” is A, C, T, or G. In some embodiments the SpCas9 polypeptide has PAM specificity for a PAM containing the nucleotide sequence NGG, where “N” is A, C, T, or G. In some embodiments, the SpCas9 polypeptide further comprises the amino acid alterations I322V, S409I, E427G, R654L, R753G, R1114G, DI 135N, VI 139A, DI 180G, E1219V, Q1221H, A1320V, and R1333K referenced to SEQ ID NO: 197 and has protospacer-adjacent motif (PAM) specificity for a PAM containing the nucleotide sequence NRRH, where ATTORNEY DOCKET NO. 180802-47402 / PCT
[0434] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0435] “N” is A, C, T, or G, “R” is A or G, and “H” is A, C, or T. In some embodiments, the SpCas9 polypeptide further comprises the amino acid alterations I322V, S409I, E427G, R654L, R753G, R1114G, D1135N, D1180G, G1218S, E1219V, Q1221H, P1249S, E1253K, P1321S, D1332G, and R1335L referenced to SEQ ID NO: 197 and has protospacer-adjacent motif (PAM) specificity for a PAM containing the nucleotide sequence NRTH, where “N” is A, C, T, or G, “R” is A or G, and “FT is A, C, or T.
[0436] By “ Streptococcus pyogenes Cas9 (spCas9) polynucleotide” is meant a polynucleotide encoding an spCas9 polypeptide.
[0437] By “ Staphylococcus aureus Cas9 (saCas9) polypeptide” is meant a nucleic acid programmable DNA binding domain (napDNAbp) comprising an amino acid sequence with at least 85% identity to the following amino acid sequence:
[0438] MKRNYILGLAIGITSVGYGI IDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRI QRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDT GNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQ LDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLY NALNDLNNLVITRDENEKLEYYEKFQI IENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGK PEFTNLKVYHDIKDITARKEI IENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQIS NLKGYTGTHNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSP VVKRS FIQSIKVINAII KKYGL PND HIE LARE KNS KDAQKM I NEMQKRNRQTNER I E E 11 RTT GKENAKYL I EKI KLHDMQEGKCL YS LEAI PLEDLLNNPFNYE VDH 11 PRS VS FDNS FNNKVLVK QEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKD FINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAED ALI IANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKD YKYSHRVDKKPNRELINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHH DPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYGNKLNAHLDITDD YPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQA EFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRI IKTIASKT QSIKKYSTDILGNLYEVKSKKHPQIIKKG (SEQ ID NO: 218), or a functional fragment thereof. In some embodiments, the saCas9 polypeptide lacks an N-terminal methionine. In some embodiments, saCas9 polypeptide comprises alterations, as described herein. In some embodiments, saCas9 contains a D10A amino acid alteration. In some embodiments the SaCas9 polypeptide has PAM specificity for a PAM containing the nucleotide sequence NNGRRT, where “N” is A, C, T, or G, and where “R” is A or G.
[0439] By “ Staphylococcus aureus Cas9 (saCas9) polynucleotide” is meant a polynucleotide encoding an saCas9 polypeptide. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0440] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0441] By “SpRY polypeptide” is meant a nucleic acid programmable DNA binding domain comprising an amino acid sequence with at least 85% identity to the following amino acid sequence:
[0442] MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRL KRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAY HEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTY NQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNF DLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSAS MIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMD GTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHS LLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFD SVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYA HLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTF KEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQ TTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINR LSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRK FDNLTKAERGGLS ELDKAGF I KRQLVETRQ I TKHVAQ I LDSRMNTKYDENDKL I RE VKVI TLKS KLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAK SEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLS MPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKG KSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELENGRKRMLAS AGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IEQISEFSKRV ILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLD ATLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 197) and comprises the following amino acid alterations referenced to SEQ ID NO: 197: A61R, LI 111R, DI 135L, SI 136W, G1218K, E1219Q, N1317R, A1322R, R1333P, R1335Q, and T1337R, or a functional fragment thereof In some embodiments, the SpRY polypeptide lacks an N-terminal methionine. In some embodiments, SpRY polypeptide comprises alterations, as described herein. SpRY is described in Walton, et al. “Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants,” Science, 368:290-296 (2020), the disclosure of which is hereby incorporated by reference in its entirety for all purposes. In some embodiments, an SpRY polypeptide is a nucleic acid programmable DNA binding protein with specificity for protospacer-adjacent motifs having the sequence NRN and / or NYN, where “N” represents A, C, T, or G, “R” represents a purine (A or G), and “Y” represents a pyrimidine (C or T). In some embodiments, SpRY can recognize ATTORNEY DOCKET NO. 180802-47402 / PCT
[0443] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0444] almost all PAMs. In some embodiments the SpRY polypeptide has PAM specificity for a PAM containing the nucleotide sequence NRN, where “N” is A, C, T, or G, and where “R” is A or G.
[0445] By “SpRY polynucleotide” is meant a polynucleotide encoding an SpRY polypeptide. By “nSpRY polypeptide” is meant an SpRY polypeptide that is a nickase. In various embodiments, an nSpRY polypeptide is an SpRY polypeptide comprising aDlOA amino acid alteration.
[0446] By “nSpRY polynucleotide” is meant a polynucleotide encoding an nSpRY polypeptide. By “SpRYc polypeptide” is meant a nucleic acid programmable DNA binding domain comprising an amino acid sequence with at least 85% identity to the following amino acid sequence:
[0447] MEKKYSIGLDIGTNSVGWAVITDDYKVPSKKFKVLGNTNRKSIKKNLMGALLFDSGETAEATRL KRTARRRYTRRKNRIRYLQEIFANEMAKLDDSFFQRLEESFLVEEDKKNERHPIFGNLADEVAY HRNYPTIYHLRKKLADSPEKADLRLIYLALAHI IKFRGHFLIEGKLNAENSDVAKLFYQLIQTY NQLFEESPLDEIEVDAKGILSARLSKSKRLEKLIAVFPNEKKNGLFGNI IALALGLTPNFKSNF DLTEDAKLQLSKDTYDDDLDELLGQIGDQYADLFSAAKNLSDAILLSDILRSNSEVTKAPLSAS MVKRYDEHHQDLALLKTLVRQQFPEKYAEIFKDDTKNGYAGYVGADKKLRKRSGKLATEEEFYK FIKPILEKMDGAEELLAKLNRDDLLRKQRTFDNGSIPHQIHLKELHAILRRQEEFYPFLKENRE KIEKILTFRIPYYVGPLARGNSRFAWLTRKSEEAITPWNFEEVVDKGASAQSFIERMTNFDEQL PNKKVLPKHSLLYEYFTVYNELTKVKYVTERMRKPEFLSGEQKKAIVDLLFKTNRKVTVKQLKE DYFKKIECFDSVEI IGVEDRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDRE MIEERLKTYAHLFDDKVMKQLKRRHYTGWGRLSRKMINGIRDKQSGKTILDFLKSDGFSNRNFM QLIHDDSLTFKEEIEKAQVSGQGDSLHEQIADLAGSPAIKKGILQTVKIVDELVKVMGHKPENI VIEMARENQTTTKGLQQSRERKKRIEEGIKELESQILKENPVENTQLQNEKLYLYYLQNGRDMY VDQELDINRLSDYDVDHIVPQSFIKDDSIDNKVLTRSVENRGKSDNVPSEEVVKKMKNYWRQLL NAKL I TQRKFDNLTKAERGGLS EADKAGF I KRQLVETRQ I TKHVAR I LDSRMNTKRDKNDKP I R EVKVITLKSKLVSDFRKDFQLYKVRDINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKV YDVRKMIAKSEQEIGKATAKRFFYSNIMNFFKTEVKLANGEIRKRPLIETNGETGEVVWNKEKD FATVRKVLAMPQVNIVKKTEVQTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSV LVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLI IKLPKYSLFELE NGRKRMLASAKQLQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEI IE QISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENI IHLFTLTRLGAPRAFKYFDTTIDPKQ YRSTKEVLDATLIHQSITGLYETRIDLSQLGGD (SEQ ID NO: 772), or a functional fragment thereof. In some embodiments, the SpRYc polypeptide lacks anN-terminal methionine. In some embodiments, SpRYc polypeptide comprises alterations, as described herein. In some embodiments, SpRYc contains a D10A amino acid alteration. In some embodiments the SpRYc ATTORNEY DOCKET NO. 180802-47402 / PCT
[0448] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0449] polypeptide has PAM specificity for a PAM containing the nucleotide sequence NNN, where “N” is A, C, T, or G.
[0450] By “SpRYc polynucleotide” is meant a polynucleotide encoding an SpRYc polypeptide. As used herein, the term “stem loop” describes a secondary structure of nucleotides that form a base-paired “stem” that ends in a loop of unpaired nucleic acids. A stem may be formed when two regions of the same nucleic acid strand are at least partially complementary in sequence when read in opposite directions.
[0451] The term “sterolyl,” as used herein, refers to a 17-membered fused polycyclic ring moiety that is either saturated or partially unsaturated and substituted with at least one hydroxyl group, and has a single point of attachment to the rest of the molecule at any substitutable carbon or oxygen atom. In some embodiments, a sterolyl group is a cholesterolyl group, or a variant or derivative thereof. In some embodiments, a cholesterolyl group is modified. In some embodiments, a cholesterolyl group is an oxidized cholesterolyl group (e.g., oxidized on the beta-ring structure or on the hydrocarbon tail structure). In some embodiments, a cholesterolyl group is an esterified cholesterolyl group. In some embodiments, a sterolyl group is a phytosterolyl group. Exemplary sterolyl groups include but are not limited to 25-hydroxycholesterolyl (25-OH), 20a-hydroxycholesterolyl (20a-OH), 27-hydroxycholesterolyl, 6-keto-5a-hydroxycholesterolyl, 7-ketocholesterolyl, 7P-hydroxycholesterolyl, 7a-hydroxycholesterolyl, 7P-25-dihydroxycholesterolyl, beta-sitosterolyl, stigmasterolyl, brassicasterolyl, and campesterolyl.
[0452] By “subject” or “patient” is meant a mammal, including, but not limited to, a human or non-human mammal. In embodiments, the mammal is a bovine, equine, canine, ovine, rabbit, rodent, nonhuman primate, or feline. In an embodiment, “patient” refers to a mammalian subject with a higher than average likelihood of developing a disease or a disorder. Exemplary patients can be humans, non-human primates, cats, dogs, pigs, cattle, cats, horses, camels, llamas, goats, sheep, rodents (e.g., mice, rabbits, rats, or guinea pigs) and other mammalians that can benefit from the therapies disclosed herein. Exemplary human patients can be male and / or female.
[0453] As described herein, compounds of this disclosure may be described as “substituted” or “optionally substituted”. That is, compounds may contain optionally substituted and / or substituted moieties. In general, the term “substituted,” whether preceded by the term “optionally” or not, means that one or more hydrogens of the designated moiety are replaced with a suitable substituent. “Substituted” applies to one or more hydrogens that are either
[0454] 'NH -FR1-j-R1explicit or implicit from the structure (e.g.,
[0455]
[0456] refers to at least
[0457]
[0458] ; and ATTORNEY DOCKET NO. 180802-47402 / PCT
[0459] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0460] 1 l^NH
[0461] refers to at least
[0462]
[0463] ,RJONH,R, or CAR). Unless otherwise indicated, an “optionally substituted” group may have a suitable substituent at each substitutable position of the group, and when more than one position in any given structure may be substituted with more than one substituent selected from a specified group, the substituent may be either the same or different at every position. Combinations of substituents envisioned by this disclosure are preferably those that result in the formation of stable or chemically feasible compounds. The term “stable,” as used herein, refers to compounds that are not substantially altered when subjected to conditions to allow for their production, detection, and, in certain embodiments, their recovery, purification, and use for one or more of the purposes disclosed herein. Groups described as being “substituted” preferably have between 1 and 4 substituents, more preferably 1 or 2 substituents. Groups described as being “optionally substituted” may be unsubstituted or be “substituted” as described above.
[0464] Suitable monovalent substituents include halogen; -(CH2)o4R°; -(CH2)o 4OR0; -0(CH2)o-4R°, -0-(CH2)O4C(O)ORO; -(CH2)O4CH(ORO)2; -(CH2)O4Ph, which may be substituted with R°; -(CH2)o40(CH2)o iPh which may be substituted with R°; -CH=CHPh, which may be substituted with R°; -(CH2)o-40(CH2)o-i-pyridyl which may be substituted with R°; -NO2; -CN; -N3; -(CH2)O4N(RO)2; -(CH2)O4N(RO)C(O)R°; -N(R°)C(S)R°; -(CH2)O-4N(RO)C(O)NRO2; -N(RO)C(S)NR°2; -(CH2)O4N(RO)C(O)OR°; -N(R°)N(R°)C(O)R°; -N(R°)N(R°)C(O)NR°2; -N(R°)N(R°)C(O)OR°; -(CH2)O4C(O)RO; -C(S)R°; -(CH2)O-4C(O)ORO; -(CH2)O4C(O)SRO; -(CH2)O ^C(O)OSiR°3; -(CH2)o4OC(O)R°; -OC(0)(CH2)o4SR0-, -SC(S)SR°; -(CH2)O4SC(O)R°; -(CH2)O4C(O)NRO2; -C(S)NRO2; -C(S)SR°; -SC(S)SR°, -(CH2)O4OC(O)NRO2; -C(O)N(OR°)R°; -C(O)C(O)R°; -C(O)CH2C(O)RO; -C(NOR°)R°; -(CH2)O4SSR°; -(CH2)O4S(O)2RO; -(CH2)O4S(O)2ORO; -(CH2)O4OS(O)2RO; -S(O)2NR°2; -(CH2)O4S(O)RC; -N(RO)S(O)2NR°2; -N(RO)S(O)2R°; -N(OR°)R°; -C(NH)NRO2; -P(O)2R°; -P(O)RO2; -OP(O)RO2; -OP(O)(ORO)2; -SiR°3; -OSiR°3; -(Ci4straight or branched alkylene)O-N(R°)2; or -(Ci-4straight or branched alkylene)C(O)O-N(R°)2, wherein each R° may be substituted as defined below and is independently hydrogen, C1-6 aliphatic, -CH2Ph, -0(CH2)o iPh, -CH2-(5-6 membered heteroaryl ring), or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, or, notwithstanding the definition above, two independent occurrences of R°, taken together with their intervening atom(s), form a 3-12-membered saturated, partially unsaturated, ATTORNEY DOCKET NO. 180802-47402 / PCT
[0465] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0466] or aryl mono- or bicyclic ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, which may be substituted as defined below.
[0467] Suitable monovalent substituents on R° (or the ring formed by taking two independent occurrences of R° together with their intervening atoms), are independently halogen, -(CH2)o-2R*, -(haloR*), -(CH2)O 2OH, -(CH2)o-2OR*, -(CH2)o-2CH(OR*)2; -O(haloR’), -CN, -N3, -(CH2)O2C(O)R*, -(CH2)O2C(O)OH, -(CH2)O2C(O)OR*, -(CH2)O2C(O)NH2, -(CH2)O-2C(O)NHR*, -(CH2)O2C(O)NR*2, -(CH2)O2SR*, -(CH2)O2SH, -(CH2)O2NH2, -(CH2)O2NHR*, -(CH2)O-2NR*2, -NO2, -SiR*3, -OSiR*3, -C(O)SR* -(Ci^i straight or branched alkylene)C(O)OR*, or -SSR* wherein each R* is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently selected from Ci-4 aliphatic, -CH2Ph, -0(CH2)o iPh, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur. Suitable divalent substituents on a saturated carbon atom of R° include =0 and =S.
[0468] Suitable divalent substituents include the following: =0, =S, =NNR*2, =NNHC(O)R*, =NNHC(O)OR*, =NNHS(O)2R*, =NR*, =N0R*, -O(C(R*2))23O-, or -S(C(R*2))23S-, wherein each independent occurrence of R* is selected from hydrogen, C1-6 aliphatic which may be substituted as defined below, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur. Suitable divalent substituents that are bound to vicinal substitutable carbons of an “optionally substituted” group include: -O(CR*2)23O-, wherein each independent occurrence of R* is selected from hydrogen, C1-6 aliphatic which may be substituted as defined below, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[0469] Suitable substituents on the aliphatic group of R* include halogen, -R*, -(haloR*), -OH, -OR*, -O(haloR’), -CN, -C(O)OH, -C(O)OR*, -NH2, -NHR*, -NR*2, or -NO2, wherein each R* is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently Ci 4 aliphatic, -CH2Ph, -0(CH2)o iPh, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[0470] In some embodiments, suitable substituents on a substitutable nitrogen include -R1', - Rt2, -C(O)Rt, -C(O)ORt, -C(O)C(O)Rt, -C(O)CH2C(O)Rt, -S(O)2Rt, -S(O)2NRt2, -C(S)NRt2, -C(NH)NRJ2, or -^R^S^)^; wherein each R1' is independently hydrogen, C1-6 aliphatic which may be substituted as defined below, unsubstituted -OPh, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently ATTORNEY DOCKET NO. 180802-47402 / PCT
[0471] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0472] selected from nitrogen, oxygen, or sulfur, or, notwithstanding the definition above, two independent occurrences of R1', taken together with their intervening atom(s) form an unsubstituted 3-12-membered saturated, partially unsaturated, or aryl mono- or bicyclic ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[0473] Suitable substituents on the aliphatic group of R:are independently halogen, -R*, -(haloR*), -OH, -OR*, -O(haloR’), -CN, -C(O)OH, -C(O)OR*, -NH2, -NHR*, -NR*2, or -NO2, wherein each R* is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently Ci 4 aliphatic, -CH2Ph, -0(CH2)o iPh, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[0474] By “substantially identical” is meant a polypeptide or nucleic acid molecule exhibiting at least 50% identity to a reference amino acid sequence. In one embodiment, a reference sequence is a wild-type amino acid or nucleic acid sequence. In another embodiment, a reference sequence is any one of the amino acid or nucleic acid sequences described herein. In one embodiment, such a sequence is at least about 60%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, 99.9%, or even 99.99%, identical at the amino acid level or nucleic acid level to the sequence used for comparison.
[0475] Nucleic acid molecules useful in the methods of the disclosure include any nucleic acid molecule that encodes a polypeptide of the disclosure or a functional fragment thereof. Such nucleic acid molecules need not be 100% identical with an endogenous nucleic acid sequence but will typically exhibit substantial identity. Polynucleotides having “substantial identity” to an endogenous sequence are typically capable of hybridizing with at least one strand of a doublestranded nucleic acid molecule. Nucleic acid molecules useful in the methods of the disclosure include any nucleic acid molecule that encodes a polypeptide of the disclosure or a functional fragment thereof. Such nucleic acid molecules need not be 100% identical with an endogenous nucleic acid sequence but will typically exhibit substantial identity. Polynucleotides having “substantial identity” to an endogenous sequence are typically capable of hybridizing with at least one strand of a double-stranded nucleic acid molecule. By “hybridize” is meant pair to form a double-stranded molecule between complementary polynucleotide sequences (e.g., a gene described herein), or portions thereof, under various conditions of stringency. (See, e.g., Wahl, G. M. and S. L. Berger (1987) Methods Enzymol. 152:399; Kimmel, A. R. (1987) Methods Enzymol. 152:507). ATTORNEY DOCKET NO. 180802-47402 / PCT
[0476] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0477] By “TadA*8.19” is meant a deaminase capable of deaminating adenine in DNA and containing an amino acid sequence with at least 85% identity to SEQ ID NO: 1 having the amino acid alterations V82S, Y123H, Y147R, and Q154R, or a functional fragment thereof. In some embodiments, TadA*8.19 lacks anN-terminal methionine.
[0478] By “TadA*8.19 polynucleotide” is meant a polynucleotide encoding a TadA*8.19 polypeptide.
[0479] By “TadA*8.20” is meant a deaminase capable of deaminating adenine in DNA and containing an amino acid sequence with at least 85% identity to SEQ ID NO: 1 having the amino acid alterations I76Y, V82S, Y123H, Y147R, and Q154R, or a functional fragment thereof. In some embodiments, TadA*8.20 lacks an N-terminal methionine.
[0480] By “TadA*8.20 polynucleotide” is meant a polynucleotide encoding a TadA*8.20 polypeptide.
[0481] By “T1.52 polypeptide” is meant a deaminase capable of deaminating cytidine in DNA and comprising an amino acid sequence with at least 85% identity to the following amino acid sequence and further comprising the amino acid alterations F6Y, E27H, I49K, I76W, D77G, V82T, R107C, G112H, A114C, G115M, H122G, Y123H, N127P, A142E, Y147R, Q154R: MSEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALR QGGLVMQNYRL IDATL YVTFE PCVMCAGAM IHSR I GRWFGVRNAKTGAAGS LMDVLHYPGMNH RVEITEGILADECAALLCYFFRMPRQVFNAQKKAQSSTD (SEQ ID NO: 1), or a functional fragment thereof. In various embodiments, the T1.52 polypeptide may be inserted within a Cas9 polypeptide between amino acid positions 906 and 907 referenced to SEQ ID NO: 197 (where a base editor with a deaminase inserted at this position may be referred to as an “IBE34” base editor). In some embodiments, the T1.52 polypeptide lacks an N-terminal methionine. A representative T1.52 polypeptide sequence is provided below:
[0482] SEVEYSHEYWMRHALTLAKRARDERHVPVGAVLVLNNRVIGEGWNRAKGLHDPTAHAEIMALRQ GGLVMQNYRLWGATLYTTFE PCVMCAGAM IHSR I GRVVFGVCNAKTHACMS LMDVLGHPGMPHR VEITEGILADECEALLCRFFRMPRRVFNAQKKAQSSTD (SEQ ID NO: 773)
[0483] By “T1.52 polynucleotide” is meant a polynucleotide encoding an T1.52 polypeptide. The term “target site” refers to a nucleotide sequence or nucleobase of interest within a nucleic acid molecule that is modified. In embodiments, the modification is deamination of a base. The deaminase can be a cytidine or an adenine deaminase.
[0484] As used herein, the terms “treat,” treating,” “treatment,” and the like refer to reducing or ameliorating a disorder and / or symptoms associated therewith or obtaining a desired pharmacologic and / or physiologic effect. It will be appreciated that, although not precluded, ATTORNEY DOCKET NO. 180802-47402 / PCT
[0485] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0486] treating a disorder or condition does not require that the disorder, condition or symptoms associated therewith be completely eliminated. In some embodiments, the effect is therapeutic, / .<., without limitation, the effect partially or completely reduces, diminishes, abrogates, abates, alleviates, reduces the intensity of, or cures a disease and / or adverse symptom attributable to the disease. In some embodiments, the effect is preventative, / .<., the effect protects or prevents an occurrence or reoccurrence of a disease or condition. To this end, the presently disclosed methods comprise administering a therapeutically effective amount of a composition as described herein.
[0487] By “uracil glycosylase inhibitor” or “UGI” is meant an agent that inhibits the uracil-excision repair system. Base editors comprising a cytidine deaminase convert cytosine to uracil, which is then converted to thymine through DNA replication or repair. In various embodiments, a uracil DNA glycosylase (UGI) prevent base excision repair which changes the U back to a C. In some instances, contacting a cell and / or polynucleotide with a UGI and a base editor prevents base excision repair which changes the U back to a C. An exemplary UGI comprises an amino acid sequence as follows:
[0488] >splP14739IUNGI_BPPB2 Uracil-DNA glycosylase inhibitor MTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDAPE YKPWALVIQDSNGENKIKML (SEQ ID NO: 231).
[0489] In some embodiments, the agent inhibiting the uracil-excision repair system is a uracil stabilizing protein (USP). See, e.g., WO 2022015969 Al, incorporated herein by reference.
[0490] As used herein, the term “vector” refers to a means of introducing a nucleic acid molecule into a cell, resulting in a transformed cell. Vectors include plasmids, transposons, phages, viruses, liposomes, lipid nanoparticles, and episomes.
[0491] Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50.
[0492] The recitation of a listing of chemical groups in any definition of a variable herein includes definitions of that variable as any single group or combination of listed groups. The recitation of an embodiment for a variable or aspect herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
[0493] All terms are intended to be understood as they would be understood by a person skilled in the art. Unless defined otherwise, all technical and scientific terms used herein have the same ATTORNEY DOCKET NO. 180802-47402 / PCT
[0494] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0495] meaning as commonly understood by one of ordinary skill in the art to which the disclosure pertains
[0496] In this application, the use of the singular includes the plural unless specifically stated otherwise. It must be noted that, as used in the specification, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. In this application, the use of “or” means “and / or” unless stated otherwise. Furthermore, use of the term “including” as well as other forms, such as “include”, “includes,” and “included,” is not limiting.
[0497] As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”) or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended. This wording indicates that specified elements, features, components, and / or method steps are present, but does not exclude the presence of other elements, features, components, and / or method steps. Any embodiments specified as “comprising” a particular component(s) or element(s) are also contemplated as “consisting of’ or “consisting essentially of’ the particular component(s) or element(s) in some embodiments. It is contemplated that any embodiment discussed in this specification can be implemented with respect to any method or composition of the present disclosure, and vice versa. Furthermore, compositions of the present disclosure can be used to achieve methods of the present disclosure.
[0498] The term “about” or “approximately” means within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, z.e., the limitations of the measurement system.
[0499] Reference in the specification to “some embodiments,” “an embodiment,” “one embodiment” or “other embodiments” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least some embodiments, but not necessarily all embodiments, of the present disclosures.
[0500] BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 provides a schematic diagram providing a description of the reaction catalyzed by galactose- 1 -phosphate uridylyltransferase (GALT) and the metabolic context of the reaction. In FIG. 1 the terms “Type I”, “Type II”, “Type III”, and “Type IV” indicate the galactosemia type caused by a pathogenic mutation in the indicated enzyme.
[0501] FIG. 2 provides a schematic diagram showing a region of the galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide targeted for base editing, where the target cytidine for ATTORNEY DOCKET NO. 180802-47402 / PCT
[0502] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0503] base editing is underlined. The nucleotide sequence of FIG. 2 is a reverse complement of the coding strand of the GALT polynucleotide. The nucleotide sequence in FIG. 2 is CCTGACACCCTTACCCGGCAGTGGGGGTGGGGGTTAGAACAGCCCAT (SEQ ID NO: 690).
[0504] FIG. 3 provides a schematic diagram showing a region of the GALT polynucleotide targeted for base editing, where the target cytidine (C) for base editing (“target”) and its complementary guanine (G) are highlighted and shown in lowercase highlighted text, a C and its complementary G corresponding to a synonymous bystander edit (“synonymous”) are shown in ITALICS UPPERCASE TEXT, a C and its complementary G corresponding to a splice site disruption bystander edit (“splice”) are shown in BOLD UPPERCASE TEXT, a C and its complementary G corresponding to a nonsynonymous bystander edit (“nonsynonymous”) are shown in BOLD ITALICIZED UPPERCASE TEXT, and nucleotides corresponding to an intron are shown in HIGHLIGHTED UPPERCASE TEXT. In FIG. 3, “sense” refers to the coding strand of a GALT polynucleotide, and “antisense” refers to the non-coding strand of the GALT polynucleotide. The two nucleotide sequences shown in FIG. 3, are CTAACCCCCACCCCCACTGCCgGGTAAGGGTGTCAG (SEQ ID NO: 691; lowercase g is complementary to the target C) and CTGACACCCTTACCcGGCAGTGGGGGTGGGGGTTAG (SEQ ID NO: 692; target C is shown in lowercase), which are reverse complements of one another.
[0505] FIG. 4 provides a stacked bar graph showing the allelic distribution of the indicated base edits combinations, which are described in part in FIG. 3, for fibroblast cells from two different donors (GM00440 and GM00441) (n=3 for cells from each donor) contacted with a base editor system containing a combination of an mRNA encoding a base editor and a guide RNA, as indicated along the x-axis. The sequences for the guide RNAs are provided in Table 1. The number following the hyphen after each guide RNA number represents a lot number. A description of the base editors encoded by each mRNA is provided in FIG. 4, where RrA3f, T1.52, or 6b identify the deaminase domain of each cytidine base editor (CBE), IBE34 indicates a base editor where the deaminase domain is inserted within the nucleic acid programmable DNA binding protein domain (napDNAbp) of the base editor, and the terms NNGRRT, NRTH, NG-Nureki, SpRY, SpRYc, NRRH, and NGG indicate the identity of the Cas9 variant serving as the napDNAbp of each base editor and / or the protospacer-adjacent motif (PAM) specificity of the Cas9 variant, where “N” indicates any nucleotide, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. Each stacked bar of FIG. 4 represents, from top to bottom, the alleles corresponding to the base edits “other”, “intronic or synonymous bystanders”, “splice_or_nonsynonymous”, “target_and_splice”, “target_and_nonsynonymous,”
[0506] “target and synonymous,” and “target alone”, where “intronic” indicates the base editing of an ATTORNEY DOCKET NO. 180802-47402 / PCT
[0507] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0508] intronic C that is not within a splice site or exon, “nonsynonymous bystanders” or “nonsynonymous” indicates base editing of a C that results in the change of an encoded amino acid, “synonymous bystanders” or “synonymous” indicates base editing of a C that results in a silent mutation, “splice” indicates base editing of a C in a splice site, “target” indicates base editing of the target C, and “target alone” indicates editing of only the target C, and “other” indicates remaining alleles that do not meet the criteria of any of the aforementioned categories. Each stacked bar represents means of from 6 biological replicates, and the bars represent standard deviation. The bars shown within the boxed region correspond to base editor systems containing the guide RNA gRNA9843, where the target site (“protospacer”) for gRNA9843 is surrounded by a dashed box within the sequence provided in FIG. 4. The nucleotide sequence shown in FIG. 4 is CTGACACCCTTACCcGGCAGTGGGGGTGGGGGTTAG (SEQ ID NO: 692). The numbers following the hyphen after each guide RNA identifier in FIG. 4 refer to lot numbers.
[0509] FIG. 5 provides a stacked bar graph showing the allelic distribution of the indicated base edits combinations, which are described in part in FIG. 3, for cells from two different donors (GM00440 and GM00441) (n=3 for cells from each donor) contacted with a base editor system containing a combination of an mRNA encoding a base editor and a guide RNA, as indicated along the x-axis. The sequences for the guide RNAs are provided in Table 1. The number following the hyphen after each guide RNA number represents a lot number. A description of the base editors encoded by each mRNA is provided in FIG. 5, where RrA3f, T1.52, or 6b identify the deaminase domain of each cytidine base editor (CBE), IBE34 indicates a base editor where the deaminase domain is inserted within the nucleic acid programmable DNA binding protein domain (napDNAbp) of the base editor, and the terms NNGRRT, NRTH, NG-Nureki, SpRY, SpRYc, NRRH, and NGG indicate the identity of the Cas9 variant serving as the napDNAbp of each base editor and / or the protospacer-adjacent motif (PAM) specificity of the Cas9 variant, where “N” indicates any nucleotide, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. Each stacked bar of FIG. 5 represents, from top to bottom, the alleles corresponding to the base edits “other”, “intronic or synonymous bystanders”, “splice_or_nonsynonymous”, “target_and_splice”, “target_and_nonsynonymous,”
[0510] “target and synonymous,” and “target alone”, where “intronic” indicates the base editing of an intronic C that is not within a splice site or exon, “nonsynonymous bystanders” or “nonsynonymous” indicates base editing of a C that results in the change of an encoded amino acid, “synonymous bystanders” or “synonymous” indicates base editing of a C that results in a silent mutation, “splice” indicates base editing of a C in a splice site, “target” indicates base ATTORNEY DOCKET NO. 180802-47402 / PCT
[0511] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0512] editing of the target C, and “target alone” indicates editing of only the target C, and “other” indicates remaining alleles that do not meet the criteria of any of the aforementioned categories. Each stacked bar represents means of from 6 biological replicates, and the bars represent standard deviation. The bars shown within the boxed region correspond to base editor systems containing the guide RNA gRNA9843, where the target site (“protospacer”) for gRNA9846 is surrounded by a dashed box within the sequence provided in FIG. 5. The nucleotide sequence shown in FIG. 5 is CTGACACCCTTACCcGGCAGTGGGGGTGGGGGTTAG (SEQ ID NO: 692). The numbers following the hyphen after each guide RNA identifier in FIG. 5 refer to lot numbers. FIG. 5 presents the same data as shown in FIG. 4.
[0513] FIG. 6 provides a bar graph presenting an alternative presentation of the data from FIGs.
[0514] 4 and 5 and showing maximum OT percent base editing for each base editor system used to transfect the fibroblasts from both donors (n=6). The x-axis of FIG. 6 identifies the base editor of the base editor system corresponding to each bar, where the base editors are described in FIGs. 4 and 5, and the guides corresponding to each bar are indicated by the letter above each bar. The sequences for the guide RNAs are provided in Table 1. The number following the hyphen after each guide RNA number represents a lot number.
[0515] FIG. 7 provides a bar graph presenting the same data as shown in FIG. 6, but where the bars are clustered by base editor.
[0516] FIG. 8 provides a stacked bar graph presenting an alternative presentation of data from FIGs. 4 and 5 where only data corresponding to fibroblasts from donor GM00440 (n=3) is shown. A description of the base edits represented in each stacked bar and of the base editor systems indicated along the x-axis corresponding to each stacked bar is provided in FIGs. 4 and 5.
[0517] FIG. 9 provides a stacked bar graph presenting an alternative presentation of data from FIGs. 4 and 5 where only data corresponding to fibroblasts from donor GM00441 (n=3) is shown. A description of the base edits represented in each stacked bar and of the base editor systems indicated along the x-axis corresponding to each stacked bar is provided in FIGs. 4 and 5.
[0518] FIG. 10 provides a stacked bar graph providing an alternative presentation of the same data as presented in FIG. 8 where the edits stacked within each bar are grouped differently. In FIG. 10, each stacked bar represents from top-to-bottom the alleles corresponding to the base edits “other”, “silent”, “bystander”, and “target”, where “silent” indicates a base edit that does not alter a splice site and that does not result in a missense mutation, “bystander” represents a base edit that results in a missense mutation, and “target” represents base editing of the target C. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0519] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0520] FIG. 11 provides a stacked bar graph providing an alternative presentation of the same data as presented in FIG. 9 where the edits stacked within each bar are grouped differently. In FIG. 11, each stacked bar represents from top-to-bottom the alleles corresponding to the base edits “other”, “silent”, “bystander”, and “target”, where “silent” indicates a base edit that does not alter a splice site and that does not result in a missense mutation, “bystander” represents a base edit that results in a missense mutation, and “target” represents base editing of the target C.
[0521] FIG. 12 provides a schematic diagram showing the hairpin structures of exemplary end-modified sgRNA, standard heavily modified sgRNA modification and a LONGEST gRNA design including 5 base pair in the upper stem of the gRNA. The chemical modified sgRNAs (end mod. and heavily mod.) include only 4 base pairs in the upper stem. The sequences disclosed in FIG. 12 correspond, in order of occurrence, to SEQ ID NOs: 694 to 695. The first 20 nucleotides from the 5’ end of the sgRNA sequences of FIG. 12 represent a spacer sequence, which in various embodiments may be replaced with any spacer of interest, where the length of the spacer may be greater than or less than the spacer listed in the sequences of FIG. 12. The nucleotides following the spacer sequence correspond to a scaffold sequence.
[0522] FIG. 13 provides a schematic diagram showing the hairpin structures of gRNA 4 (standard end-modifications), gRNA 5 (GOLD modification) and gRNA 6(LONGEST-GOLD modifications). The sequences disclosed in FIG. 13 correspond, in order of occurrence, to SEQ ID NOs: 696 to 697. The N’s at the 5’ end of the gRNA sequences of FIG. 13 represent a spacer sequence, which may be replaced with any spacer of interest, where the length of the spacer may be greater than or less than the spacer listed in the sequences of FIG. 13. The nucleotides following the spacer sequence correspond to a scaffold sequence.
[0523] FIG. 14 provides a plot showing maximum C-to-T (C> T) base-editing efficiency for bystander mutation installation in a wild-type GALT gene in Huh-7 cells from a wild-type donor (i.e., a donor containing a GALT gene encoding a GALT polypeptide having a Q at position 188 (Q188)) administered the indicated doses (i.e., lipid nanoparticle (LNP) concentrations) of lipid nanoparticles containing a base editor system (i.e., LNPL A or LNPL B).
[0524] FIG. 15 provides a plot showing maximum C-to-T (C> T) base-editing efficiency for altering codon 188 in a mutant GALT gene encoding an R such that the codon encodes a Q (i.e., correction of a Q188R mutation) in fibroblasts from human donor GM00441 administered the indicated doses (i.e., lipid nanoparticle (LNP) concentrations) of lipid nanoparticles containing a base editor system (i.e., LNPL A or LNPL B). Donor GM00441 contained a GALT gene encoding a GALT polypeptide having an R at position 188 (Q188R). ATTORNEY DOCKET NO. 180802-47402 / PCT
[0525] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0526] DETAILED DESCRIPTION
[0527] Provided herein are base editors and base editor systems for use in editing, modifying, or altering a target polynucleotide. In particular embodiments, a base editor or endonuclease of the present disclosure modifies a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide. In particular embodiments, a base editor of the invention deaminates a pathogenic nucleobase in a GALT polynucleotide to increase or restore activity of the GALT polypeptide encoded by the same.
[0528] The invention is based, at least in part, upon the discovery that base editor systems (e.g., base editor systems containing a cytidine base editor selected from CBE RrA3f NNGRRT, CBE RrA3f NGG, CBE RrA3f NRTH, CBE RrA3f NG - Nureki, CBE RrA3f SpRY, CBE RrA3F SpRYc, CBE RrA3F NRRH, CBE-T1.52 IBE34 NGG, and CBE6b NGG and a guide RNA containing a sequence selected from those listed in Table 1) may be used to correct a Q188R pathogenic mutation in a GALT polynucleotide.
[0529] In embodiments, the methods of the present disclosure include altering a GALT polynucleotide to restore or increase activity of a GALT polypeptide encoded thereby. For example, the base editors or base editor systems provided herein can be used for editing a nucleobase in an exon of the GALT polynucleotide. In some embodiments, the target sequence is or contains codon 188 of the GALT polynucleotide. In some embodiments, the deamination of an A or C nucleobase in the exon results in a restoration or increase of activity of a GALT polypeptide. In some embodiments, the subject has or has the potential to develop galactosemia.
[0530] In some instances, the methods of the present disclosure include modifying a GALT polynucleotide to increase levels or activity of the GALT polynucleotide and / or polypeptide. The alterations can be effected by a base editor system, such as those described herein.
[0531] In some embodiments, the present disclosure provides base editor systems that efficiently generate an intended mutation, such as an alteration of a nucleotide, in a nucleic acid molecule (e.g., a nucleic acid within a genome of a subject) without generating a significant number of unintended mutations, such as unintended point mutations. In some embodiments, an intended mutation is a mutation that is generated by a specific base editor (e.g., an adenosine base editor or a cytidine base editor) bound to a guide polynucleotide (e.g., gRNA) specifically designed to generate the intended mutation. In some embodiments, the intended mutation is an adenine (A) to guanine (G) point mutation within the coding region of a gene. In some embodiments, the intended mutation is a cytosine (C) to thymine (T) point mutation within the coding region of a gene. In some embodiments, the intended mutation is a mutation of a nucleobase corresponding to codon 188 of a GALT polynucleotide, where in some embodiments codon 188 encodes an ATTORNEY DOCKET NO. 180802-47402 / PCT
[0532] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0533] arginine (R) amino acid prior to base editing. In some instances, the intended mutation results in a R188Q amino acid alteration in the GALT polypeptide encoded by the GALT polynucleotide. In some embodiments, the intended mutation is a cytidine (C) to thymine (T) point mutation in codon 188 of the GALT polynucleotide. In some embodiments, the intended mutation is a missense mutation. In some cases, the intendent mutation results in the correction of a Q188R pathogenic mutation in a GALT polypeptide.
[0534] In some embodiments, any of the base editors provided herein are capable of generating a ratio of intended mutations to unintended mutations (e.g., intended point mutations: unintended point mutations) that is greater than 1: 1. In some embodiments, any of the base editors provided herein are capable of generating a ratio of intended mutations to unintended mutations (e.g., intended point mutations: unintended point mutations) that is at least 1.5: 1, at least 2: 1, at least 2.5: 1, at least 3: 1, at least 3.5: 1, at least 4: 1, at least 4.5: 1, at least 5: 1, at least 5.5: 1, at least 6: 1, at least 6.5: 1, at least 7: 1, at least 7.5: 1, at least 8: 1, at least 10: 1, at least 12: 1, at least 15: 1, at least 20: 1, at least 25: 1, at least 30: 1, at least 40: 1, at least 50: 1, at least 100: 1, at least 150: 1, at least 200: 1, at least 250: 1, at least 500: 1, or at least 1000: 1, or more.
[0535] In some embodiments, editing of a plurality of nucleobase pairs in one or more genes using the methods provided herein results in formation of at least one intended mutation. In some embodiments, the formation of the at least one intended mutation is in the exon of a disease-associated gene and results in a restoration or increase in activity of a polypeptide encoded by the disease-associated gene. It should be appreciated that multiplex editing can be accomplished using any method or combination of methods provided herein.
[0536] The present disclosure provides methods for the treatment of a subject diagnosed with a galactosemia. For example, in some embodiments, a method is provided that comprises administering to a subject having or having a propensity to develop galactosemia, an effective amount of a nucleobase editor (e.g., an adenosine deaminase base editor or a cytidine deaminase base editor) to effect an alteration in a GALT polynucleotide sequence, thereby reducing levels of galactose and / or galactose metabolites (e.g., galactitol or galactose- 1-P) in a biological sample of the subject.
[0537] GALACTOSEMIA
[0538] Galactosemia is an autosomal recessive disorder that results from a deficiency of any 1 of the 4 enzymes catalyzing the conversion of galactose to glucose: galactose- 1 -phosphate uridylyltransf erase (GALT; FIG. 1), galactokinase, uridine diphosphate galactose-4-epimerase, and galactose mutarotase. In some cases, galactosemia is caused by a Q188R mutation in GALT. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0539] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0540] GALT deficiency is the most common cause (-75% of all galactosemia patients) of galactosemia and is often referred to as classic galactosemia or Type I classic galactosemia (see, e.g., Elsas 2ndand Lai, “The molecular biology of galactosemia,” Genet Med, 1:40-48 (1998) and Wang, et al., “Molecular and biochemical basis of galactosemia,” Mol Genet Metab, 63:263-269 (1998)). About 2250 patients suffer from classic galactosemia in the United States as of 2024, and over 50 new patients are diagnosed each year. The complete or near-complete deficiency of GALT enzyme is life-threatening if left untreated. Complications in the neonatal period include failure to thrive, liver failure, sepsis, and death.
[0541] Galactosemia is treated by a galactose-restricted diet, which allows for rapid recovery from the acute symptoms and a generally good prognosis. Despite adequate treatment from an early age, individuals with galactosemia remain at increased risk for developmental delays, speech problems, and motor function abnormalities. Female patients with galactosemia are at increased risk for premature ovarian failure. Based upon reports by newborn screening programs, the frequency of classic galactosemia in the United States is approximately 1 in 30,000, although literature reports range from 1 in 10,000 to 1 in 60,000 live births.
[0542] Galactose- 1 -phosphate (Gal IP) accumulates in the erythrocytes of patients with galactosemia. The quantitative measurement of Gal IP is useful for monitoring compliance with dietary therapy. Gal IP is thought to be the causative factor for development of liver disease in these patients. Because of this, patients should maintain low levels and be monitored on a regular basis.
[0543] The method by which potentially affected individuals are detected may include the measurement of total galactose (galactose and Gal IP) and / or determining the activity of the GALT enzyme. The diagnosis of galactosemia is established by follow-up quantitative measurement of GALT enzyme activity. If enzyme level is less than 24.5 nmol / h / mg of hemoglobin, sequencing of the GALT gene is performed.
[0544] In various embodiments, galactosemia may be diagnosed using a genetic screen and / or using a galactosemia screen. In a galactosemia screen, blood levels of GALT are measured in a subject, where, in various embodiments, a level less than about 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 nmol / h / mg Hb selects a patient as having galactosemia, where one of skill in the art understands 1 nmol / h / mg indicates the amount of GALT that catalyzes the reaction of 1 nmol of galactose per hour per mg of hemoglobin.
[0545] Accordingly, the present disclosure provides compositions and methods for use in the treatment of galactosemia in a subject. In embodiments, the methods of the disclosure are associated with an increase in GALT activity in a tissue (e.g., a liver) of a subject. In various ATTORNEY DOCKET NO. 180802-47402 / PCT
[0546] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0547] embodiments, the GALT activity in a biological sample (e.g., a blood sample) from the subject is increased to a level that is greater than about 19 nmol / h / mg Hb, 20 nmol / h / mg Hb, 21, nmol / h / mg Hb, 22 nmol / h / mg Hb, 23 nmol / h / mg Hb, 24 nmol / h / mg Hb, 25 nmol / h / mg Hb, 26 nmol / h / mg Hb, 27 nmol / h / mg Hb, 29 nmol / h / mg Hb, 29 nmol / h / mg Hb, or 30 nmol / h / mg Hb. Methods for diagnosing galactosemia are known in the art and described; for example, Mayo Clinic Laboratories “Galactosemia Reflex, Blood,” Pediatric Catalog, pediatric.testcatalog.org / show / GCT, Accessed 11 November 2024.
[0548] In some embodiments, the methods of the disclosure result in decreased levels of circulating glactose-l-P (Gallp) in the blood of a subject. In some cases, the levels of Gallp in the blood of the subject is decreased to a level that is less than 10 mg / dL, 5 mg / dL, 4.9 mg / dL, 4.8 mg / dL, 4.7 mg / dL, 4.6 mg / dL, 4.5 mg / dL, 4.25 mg / dL, 4 mg / dL, 3 mg / dL, 2 mg / dL, 1 mg / dL, 0.9 mg / dL, 0.8 mg / dL, 0.7 mg / dL, 0.6 mg / dL, or 0.5 mg / dL.
[0549] EDITING OF TARGET GENES
[0550] To produce the gene edits described herein, cells (e.g., hepatocytes) are contacted with one or more guide RNAs and a nucleobase editor polypeptide (e.g., by transfecting the cells with an mRNA encoding a nucleobase editor polypeptide) comprising a nucleic acid programmable DNA binding protein (napDNAbp) and a cytidine deaminase or adenosine deaminase or comprising one or more deaminases with cytidine deaminase and / or adenosine deaminase activity (e.g., a “dual deaminase” which has cytidine and adenosine deaminase activity). In some embodiments, cells to be edited are contacted with at least one nucleic acid, wherein the at least one nucleic acid encodes one or more guide RNAs and a nucleobase editor polypeptide comprising a nucleic acid programmable DNA binding protein (napDNAbp) and a cytidine deaminase. In some embodiments, the gRNA comprises nucleotide analogs. In some instances, the gRNA is added directly to a cell. These nucleotide analogs can inhibit degradation of the gRNA from cellular processes.
[0551] In various instances, it is advantageous for a spacer sequence to include a 5' and / or a 3' “G” nucleotide. In some cases, for example, any spacer sequence or guide polynucleotide provided herein comprises or further comprises a 5' “G”, where, in some embodiments, the 5' “G” is or is not complementary to a target sequence. In some embodiments, the 5' “G” is added to a spacer sequence that does not already contain a 5' “G ” For example, it can be advantageous for a guide RNA to include a 5' terminal “G” when the guide RNA is expressed under the control of a U6 promoter or the like because the U6 promoter prefers a “G” at the transcription start site (see Cong, L. et al. “Multiplex genome engineering using CRISPR / Cas systems. Science ATTORNEY DOCKET NO. 180802-47402 / PCT
[0552] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0553] 339:819-823 (2013) doi: 10.1126 / science.l231143). In some cases, a 5' terminal “G” is added to a guide polynucleotide that is to be expressed under the control of a promoter but is optionally not added to the guide polynucleotide if or when the guide polynucleotide is not expressed under the control of a promoter.
[0554] Exemplary guide polynucleotide sequences and target site nucleotide sequences are provided in the following Table 1, and exemplary base editor sequences are provided in the following Table 2. Variants of the spacer sequences listed in Table 1 comprising 1, 2, 3, 4, or 5 nucleobase alterations are contemplated. For example, variation of a target polynucleotide sequence within a population (e.g., single nucleotide polymorphisms) may require said alterations to a spacer sequence to allow the spacer to better bind a variant of a target sequence in a subject.
[0555] Table 1 Exemplary guide polynucleotide sequences.
[0556] Guide Target Target Site (protospacer SEQ ID Spacer Sequence SEQ ID ID Site ID adjacent motifs (PAMs) are in NO NO bold)
[0557] gRNA9 TSBTxl CCCTTACCcGGCAGTGGGGGTG 698 CCCUUACCCGGCAG 699 828 1431 UGGGGG
[0558] gRNA9 TSBTxl CCTTACCcGGCAGTGGGGGTGG 700 CCUUACCCGGCAGU 701 829 1432 GGGGGU
[0559] gRNA9 TSBTxl CTTACCcGGCAGTGGGGGTGGG 702 CUUACCCGGCAGUG 703 830 1433 GGGGUG
[0560] gRNA9 TSBTxl TTACCcGGCAGTGGGGGTGGGG 704 UUACCCGG CAGUGG 705 831 1434 GGGUGG
[0561] gRNA9 TSBTxl TACCcGGCAGTGGGGGTGGGGG 706 UACCCGGCAGUGGG 707 832 1435 GGUGGG
[0562] gRNA9 TSBTxl CTGACACCCTTACCcGGCAGTG 708 CUGACACCCUUACC 709 833 1436 G CGGCAG
[0563] gRNA9 TSBTxl TGACACCCTTACCcGGCAGTGG 710 UGACACCCUUACCC 711 834 1437 G GGCAGU
[0564] gRNA9 TSBTxl GACACCCTTACCcGGCAGTGGG 712 GACACCCUUACCCG 713 835 1438 G GCAGUG
[0565] gRNA9 TSBTxl ACACCCTTACCcGGCAGTGGGG 714 ACACCCUUACCCGG 715 836 1439 G CAGUGG
[0566] gRNA9 TSBTxl CCCTTACCcGGCAGTGGGGGTG 716 CCCUUACCCGGCAG 717 837 1444 GGGGT UGGGGGU
[0567] gRNA9 TSBTxl ACCcGGCAGTGGGGGTGGGGGT 718 ACCCGGCAGUGGGG 719 838 1448 T GUGGGG
[0568] gRNA9 TSBTxl ACACCCTTACCcGGCAGTGGGG 720 ACACCCUUACCCGG 715 839 1439 G CAGUGG
[0569] gRNA9 TSBTxl CCCTTACCcGGCAGTGGGGGTG 721 CCCUUACCCGGCAG 699 840 1431 UGGGGG
[0570] gRNA9 TSBTxl CCTTACCcGGCAGTGGGGGTGG 722 CCUUACCCGGCAGU 701 841 1432 GGGGGU
[0571] gRNA9 TSBTxl CTGACACCCTTACCcGGCAGTG 723 CUGACACCCUUACC 709
[0572]
[0573] 842 1436 G CGGCAG ATTORNEY DOCKET NO. 180802-47402 / PCT
[0574] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0575] Guide Target Target Site (protospacer SEQ ID Spacer Sequence SEQ ID ID Site ID adjacent motifs (PAMs) are in NO NO bold)
[0576] gRNA9 TSBTxl CTTACCcGGCAGTGGGGGTGGG 724 CUUACCCGGCAGUG 703 843 1433 GGGGUG
[0577] gRNA9 TSBTxl GACACCCTTACCcGGCAGTGGG 725 GACACCCUUACCCG 713 844 1438 G GCAGUG
[0578] gRNA9 TSBTxl TGACACCCTTACCcGGCAGTGG 726 UGACACCCUUACCC 711 845 1437 G GGCAGU
[0579] gRNA9 TSBTxl TTACCcGGCAGTGGGGGTGGGG 727 UUACCCGGCAGUGG 705 846 1434 GGGUGG
[0580] gRNAl TTACCCGGCAGTGGGGGTG 795 UUACCCGGCAGUGG 793
[0581]
[0582] 0320 GGGUG
[0583] Table 1 (CONTINUED).1
[0584] Guide ID gRNA nucleotide sequence SEQ ID NO
[0585] gRNA9828 mCsmCsmCsUUACCCGGCAGUGGGGGGUUUUAGAGCUAGAAAUAGCAAGUUA 728
[0586] AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0587] gRNA9829 mCsmCsmUsUACCCGGCAGUGGGGGUGUUUUAGAGCUAGAAAUAGCAAGUUA 729 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0588] gRNA9830 mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAGCUAGAAAUAGCAAGUUA 730 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0589] gRNA9831 mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAGCUAGAAAUAGCAAGUUA 731
[0590] AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0591] gRNA9832 mUsmAsmCsCCGGCAGUGGGGGUGGGGUUUUAGAGCUAGAAAUAGCAAGUUA 732 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0592] gRNA9833 mCsmUsmGsACACC CUU AC C CGG C AGGUUUU AG AG CU AG AAAU AG C AAGUU A 733 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0593] gRNA9834 mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAGCUAGAAAUAGCAAGUUA 734 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0594] gRNA9835 mGsmAsmCsACCCUUACCCGGCAGUGGUUUUAGAGCUAGAAAUAGCAAGUUA 735 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0595] gRNA9836 mAs mC smAs CC CUUACC CGGC AGUGGGUUUUAG AG CU AG AAAU AG CAAGUUA 736 AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU smUsmUsU
[0596] gRNA9837 mCsmCsmCsUUACCCGGCAGUGGGGGUGUUUUAGUACUCUGUAAUGAAAAUU 737 ACAGAAUCUACUAAAACAAGGCAAAAUGCCGUGUUUAUCUCGUCAACUUGUU GGCGAGAUsmUsmUsmU
[0597] gRNA9838 mAsmCsmCsCGGCAGUGGGGGUGGGGGUUUUAGAGCUAGAAAUAGCAAGUUA 738
[0598] AAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmU
[0599]
[0600] smUsmUsU
[0601] 1“mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following (i.e., 3') nucleotide by a phosphorothioate (PS). ATTORNEY DOCKET NO. 180802-47402 / PCT
[0602] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0603] Guide ID gRNA nucleotide sequence SEQ ID NO
[0604] gRNA9839 mAsmCsmAsCCCUUACCCGGCAGUGGGUUUUAGAmGmCmCmGmGmCmGmGmA 739 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0605] gRNA9840 mCsmCsmCsUUACCCGGCAGUGGGGGGUUUUAGAmGmCmCmGmGmCmGmGmA 740 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0606] gRNA9841 mCsmCsmUsUACCCGGCAGUGGGGGUGUUUUAGAmGmCmCmGmGmCmGmGmA 741 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0607] gRNA9842 mC s mU smG s AC AC CCUUAC CCGG CAGGUUUUAG AmGmCmCmGmGmCmGmGmA 742 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0608] gRNA9843 mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmA 743 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0609] gRNA9844 mGsmAsmCsACCCUUACCCGGCAGUGGUUUUAGAmGmCmCmGmGmCmGmGmA 744 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0610] gRNA9845 mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAmGmCmCmGmGmCmGmGmA 745 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0611] gRNA9846 mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmA 746 mAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmU mUmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmU smUsmUsmU
[0612] gRNA 10320 mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAm 791 AmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUm
[0613] UmGmAmAmAmAmAmGmUmGmGmCmAmCmCmGmAmGmUmCmGmGmUmGmCmUs
[0614]
[0615] mUsmUsmU
[0616] Table 2: Exemplary base editor amino acid sequences.2
[0617] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO CBE-T1.52 MRNA74 MDKKYS I GLAI GTNS VGWAVI TDEYKVPSKKFKVLGNTDRHS I KKN 774
[0618]
[0619] IBE34 NGG 89 LIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLOEIFSNEMAK
[0620] 2Amino acid sequences corresponding to Cas9 domains are shown as DOUBLE UNDERLINED ALL CAPS TEXT, amino acid sequences corresponding to deaminase domains are shown as lowercase plain text, amino acid sequences corresponding to linkers are shown as ITALIC ALL CAPS TEXT, amino acid sequences corresponding to nuclear localization signals (NLS) are shown as UNDERLINED ALL CAPS TEXT, and amino acid sequences corresponding to uracil ATTORNEY DOCKET NO. 180802-47402 / PCT
[0621] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0622] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO VDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHL RKKLVDS TDKADLRL I YLALAHM I KFRGH FL I EGDLNPDNS DVDKL
[0623] F 10LV0TYN0L FEENP I NASGVDAKAI LS ARLS KS RRLENL I AOL P GEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLOLSKDTYDDDLD NLLAO I GDO YADL FLAAKNLS DAI LLS D I LRVNTE I TKAPLS ASM I KRYDEHHODLTLLKALVROOL PEKYKE I F FDOS KNGYAG Y I DGGAS OEEFYKFIKPI LEKMDGTEELLVKLNREDLLRKORTFDNGS I PHO I HLGELHAILRROEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNS RFAWMTRKSEETITPWNFEEWDKGASAOSFIERMTNFDKNLPNEK VLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEOKKAIVDLL FKTNRKVTVKOLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLL KI IKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFD DKVMKOLKRRR YTGWGRLS RKL I NG I RDKOSGKTI LD FLKS DG FAN RNFMOL I HDDS LTFKED I OKAOVSGOGDS LHEH I ANLAGS PAI KKG ILOTVKWDELVKVMGRHKPENIVIEMARENOTTOKGOKNSRERMK R I EEG I KELGS 01 LKEH PVENTOLONEKL YL YYLONGRDMYVDOEL DINRLSDYDVDHIVPOSFLKDDSIDNKVLTRSDKNRGKSDNVPSEE WKKMKNYWRQLLNAKL I TQRKFDNLTKAERGGSS GSETPGTSESA TPESSGseveysheywmrhaltlakrarderhvpvgavlvlnnrvi gegwnrakglhdptahaeimalrqgglvmqnyrlwgatlyttf epc vmcagamihsrigrvvf gvcnakthacmslmdvlghpgmphrveit egiladeceallcrf f rmprrvfnaqkkaqsstdGSSGSGSSGSGL SELDKAGFIKROLVETROITKHVAOILDSRMNTKYDENDKLIREVK VITLKSKLVSDFRKDFOFYKVREINNYHHAHDAYLNAWGTALIKK Y PKLE SE FVYGDYKVYDVRKM I AKS EOE I GKATAKYF FYSN I MNF F KTE I TLANGE I RKRPLI ETNGETGE I VWDKGRDFATVRKVLSMPOV N I VKKTE VOTGG F S KE S I L PKRNS DKL I ARKKD WD PKKYGG FD S PT VAYSVLWAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEA KGYKEVKKDLI IKLPKYSLFELENGRKRMLASAGELOKGNELALPS KYVNFLYLASHYEKLKGSPEDNEOKOLFVEOHKHYLDEI IEOISEF
[0624]
[0625] SKRVILADANLDKVLSAYNKHRDKPIREOAENI IHLFTLTNLGAPA
[0626] glycosylase inhibitor (UGI) domains are shown as BOLD ALL CAPS TEXT. The term “CBE” in each base editor name indicates a base editor containing a deaminase domain capable of deaminating cytidine in DNA (i.e., a “cytidine base editor”, where a base editor containing a “cytidine adenosine base editor” is considered a “cytidine base editor”). The terms “T1.52”, “6b”, “RrA3f ’ in each base editor name identifies the deaminase of the base editor. The terms “NGG”, “NG”, “NNGRRT”, “NNN”, “NRN”, “NRRH”, and “NRTH” in the base editor names indicate the protospacer-adjacent motif specificity of the Cas9 domain of the base editor, where “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T. The terms SpRYc, SpRY, and Nureki in the base editor names indicate the Cas9 domain of the base editor. The term “IBE34” in a base editor name indicates an internal base editor where the deaminase domain has been inserted within the Cas9 domain between amino acid positions 906 and 907 referenced to SEQ ID NO: 197. In some embodiments, the N-terminal methionine of the RrA3f domain is absent. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0627] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0628] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO AFKYFDTTI DRKRYTSTKEVLDATL I HOS I TGLYETR I DLSOLGGD SGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKP ESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKM LSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNK PESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIK MLEGADKRTADGSEFESPKKKRKV CBE6b MRNA71 MKRTADGSEFESPKKKRKVs s eve f sheywmrhal 11 akrardege 775 NGG 67 or apvgavlvlnnrvigegwvrriglhdptahaeimalrqgglvmqnp MRNA92 rlidatlyvtf epcvmcagaminsrigrvvf gvrnskrgaagslmn 78 vlnypgmnhrveitegiladecaallcdf yrmprqvf naqkkaqss lnSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKK. YSIGLAI. GT NSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETA EATRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS FFHRLEES F LVEEDKKHERHPI FGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADL RLI YLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIOLVOTYNOLFE ENPINASGVDAKAILSARLSKSRRLENLIAOLPGEKKNGLFGNLIA LSLGLTPNFKSNFDLAEDAKLOLSKDTYDDDLDNLLAOIGDOYADL FLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHODLTLL KALVROOLPEKYKEI FFDOSKNGYAGYIDGGASOEEFYKFIKPILE KMDGTEELLVKLNREDLLRKORTFDNGS I PHOI HLGELHAI LRROE DFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWMTRKSEETI TPWNFEEWDKGASAOSFIERMTNFDKNLPNEKVLPKHSLLYEYFT VYNELTKVKYVTEGMRKPAFLSGEOKKAIVDLLFKTNRKVTVKOLK EDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKDKDFLDNEE NED I LED I VLTLTLFEDREMI EERLKTYAHLFDDKVMKOLKRRRYT GWGRLSRKLINGIRDKOSGKTILDFLKSDGFANRNFMOLIHDDSLT FKE D 10 KAO VS GOGDSLHEHI ANLAGS PA I KKG I LOTVKWDE LVK VMGRHKPENIVIEMARENOTTOKGOKNSRERMKRIEEGIKELGSOI LKEHPVENTOLONEKLYLYYLONGRDMYVDOELDINRLSDYDVDHI VPOSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEWKKMKNYWROLL NAKLITORKFDNLTKAERGGLSELDKAGFIKROLVETROITKHVAO ILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFOFYKVREIN NYHHAHDAYLNAWGTAL I KKYPKLES E FVYGD YKVYDVRKM I AKS EOEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGE IVWDKGRDFATVRKVLSMPOVNIVKKTEVOTGGFSKESILPKRNSD KLIARKKDWDPKKYGGFDSPTVAYSVLWAKVEKGKSKKLKSVKEL LG I TI MERS S FEKNP I D FLEAKG YKEVKKDL 11 KL PKYS LFELENG RKRMLASAGELOKGNELALPSKYVNFLYLASHYEKLKGSPEDNEOK OLFVEOHKHYLDE 11 EO I SEFSKRVI LADANLDKVLS AYNKHRDKP I REOAEN 11 HL FTLTNLGAPAAFKY FDTT I DRKRYTS TKEVLDATL
[0629] I HOS I TGLYETRI DLSOLGGDSGGSGGSGGSTNLS DIIEKETGKQL VIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSDA PEYKPWALVIQDSNGENKIKMLSGGSGGSGGSTNLSDI IEKETGKQ LVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSD APEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFESPKKKRKV
[0630] RrA3fNG MRNA62 MKRTADGSEFESPKKKRKVmkpqirdhrpnpmeamyphi f y f hf en 776 (Nureki) 63 lekaygrnetwlcf tveiikqylpvpwkkgvf rnqvdpethchaek cf Iswf cnntlspkknyqvtwytswspcpecagevaef laehsnvk Itiytarlyyfwdtdyqeglrslseegasveimdyedfqycwenfv y ddgep f krwkgl kynf qsltrrlrei 1 qSGGSSGGSSGSETPGTS ESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKKF KVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS FFHRLEES FLVEEDKKHERHP I FGNI VD
[0631]
[0632] E VAYHEKYPTI YHLRKKLVDS TDKADLRL I YLALAHM I KFRGH FL I ATTORNEY DOCKET NO. 180802-47402 / PCT
[0633] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0634] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO EGDLNPDNSDVDKLFIOLVOTYNOLFEENPINASGVDAKAILSARL SKSRRLENLIAOLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDA KLOLS KDTYDDDLDNLLAO I GDO YADL FLAAKNLS DAI LLS D I LRV NTE I TKAPLSASM I KRYDEHHODLTLLKALVROOLPEKYKE I FFDO SKNGYAGYIDGGASOEEFYKFIKPILEKMDGTEELLVKLNREDLLR KORTFDNGSIPHOIHLGELHAILRROEDFYPFLKDNREKIEKILTF RIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEWDKGASAOSFI ERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPA FLSGEOKKAIVDLLFKTNRKVTVKOLKEDYFKKIECFDSVEISGVE DRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDRE M I EERLKTYAHLFDDKVMKOLKRRR YTGWGRLS RKL I NG I RDKOSG KTILDFLKSDGFANRNFMOLIHDDSLTFKEDIOKAOVSGOGDSLHE H I ANLAGS PAI KKGI LOTVKVVDELVKVMGRHKPENI VI EMARENO TTOKGOKNS RERMKR I EEG I KELGS 01 LKEH PVENTOLONEKL YL Y YLONGRDMYVDOELDINRLSDYDVDHIVPOSFLKDDSIDNKVLTRS DKNRGKSDNVPSEEWKKMKNYWROLLNAKLITORKFDNLTKAERG GLSELDKAGFIKROLVETROITKHVAOILDSRMNTKYDENDKLIRE VKVITLKSKLVSDFRKDFOFYKVREINNYHHAHDAYLNAWGTALI KKY PKLE SE FVYGDYKVYDVRKM I AKS EOE I GKATAKYF FYSN I MN FFKTE ITLANGEI RKRPLI ETNGETGE I VWDKGRDFATVRKVLSMP
[0635] 0 VN I VKKTE VOTGG F S KE S I R PKRNS DKL I ARKKD WD PKKYGG FVS PTVAYSVLWAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFL EAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASARFLOKGNELAL PSKYVNFLYLASHYEKLKGSPEDNEOKOLFVEOHKHYLDEI IEOIS E FS KRVI LADANLDKVLS AYNKHRDKP I REOAENI I HLFTLTNLGA PRAFKYFDTTI DRKVYRSTKEVLDATL I HOS I TGLYETR I DLSOLG GDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGN KPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKI KMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIG NKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENK I KMLSGGSKRTADGS E FES PKKKRKV
[0636] RrA3fNGG MRNA61 mkpqi rdhrpnpmeamyphi f y f hf enl ekaygrne t wl c f t ve i i 777
[0637] 60 kqy Ipvpwkkgvf rnqvdpe t hchaekc f 1 s wf cnnt 1 spkknyqv twytswspcpecagevaef laehsnvkltiytarlyyfwdtdyqeg Irslseegasveimdyedf qycwenfvyddgepf krwkglkynf qs Itrrlreil CjSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKY SIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAEATRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS F FHRLEESFLVEEDKKHERHPI FGNIVDEVAYHEKYPTIYHLRKKLV DSTDKADLRLI YLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIOLV OTYNOLFEENPINASGVDAKAILSARLSKSRRLENLIAOLPGEKKN GLFGNLIALSLGLTPNFKSNFDLAEDAKLOLSKDTYDDDLDNLLAO IGDOYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDE HHODLTLLKALVROOLPEKYKEI FFDOSKNGYAGYIDGGASOEEFY KFI KP I LEKMDGTEELLVKLNREDLLRKORTFDNGS I PHOI HLGEL HAILRROEDFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWM TRKSEET I TPWNFEE WDKGAS AOS F I ERMTNFDKNL PNEKVL PKH SLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEOKKAIVDLLFKTNR KVTVKOLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKD KDFLDNEENED I LED I VLTLTLFEDREMI EERLKTYAHLFDDKVMK OLKRRRYTGWGRLSRKLINGIRDKOSGKTILDFLKSDGFANRNFMO LIHDDSLTFKEDIOKAOVSGOGDSLHEHIANLAGSPAIKKGILOTV KWDELVKVMGRHKPENIVIEMARENOTTOKGOKNSRERMKRIEEG
[0638]
[0639] IKELGSOILKEHPVENTOLONEKLYLYYLONGRDMYVDOELDINRL ATTORNEY DOCKET NO. 180802-47402 / PCT
[0640] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0641] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO SDYDVDHIVPOSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEWKKM KNYWROLLNAKLITORKFDNLTKAERGGLSELDKAGFIKROLVETR OITKHVAOILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFO FYKVREINNYHHAHDAYLNAWGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEOEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLI ETNGETGEIVWDKGRDFATVRKVLSMPOVNIVKKTEVOTGGFSKES ILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLWAKVEKGKSK KLKS VKELLG I TI MERS S FEKNP I D FLEAKG YKEVKKDL 11 KL PKY SLFELENGRKRMLASAGELOKGNELALPSKYVNFLYLASHYEKLKG S PEDNEOKOLFVEOHKHYLDE 11 EG I SEFSKRVI LADANLDKVLS A YNKHRDKP I REQAEN 11 HL FTLTNLGAPAAFKY FDTT I DRKRYTS T KEVLDATLIHOSITGLYETRIDLSOLGGDSGGSGGSGGSTNLSDI I EKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDEN VML LTSDAPE YKPWALV I QDSNGENK I KMLSGGSGGSGGSTNL SD I IEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDE NVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFES PKKKRKV
[0642] RrA3f MRNA39 mkpqi rdhrpnpmeamyphi fyfhf enl ekaygrne t wl c f t ve i i 778 NNGRRT 88 kqy Ipvpwkkgvf rnqvdpe t hchaekc f 1 s wf cnnt 1 spkknyqv twytswspcpecagevaef laehsnvkltiytarlyyfwdtdyqeg Irslseegasveimdyedf qycwenfvyddgepf krwkglkynf qs Itrrlreil qSGGSSGGSSGSETPGTSESATPESSGGSSGGSGKRN Y I LGLAI G I TS VG YG 11 DYETRDVI DAGVRL FKEANVENNEGRRS K RGARRLKRRRRHRIORVKKLLFDYNLLTDHSELSGINPYEARVKGL SOKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEOISRNS KALEEKYVAELOLERLKKDGEVRGSINRFKTSDYVKEAKOLLKVOK AYHOLDOS F I DTY I DLLETRRTYYEGPGEGS PFGWKD I KEWYEMLM GHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEK FO 11 ENVFKOKKKPTLKO I AKE I LVNEED I KGYRVTS TGKPE FTNL KVYHDIKDITARKEI IENAELLDOIAKILTI YOSSEDIOEELTNLN SELTOEEIEOISNLKGYTGTHNLSLKAINLILDELWHTNDNOIAIF NRLKLVPKKVDLSOOKEIPTTLVDDFILSPWKRSFIOSIKVINAI I KKYGLPND 111 ELAREKNSKDAQKMINEMQKRNRQTNERI EE HR TTGKENAKYLIEKIKLHDMOEGKCLYSLEAI PLEDLLNNPFNYEVD HIIPRSVSFDNSFNNKVLVKOEENSKKGNRTPFOYLSSSDSKISYE T FKKH I LNL AKGKGR I S KTKKE YLLEE RD INRFSVOKDFI NRNLVD TRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKER NKGYKHHAEDALI IANADFI FKEWKKLDKAKKVMENQMFEEKQAES MPEIETEOEYKEI FITPHQIKHIKDFKDYKYSHRVDKKPNRELIND TLYSTRKDDKGNTLIVNNLNGLYDKDNDKLKKLINKSPEKLLMYHH DPOTYOKLKLIMEOYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIK KIKYYGNKLNAHLDITDDYPNSRNKWKLSLKPYRFDVYLDNGVYK FVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNOAEFIASFYNND LIKINGELYRVI G VNNDLLNR I E VNM I D I TYRE YLENMNDKR P PR I I KT I AS KTQ S I KKYS TD I LGNL YE VKS KKH PQ 11 KKGGS PKKKRKV S SDYKDHDGDYKDHD I DYKDDDDKSGGSGGSGGSTNLSD I 1 EKETG KQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLT SDAPEYKPWALVI QDSNGENK IKMLSGGSGGSGGSTNLSD I I EKET GKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLL TSDAPE YKPWALV I QDSNGENK I KMLSGGSKRTADGSEFESPKKKR KV
[0643] RrA3fNNN MRNA65 mkpqi rdhrpnpmeamyphi fyfhf enl ekaygrne twl c f t ve i i 779 (SpRYc) 09 kqy Ipvpwkkgvf rnqvdpe t hchaekc f 1 s wf cnnt 1 spkknyqv
[0644]
[0645] twytswspcpecagevaef laehsnvkltiytarlyyfwdtdyqeg ATTORNEY DOCKET NO. 180802-47402 / PCT
[0646] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0647] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO
[0648] Irslseegasveimdyedf qycwenfvyddgepf krwkglkynf qs Itrrlreil gSGGSSGGSSGSETPGTSESATPESSGGSSGGSEKKY SIGLDIGTNSVGWAVITDDYKVPSKKFKVLGNTNRKSIKKNLMGAL L FDSGETAE ATRLKRTARRRYTRRKNR I R YLOE I FANEMAKLDDS F FORLEESFLVEEDKKNERHPI FGNLADEVAYHRNYPTIYHLRKKLA DSPEKADLRLI YLALAHI IKFRGHFLIEGKLNAENSDVAKLFYOLI OTYNOLFEESPLDEIEVDAKGILSARLSKSKRLEKLIAVFPNEKKN GLFGNI IALALGLTPNFKSNFDLTEDAKLOLSKDTYDDDLDELLGO IGDOYADLFSAAKNLSDAILLSDILRSNSEVTKAPLSASMVKRYDE HHODLALLKTLVROOFPEKYAEI FKDDTKNGYAGYVGADKKLRKRS GKLATEEEFYKFIKPILEKMDGAEELLAKLNRDDLLRKORTFDNGS I PHOI HLKELHAI LRROEEFYPFLKENREKI EKILTFRI PYYVGPL ARGNSRFAWLTRKSEEAITPWNFEEWDKGASAOSFIERMTNFDEO LPNKKVLPKHSLLYEYFTVYNELTKVKYVTERMRKPEFLSGEOKKA I VDLL FKTNRKVTVKOLKE DYFKKIECFDSVEI I G VE DR FNAS LGT YHDLLKI I KDKDFLDNEENED I LED I VLTLTLFEDREMI EERLKTY AHLFDDKVMKOLKRRHYTGWGRLSRKMINGIRDKOSGKTILDFLKS DGFSNRNFMOLIHDDSLTFKEEIEKAOVSGOGDSLHEOIADLAGSP AIKKGILOTVKIVDELVKVMGHKPENIVIEMARENOTTTKGLOOSR ERKKRIEEGIKELESOILKENPVENTOLONEKLYLYYLONGRDMYV DOELDINRLSDYDVDHIVPOSFIKDDSIDNKVLTRSVENRGKSDNV PSEEWKKMKNYWROLLNAKLITORKFDNLTKAERGGLSEADKAGF I KROLVETRO I TKHVAR I LDS RMNTKRDKNDKP I REVKVI TLKSKL VSD FRKD FOLYKVRD INNYHHAHDAYLNAWGTAL I KKY PKLE SE F VYGDYKVYDVRKM I AKS EOE I GKATAKRF FYSN I MNF FKTE VKLAN GET RKRPL I ETNGETGE WWNKEKD FATVRKVLAM POVN I VKKTE V OTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSVLW AKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKK DLI IKLPKYSLFELENGRKRMLASAKOLOKGNELALPSKYVNFLYL ASHYEKLKGSPEDNEOKOLFVEOHKHYLDEI IEOISEFSKRVILAD ANLDKVLSAYNKHRDKPIREOAENI IHLFTLTRLGAPRAFKYFDTT I DPKO YRSTKE VLDATL I HQS I TGL YETR I DLS QEGGDSGGSGGSG GSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHT AYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSGGS GGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVH TAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKR TADGSEFESPKKKRKV
[0649] RrA3fNRN MRNA62 mkpqi rdhrpnpmeamyphi f y f hf enl ekaygrne t wl c f t ve i i 780 (SpRY) 76 kqy Ipvpwkkgvf rnqvdpe t hchaekc f 1 s wf cnnt 1 spkknyqv twytswspcpecagevaef laehsnvkltiytarlyyfwdtdyqeg Irslseegasveimdyedf qycwenfvyddgepf krwkglkynf qs Itrrlreil aSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKK. Y SIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAERTRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS F FHRLEESFLVEEDKKHERHPI FGNIVDEVAYHEKYPTIYHLRKKLV DSTDKADLRLI YLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIOLV OTYNOLFEENPINASGVDAKAILSARLSKSRRLENLIAOLPGEKKN GLFGNLIALSLGLTPNFKSNFDLAEDAKLOLSKDTYDDDLDNLLAO IGDOYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDE HHODLTLLKALVROOLPEKYKEI FFDOSKNGYAGYIDGGASOEEFY KFI KP I LEKMDGTEELLVKLNREDLLRKORTFDNGS I PHOI HLGEL HAI LRROEDFYPFLKDNREKI EKILTFRI PYYVGPLARGNSRFAWM TRKSEET I TPWNFEE WDKGAS AOS F I ERMTNFDKNL PNEKVL PKH
[0650]
[0651] SLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEOKKAIVDLLFKTNR ATTORNEY DOCKET NO. 180802-47402 / PCT
[0652] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0653] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO KVTVKOLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKD KDFLDNEENED I LED I VLTLTLFEDREMI EERLKTYAHLFDDKVMK OLKRRRYTGWGRLSRKLINGIRDKOSGKTILDFLKSDGFANRNFMO LIHDDSLTFKEDIOKAOVSGOGDSLHEHIANLAGSPAIKKGILOTV KWDELVKVMGRHKPENIVIEMARENOTTOKGOKNSRERMKRIEEG IKELGSOILKEHPVENTOLONEKLYLYYLONGRDMYVDOELDINRL SDYDVDHIVPOSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEWKKM KNYWROLLNAKLITORKFDNLTKAERGGLSELDKAGFIKROLVETR OITKHVAOILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFO FYKVREINNYHHAHDAYLNAWGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEOEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLI ETNGETGEIVWDKGRDFATVRKVLSMPOVNIVKKTEVOTGGFSKES IRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSVLWAKVEKGKSK KLKS VKELLG I TI MERE S FEKNP I D FLEAKG YKEVKKDL 11 KL PKY SLFELENGRKRMLASAKOLOKGNELALPSKYVNFLYLASHYEKLKG S PEDNEOKOLFVEOHKHYLDE 11 EO I SEFSKRVI LADANLDKVLS A YNKHRDKP I REOAEN 11 HL FTLTRLGAPRAFKY FDTT I D PKOYRS T KEVLDATLIHOSITGLYETRIDLSOLGGDSGGSGGSGGSTNLSDI I EKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDEN VML LTSDAPE YKPWALV I QDSNGENK I KMLSGGSGGSGGSTNL SD I IEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDE NVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFES PKKKRKV
[0654] RrA3f MRNA62 mkpqi rdhrpnpmeamyphi f y f hf enl ekaygrne t wl c f t ve i i 781 NRRH 78 kqy Ipvpwkkgvf rnqvdpe t hchaekc f 1 s wf cnnt 1 spkknyqv twytswspcpecagevaef laehsnvkltiytarlyyfwdtdyqeg Irslseegasveimdyedf qycwenfvyddgepf krwkglkynf qs Itrrlreil CjSGGSSGGSSGSETPGTSESATPESSGGSSGGSDKKY SIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAEATRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS F FHRLEESFLVEEDKKHERHPI FGNIVDEVAYHEKYPTIYHLRKKLV DSTDKADLRLI YLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIOLV OTYNOLFEENPINASGVDAKAILSARLSKSRRLENLIAOLPGEKKN GLFGNLIALSLGLTPNFKSNFDLAEDAKLOLSKDTYDDDLDNLLAO IGDOYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMVKRYDE HHODLTLLKALVROOLPEKYKEI FFDOSKNGYAGYIDGGASOEEFY KFIKPILEKMDGTEELLVKLNREDLLRKORTFDNGI I PHOIHLGEL HAILRROGDFYPFLKDNREKIEKILTFRI PYYVGPLARGNSRFAWM TRKSEET I TPWNFEE WDKGAS AOS F I ERMTNFDKNL PNEKVL PKH SLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEOKKAIVDLLFKTNR KVTVKOLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKI IKD KDFLDNEENED I LED I VLTLTLFEDREMI EERLKTYAHLFDDKVMK OLKRLRYTGWGRLSRKLINGIRDKOSGKTILDFLKSDGFANRNFMO LIHDDSLTFKEDIOKAOVSGOGDSLHEHIANLAGSPAIKKGILOTV KWDELVKVMGGHKPENIVIEMARENOTTOKGOKNSRERMKRIEEG IKELGSOILKEHPVENTOLONEKLYLYYLONGRDMYVDOELDINRL SDYDVDHIVPOSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEWKKM KNYWROLLNAKLITORKFDNLTKAERGGLSELDKAGFIKROLVETR OITKHVAOILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFO FYKVREINNYHHAHDAYLNAWGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEOEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLI ETNGETGEIVWDKGRDFATVRKVLSMPOVNIVKKTEVOTGGFSKES I LPKGNSDKLI ARKKDWDPKKYGGFNS PTAAYSVLWAKVEKGKSK
[0655]
[0656] KLKS VKELLG I TI MERE S FEKNP I G FLEAKG YKEVKKDL 11 KL PKY ATTORNEY DOCKET NO. 180802-47402 / PCT
[0657] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0658] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO SLFELENGRKRMLASAGVLHKGNELALPSKYVNFLYLASHYEKLKG S PEDNEOKOLFVEOHKHYLDE 11 EO I SEFSKRVI LADANLDKVLS A YNKHRDKP I REOAEN 11 HL FTLTNLGVPAAFKY FDTT I DKKRYTS T KEVLDATLIHOSITGLYETRIDLSOLGGDSGGSGGSGGSTNLSDI I EKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDEN VML LTSDAPE YKPWALV I QDSNGENK I KMLSGGSGGSGGSTNL SD I IEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDE NVMLLTSDAPEYKPWALVIQDSNGENKIKMLSGGSKRTADGSEFES PKKKRKV
[0659] RrA3f MRNA62 MKRTADGSEFESPKKKRKVmkpqirdhrpnpmeamyphi f y f hf en 782 NRTH 60 lekaygrnetwlcf tveiikqylpvpwkkgvf rnqvdpethchaek cf Iswf cnntlspkknyqvtwytswspcpecagevaef laehsnvk Itiytarlyyfwdtdyqeglrslseegasveimdyedfqycwenfv y ddgep f krwkgl kynf qsltrrlrei 1 qSGGSSGGSSGSETPGTS ESATPESSGGSSGGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKKF KVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNR I CYLOE I FSNEMAKVDDS FFHRLEES FLVEEDKKHERHP I FGNI VD E VAYHEKYPTI YHLRKKLVDS TDKADLRL I YLALAHM I KFRGH FL I EGDLNPDNSDVDKLFIOLVOTYNOLFEENPINASGVDAKAILSARL SKSRRLENLIAOLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDA KLOLS KDTYDDDLDNLLAO I GDO YADL FLAAKNLS DAI LLS D I LRV NTEITKAPLSASMVKRYDEHHODLTLLKALVROOLPEKYKEIFFDO SKNGYAGYIDGGASOEEFYKFIKPILEKMDGTEELLVKLNREDLLR KORTFDNGI IPHOIHLGELHAILRROGDFYPFLKDNREKIEKILTF RIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEWDKGASAOSFI ERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPA FLSGEOKKAIVDLLFKTNRKVTVKOLKEDYFKKIECFDSVEISGVE DRFNASLGTYHDLLKI IKDKDFLDNEENEDILEDIVLTLTLFEDRE M I EERLKTYAHLFDDKVMKOLKRLR YTGWGRLS RKL I NG I RDKOSG KTILDFLKSDGFANRNFMOLIHDDSLTFKEDIOKAOVSGOGDSLHE H I ANLAGS PAI KKGI LOTVKVVDELVKVMGGHKPENI VI EMARENO TTOKGOKNS RERMKR I EEG I KELGS 01 LKEH PVENTOLONEKL YL Y YLONGRDMYVDOELDINRLSDYDVDHIVPOSFLKDDSIDNKVLTRS DKNRGKSDNVPSEEWKKMKNYWROLLNAKLITORKFDNLTKAERG GLSELDKAGFIKROLVETROITKHVAOILDSRMNTKYDENDKLIRE VKVITLKSKLVSDFRKDFOFYKVREINNYHHAHDAYLNAWGTALI KKY PKLE SE FVYGDYKVYDVRKM I AKS EOE I GKATAKYF FYSN I MN FFKTE ITLANGEI RKRPLI ETNGETGE I VWDKGRDFATVRKVLSMP
[0660] 0 VN I VKKTE VOTGG F S KE S I L PKGNS DKL I ARKKD WD PKKYGG FNS PTVAYSVLWAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIGFL EAKGYKEVKKDLI IKLPKYSLFELENGRKRMLASASVLHKGNELAL PSKYVNFLYLASHYEKLKGSSEDNKOKOLFVEOHKHYLDEI IEOIS E FS KRVI LADANLDKVLS AYNKHRDKP I REOAENI I HLFTLTNLGA S AAFKYFDTTI GRKLYTSTKEVLDATL I HOS I TGLYETR I DLSOLG
[0661] CrDSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGN KPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKI KMLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIG NKPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENK I KMLSGGSKRTADGS E FES PKKKRKV CBE6b_NGG MRNA92 MKRTADGSEFESPKKKRKVs s eve f sheywmrhal 11 akrardege 792 (version 2) 79 apvgavlvlnnrvigegwvrriglhdptahaeimalrqgglvmqnp rlidatlyvtf epcvmcagaminsrigrvvf gvrnskrgaagslmn vlnypgmnhrveitegiladecaallcdf yrmprqvf naqkkaqss
[0662]
[0663] i nEGGSEEEEESGSDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFK ATTORNEY DOCKET NO. 180802-47402 / PCT
[0664] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0665] Base Editor Encoding Sequence SEQ Name mRNA ID ID NO VLGNTDRHS I KKNL I GALL FDSGETAE ATRLKRTARRRYTRRKNR I CYLOEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDE VAYHEKYPTIYHLRKKLVDSTDKADLRLI YLALAHMIKFRGHFLIE GDLNPDNSDVDKLFIOLVOTYNOLFEENPINASGVDAKAILSARLS KSRRLENLIAOLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAK LOLSKDTYDDDLDNLLAOIGDOYADLFLAAKNLSDAILLSDILRVN TEITKAPLSASMIKRYDEHHODLTLLKALVROOLPEKYKEI FFDOS KNG YAGY I DGGAS GEE F YKF I KP I LEKMDGTEELLVKLNREDLLRK ORTFDNGS I PHQI HLGELHAI LRRQEDFYPFLKDNREKI EKILTFR I PYYVGPLARGNSRFAWMTRKSEETITPWNFEEWDKGASAOSFIE RMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAF LSGEOKKAIVDLLFKTNRKVTVKOLKEDYFKKIECFDSVEISGVED RFNASLGTYHDLLKI I KDKDFLDNEENED I LED I VLTLTLFEDREM IEERLKTYAHLFDDKVMKOLKRRRYTGWGRLSRKLINGIRDKOSGK TILDFLKSDGFANRNFMOLIHDDSLTFKEDIOKAOVSGOGDSLHEH I ANLAGS PAI KKG I LOTVKWDELVKVMGRHKPENI VI EMARENOT TOKGOKNSRERMKRIEEGIKELGSOILKEHPVENTOLONEKLYLYY LONGRDMYVDOELDINRLSDYDVDHIVPOSFLKDDSIDNKVLTRSD KNRGKSDNVPSEEWKKMKNYWROLLNAKLITORKFDNLTKAERGG LSELDKAGFIKROLVETROITKHVAOILDSRMNTKYDENDKLIREV KVITLKSKLVSDFRKDFOFYKVREINNYHHAHDAYLNAWGTALIK KYPKLESEFVYGDYKVYDVRKMIAKSEOEIGKATAKYFFYSNIMNF FKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPO VNI VKKTEVOTGG FS KE S I LPKRNS DKL I ARKKDWDPKKYGGFDS P TVAYS VL WAKVEKGKS KKLKS VKELLG I TI MERE S FEKNP I D FLE AKGYKEVKKDLI IKLPKYSLFELENGRKRMLASAGELOKGNELALP SKYVNFLYLASHYEKLKGS PEDNEOKOLFVEOHKHYLDE 11 EO I SE FSKRVILADANLDKVLSAYNKHRDKPIREOAENI IHLFTLTNLGAP AAFKY FDTT I DRKRYTS TKEVLDATL I HOS I TGLYETR I DLSOLGG
[0666] gSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNK PESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKIK MLSGGSGGSGGSTNLSDIIEKETGKQLVIQESILMLPEEVEEVIGN KPESDILVHTAYDESTDENVMLLTSDAPEYKPWALVIQDSNGENKI
[0667]
[0668] KML SGGSKRTADGSE FE S PKKKRKV
[0669] sg23 / sgRNA_088 spacer sequence: CAGGAUCCGCACAGACUCCA (SEQ ID NO: 747) (target gene: ALAS1). The target sequence corresponding to sg23 is as follows, where the PAM sequence is in bold: CAGGATCCGCACAGACTCCAGGG (SEQ ID NO: 748).
[0670] NUCLEOBASE EDITORS
[0671] Useful in the methods and compositions described herein are nucleobase editors that edit, modify or alter a target nucleotide sequence of a polynucleotide. Nucleobase editors described herein typically include a polynucleotide programmable nucleotide binding domain and a nucleobase editing domain (e.g., adenosine deaminase, cytidine deaminase, or a dual deaminase). A polynucleotide programmable nucleotide binding domain, when in conjunction with a bound ATTORNEY DOCKET NO. 180802-47402 / PCT
[0672] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0673] guide polynucleotide (e.g., gRNA), can specifically bind to a target polynucleotide sequence and thereby localize the base editor to the target nucleic acid sequence desired to be edited.
[0674] Polynucleotide Programmable Nucleotide Binding Domain
[0675] Polynucleotide programmable nucleotide binding domains bind polynucleotides (e.g, RNA, DNA). A polynucleotide programmable nucleotide binding domain of a base editor can itself comprise one or more domains (e.g., one or more nuclease domains). In some embodiments, the nuclease domain of a polynucleotide programmable nucleotide binding domain comprises an endonuclease or an exonuclease.
[0676] Disclosed herein are base editors comprising a polynucleotide programmable nucleotide binding domain comprising all or a portion (e.g., a functional portion) of a CRISPR protein (i.e., a base editor comprising as a domain all or a portion (e.g., a functional portion) of a CRISPR protein (e.g., a Cas protein), also referred to as a “CRISPR protein-derived domain” of the base editor). A CRISPR protein-derived domain incorporated into a base editor can be modified compared to a wild-type or natural version of the CRISPR protein. A CRISPR protein-derived domain can comprise one or more mutations, insertions, deletions, rearrangements and / or recombinations relative to a wild-type or natural version of the CRISPR protein.
[0677] Cas proteins that can be used herein include class 1 and class 2. Non-limiting examples of Cas proteins include Casl, CaslB, Cas2, Cas3, Cas4, Cas5, Cas5d, Cas5t, Cas5h, Cas5a, Cas6, Cas7, Cas8, Cas9 (also known as Csnl or Csxl2), CaslO, Csyl, Csy2, Csy3, Csy4, Csel, Cse2, Cse3, Cse4, Cse5e, Cscl, Csc2, Csa5, Csnl, Csn2, Csml, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csbl, Csb2, Csb3, Csxl7, Csxl4, CsxlO, Csxl6, CsaX, Csx3, Csxl, CsxlS, Csfl, Csf2, CsO, Csf4, Csdl, Csd2, Cstl, Cst2, Cshl, Csh2, Csal, Csa2, Csa3, Csa4, Csa5, Casl2a / Cpfl, Casl2b / C2cl (e.g, SEQ ID NO: 232), Casl2c / C2c3, Casl2d / CasY, Casl2e / CasX, Casl2g, Casl2h, Casl2i, and Casl2j / Cas, CARF, DinG, Turbo Cas9 (i.e., an SpCas9 with the amino acid alterations Q844R, V842L, F846Y, L847M, and I852F), homologues thereof, or modified versions thereof. A CRISPR enzyme can direct cleavage of one or both strands at a target sequence, such as within a target sequence and / or within a complement of a target sequence. For example, a CRISPR enzyme can direct cleavage of one or both strands within about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 100, 200, 500, or more base pairs from the first or last nucleotide of a target sequence.
[0678] A vector that encodes a CRISPR enzyme that is mutated to with respect to a corresponding wild-type enzyme such that the mutated CRISPR enzyme lacks the ability to cleave one or both strands of a target polynucleotide containing a target sequence can be used. A ATTORNEY DOCKET NO. 180802-47402 / PCT
[0679] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0680] Cas protein (e.g., Cas9, Cast 2) or a Cas domain (e.g., Cas9, Cast 2) can refer to a polypeptide or domain with at least or at least about 50%, 60%, 70%, 80%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity and / or sequence homology to a wild-type exemplary Cas polypeptide or Cas domain. Cas (e.g., Cas9, Cas 12) can refer to the wild-type or a modified form of the Cas protein that can comprise an amino acid change such as a deletion, insertion, substitution, variant, mutation, fusion, chimera, or any combination thereof.
[0681] In some embodiments, a CRISPR protein-derived domain of a base editor can include all or a portion (e.g., a functional portion) of Cas9 from Corynebacterium ulcerans (NCBI Refs:
[0682] NC_015683.1, NC_017317.1); Corynebacterium diphtheria (NCBI Refs: NC_016782.1, NC_016786.1); Spiroplasma syrphidicola (NCBI Ref: NC_021284.1); Prevotella intermedia (NCBI Ref: NC_017861.1); Spiroplasma taiwanense (NCBI Ref: NC_021846.1); Streptococcus iniae (NCBI Ref: NC_021314.1); Belliella baltica (NCBI Ref: NC_018010.1); Psychroflexus torquis (NCBI Ref: NC 018721.1); Streptococcus thermophilus (NCBI Ref: YP 820832.1); Listeria innocua (NCBI Ref: NP 472073.1); Campylobacter jejuni (NCBI Ref:
[0683] YP_002344900.1); Neisseria meningitidis (NCBI Ref: YP_002342100.1), Streptococcus pyogenes, or Staphylococcus aureus.
[0684] Some aspects of the disclosure provide high fidelity Cas9 domains. High fidelity Cas9 domains are known in the art and described, for example, in KI einstiver, B. P., et al. “High-fidelity CRISPR-Cas9 nucleases with no detectable genome-wide off-target effects.” Nature 529, 490-495 (2016); and Slaymaker, I. M., et al. “Rationally engineered Cas9 nucleases with improved specificity.” Science 351, 84-88 (2015); the entire contents of each of which are incorporated herein by reference. An Exemplary high fidelity Cas9 domain is provided in the Sequence Listing as SEQ ID NO: 233.
[0685] In some embodiments, any of the Cas9 fusion proteins or complexes provided herein comprise one or more of a D10A, N497X, a R661X, a Q695X, and / or a Q926X mutation, or a corresponding mutation in any of the amino acid sequences provided herein, wherein X is any amino acid..
[0686] Typically, Cas9 proteins, such as Cas9 from S. pyogenes (spCas9), require a “protospacer adjacent motif (PAM)” or PAM-like motif, which is a 2-6 base pair DNA sequence immediately following the DNA sequence targeted by the Cas9 nuclease in the CRISPR bacterial adaptive immune system. The presence of an NGG PAM sequence is required to bind a particular nucleic acid region, where the “N” in “NGG” is adenosine (A), thymidine (T), or cytosine (C), and the G is guanosine. In some embodiments, any of the fusion proteins or complexes provided herein may contain a Cas9 domain that is capable of binding a nucleotide sequence that does not ATTORNEY DOCKET NO. 180802-47402 / PCT
[0687] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0688] contain a canonical (e.g., NGG) PAM sequence. Cas9 domains that bind to non-canonical PAM sequences have been described in the art and would be apparent to the skilled artisan. For example, Cas9 domains that bind non-canonical PAM sequences have been described in Kleinstiver, B. P., et al., “Engineered CRISPR-Cas9 nucleases with altered PAM specificities” Nature 523, 481-485 (2015); and Kleinstiver, B. P., etal., “Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition” Nature Biotechnology 33, 1293-1298 (2015); the entire contents of each are hereby incorporated by reference.
[0689] In some embodiments, the napDNAbp is a circular permutant (e.g., SEQ ID NO: 238). In some embodiments, the polynucleotide programmable nucleotide binding domain comprises a nickase domain. Herein the term “nickase” refers to a polynucleotide programmable nucleotide binding domain comprising a nuclease domain that is capable of cleaving only one strand of the two strands in a duplexed nucleic acid molecule (e.g., DNA). For example, where a polynucleotide programmable nucleotide binding domain comprises a nickase domain derived from Cas9, the Cas9-derived nickase domain can include a D10A mutation and a histidine at position 840. In another example, a Cas9-derived nickase domain comprises an H840A mutation, while the amino acid residue at position 10 remains a D.
[0690] In some embodiments, a Cas9 nuclease has an inactive (e.g., an inactivated) DNA cleavage domain, that is, the Cas9 is a nickase, referred to as an “nCas9” protein (for “nickase” Cas9; SEQ ID NO: 201). The Cas9 nickase may be a Cas9 protein that is capable of cleaving only one strand of a duplexed nucleic acid molecule (e.g., a duplexed DNA molecule). In some embodiments the Cas9 nickase comprises an amino acid sequence that is at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% identical to any one of the Cas9 nickases provided herein. Additional suitable Cas9 nickases will be apparent to those of skill in the art based on this disclosure and knowledge in the field and are within the scope of this disclosure.
[0691] Also provided herein are base editors comprising a polynucleotide programmable nucleotide binding domain which is catalytically dead (i.e., incapable of cleaving a target polynucleotide sequence). For example, in the case of a base editor comprising a Cas9 domain, the Cas9 can comprise both a D10A mutation and an H840A mutation. In further embodiments, a catalytically dead polynucleotide programmable nucleotide binding domain comprises a point mutation (e.g., D10A or H840A) as well as a deletion of all or a portion (e.g., a functional portion) of a nuclease domain. dCas9 domains are known in the art and described, for example, in Qi et al., “Repurposing CRISPR as an RNA-guided platform for sequence-specific control of ATTORNEY DOCKET NO. 180802-47402 / PCT
[0692] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0693] gene expression.” Cell. 2013; 152(5): 1173-83, the entire contents of which are incorporated herein by reference.
[0694] The term “protospacer adjacent motif (PAM)” or PAM-like motif refers to a 2-6 base pair DNA sequence immediately following the DNA sequence targeted by a nucleic acid programmable DNA binding protein. In some embodiments, the PAM can be a 5' PAM (z.e., located upstream of the 5' end of the protospacer). In other embodiments, the PAM can be a 3' PAM (z.e., located downstream of the 5' end of the protospacer). The PAM sequence can be any PAM sequence known in the art. Suitable PAM sequences include, but are not limited to, NGG, NGA, NGC, NGN, NGT, NGTT, NGCG, NGAG, NGAN, NGNG, NGCN, NGCG, NGTN, NNGRRT, NNNRRT, NNGRR(N), TTTV, TYCV, TYCV, TATV, NNNNGATT, NNAGAAW, or NAAAAC. Y is a pyrimidine; N is any nucleotide base; W is A or T.
[0695] A base editor provided herein can comprise a CRISPR protein-derived domain that is capable of binding a nucleotide sequence that contains a canonical or non-canonical protospacer adjacent motif (PAM) sequence.
[0696] In some embodiments, the PAM is an “NRN” PAM where the “N” in “NRN” is adenine (A), thymine (T), guanine (G), or cytosine (C), and the R is adenine (A) or guanine (G); or the PAM is an “NYN” PAM, wherein the “N” in NYN is adenine (A), thymine (T), guanine (G), or cytosine (C), and the Y is cytidine (C) or thymine (T), for example, as described in R. T. Walton etal., 2020, Science, 10.1126 / science.aba8853 (2020), the entire contents of which are incorporated herein by reference.
[0697] Several PAM variants are described in Table 3 below.
[0698] Table 3. Cas9 proteins and corresponding PAM sequences. N is A, C, T, or G; and V is A, C, or G.
[0699] Variant PAM
[0700] spCas9 NGG
[0701] spCas9-VRQR NGA
[0702] spCas9-VRER NGCG
[0703] xCas9 (sp) NGN
[0704] saCas9 NNGRRT
[0705] saCas9-KKH NNNRRT
[0706] spCas9-MQKSER NGCG
[0707] spCas9-MQKSER NGCN
[0708] spCas9-LRKIQK NGTN
[0709]
[0710] ATTORNEY DOCKET NO. 180802-47402 / PCT
[0711] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0712] Variant PAM
[0713] spCas9-LRVSQK NGTN
[0714] spCas9-LRVSQL NGTN
[0715] spCas9-MQKFRAER NGC
[0716] Cpfl 5 ' (TTTV)
[0717] SpyMac 5 ' -NAA-3 '
[0718]
[0719] In some embodiments, the PAM is NGC. In some embodiments, the NGC PAM is recognized by a Cas9 variant. In some embodiments, the NGC PAM Cas9 variant includes one or more amino acid substitutions selected from DI 135M, SI 136Q, G1218K, E1219F, A1322R, D1332A, R1335E, and T1337R (collectively termed “MQKFRAER”) of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, the Cas9 variant contains one or more amino acid substitutions selected from DI 135V, G1218R, R1335Q, and T1337R (collectively termed VRQR) of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, the Cas9 variant contains one or more amino acid substitutions selected from DI 135V, G1218R, R1335E, and T1337R (collectively termed VRER) of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, the Cas9 variant contains one or more amino acid substitutions selected from E782K, N968K, and R1015H (collectively termed KHH) of saCas9 (SEQ ID NO: 218). In some embodiments, the Cas9 variant includes one or more amino acid substitutions selected from D1135M, S1136Q, G1218K, E1219S, R1335E, and T1337R (collectively termed “MQKSER”) of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9.
[0720] In some cases, a Cas9 variant has specificity for the PAM 5 " -NGC-3 ". In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from D1135M, S1136Y, G1218K, E1219F, A1322R, D1332A, R1335E, and T1337K of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, the Cas9 variant includes one or more amino acid substitutions selected from DI 135L, SI 136Y, G1218K, E1219F, A1322R, D1332A, R1335E, and T1337R of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from D1135M, S1136Y, G1218K, E1219F, A1322R, D1332A, R1335E, and T1337K of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from DI 135L, S1136Y, G1218K, E1219F, A1283D, A1322R, D1332A, R1335E, and T1337K of spCas9 (SEQ ID No: 197), or a corresponding mutation in another ATTORNEY DOCKET NO. 180802-47402 / PCT
[0721] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0722] Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from A61R, LI 111R, DI 135L, SI 136W, G1218K, E1219Q, N1317R, A1322R, R1333P, R1335Q, and T1337R of spCas9 (SEQ ID No: 197) (SpRY), or a corresponding mutation in another Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from D1135L, S1136Q, G1218K, E1219F, E1250K, A1283D, A1322R, D1332A, R1335E, and T1337K of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from D1135M, S1136Y, G1218K, E1219F, E1250K, A1283D, A1322R, D1332A, R1335E, and T1337R of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, a Cas9 variant includes one or more amino acid substitutions selected from R765A, Q768A, D1135L, S1136Y, G1218K, A1283D, E1219F, A1322R, D1332A, R1335E, and T1337K of spCas9 (SEQ ID No: 197), or a corresponding mutation in another Cas9. In some embodiments, any of the Cas9 proteins provided herein, including an SpCas9 comprises any one, two, three, four, five, six, seven, eight, nine, or ten of the following amino acid substitutions in a corresponding residue: R765A, Q768A, W1126R, R1359W, E1250K, A1239T, A1239V, A1283D, R1335D, D1135L, D1135M, D1135R, D1135W, S1136H, S1136Q, S1136Y, G1218D, G1218K, G1218R, G1218E, G1218L, E1219F, E1219K, E1219N, A1322A, A1322R, A1322K, D1332A, R1335V, T1337K, T1337T, D1332A, DI 135V and T1337R.
[0723] In some embodiments, a CRISPR protein-derived domain of a base editor comprises all or a portion (e.g., a functional portion) of a Cas9 protein with a canonical PAM sequence (NGG). In other embodiments, a Cas9-derived domain of a base editor can employ a non-canonical PAM sequence. Such sequences have been described in the art and would be apparent to the skilled artisan. For example, Cas9 domains that bind non-canonical PAM sequences have been described in Kleinstiver, B. P., etal., “Engineered CRISPR-Cas9 nucleases with altered PAM specificities” Nature 523, 481-485 (2015); and Kleinstiver, B. P., etal., “Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition” Nature Biotechnology 33, 1293-1298 (2015); R. T. Walton et al. “Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants” Science 10.1126 / science.aba8853 (2020); Hu et al. “Evolved Cas9 variants with broad PAM compatibility and high DNA specificity,” Nature, 2018 Apr. 5, 556(7699), 57-63; Miller etal., “Continuous evolution of SpCas9 variants compatible with non-G PAMs” Nat. Biotechnol., 2020 Apr;38(4):471-481; the entire contents of each are hereby incorporated by reference. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0724] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0725] Fusion Proteins or Complexes Comprising a NapDNAbp and a Cytidine Deaminase and / or Adenosine Deaminase
[0726] Some aspects of the disclosure provide fusion proteins or complexes comprising a Cas9 domain or other nucleic acid programmable DNA binding protein (e.g., Cast 2) and one or more cytidine deaminase, adenosine deaminase, or cytidine adenosine deaminase domains. It should be appreciated that the Cas9 domain may be any of the Cas9 domains or Cas9 proteins (e.g., dCas9 or nCas9) provided herein. In some embodiments, any of the Cas9 domains or Cas9 proteins (e.g., dCas9 or nCas9) provided herein may be fused with any of the cytidine deaminases and / or adenosine deaminases provided herein. The domains of the base editors disclosed herein can be arranged in any order.
[0727] In some embodiments, the fusion proteins or complexes comprising a cytidine deaminase or adenosine deaminase and a napDNAbp (e.g., Cas9 or Casl2 domain) do not include a linker sequence. In some embodiments, a linker is present between the cytidine or adenosine deaminase and the napDNAbp. In some embodiments, cytidine or adenosine deaminase and the napDNAbp are fused via any of the linkers provided herein. For example, in some embodiments the cytidine or adenosine deaminase and the napDNAbp are fused via any of the linkers provided herein.
[0728] It should be appreciated that the fusion proteins or complexes of the present disclosure may comprise one or more additional features. For example, in some embodiments, the fusion protein or complex may comprise inhibitors, cytoplasmic localization sequences, export sequences, such as nuclear export sequences, or other localization sequences, as well as sequence tags that are useful for solubilization, purification, or detection of the fusion proteins or complexes. Suitable protein tags provided herein include, but are not limited to, biotin carboxylase carrier protein (BCCP) tags, myc-tags, calmodulin-tags, FLAG-tags, hemagglutinin (HA)-tags, polyhistidine tags, also referred to as histidine tags or His-tags, maltose binding protein (MBP)-tags, nus-tags, glutathione-S-transferase (GST)-tags, green fluorescent protein (GFP)-tags, thioredoxin-tags, S-tags, Softags (e.g., Softag 1, Softag 3), strep-tags, biotin ligase tags, FlAsH tags, V5 tags, and SBP-tags. Additional suitable sequences will be apparent to those of skill in the art. In some embodiments, the fusion protein or complex comprises one or more His tags.
[0729] Exemplary, yet nonlimiting, fusion proteins are described in International PCT Application Nos. PCT / US2017 / 045381, PCT / US2019 / 044935, and PCT / US2020 / 016288, each of which is incorporated herein by reference for its entirety. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0730] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0731] Fusion Proteins or Complexes with Internal Insertions
[0732] Provided herein are fusion proteins or complexes comprising a heterologous polypeptide fused to a nucleic acid programmable nucleic acid binding protein, for example, a napDNAbp. The heterologous polypeptide can be fused to the napDNAbp at a C-terminal end of the napDNAbp, an N-terminal end of the napDNAbp, or inserted at an internal location of the napDNAbp. In some embodiments, the heterologous polypeptide is a deaminase (e.g., cytidine or adenosine deaminase) or a functional fragment thereof. For example, a fusion protein can comprise a deaminase flanked by an N- terminal fragment and a C-terminal fragment of a Cas9 or Casl2 (e.g., Casl2b / C2cl), polypeptide.
[0733] The deaminase can be a circular permutant deaminase. In some embodiments, the deaminase is a circular permutant TadA, circularly permutated at amino acid residue 116, 136, or 65 as numbered in a TadA reference sequence.
[0734] The fusion protein or complexes can comprise more than one deaminase. The fusion protein or complex can comprise, for example, 1, 2, 3, 4, 5 or more deaminases. The deaminases in a fusion protein or complex can be adenosine deaminases, cytidine deaminases, or a combination thereof.
[0735] In some embodiments, the napDNAbp in the fusion protein or complex contains a Cas9 polypeptide or a fragment thereof. The Cas9 polypeptide can be a variant Cas9 polypeptide. The Cas9 polypeptide can be a circularly permuted Cas9 protein.
[0736] The heterologous polypeptide (e.g., deaminase) can be inserted in the napDNAbp (e.g., Cas9 or Casl2 (e.g., Casl2b / C2cl)) at a suitable location, for example, such that the napDNAbp retains its ability to bind the target polynucleotide and a guide nucleic acid. A deaminase (e.g., adenosine deaminase, cytidine deaminase, or adenosine deaminase and cytidine deaminase (dual deaminase)) can be inserted into a napDNAbp without compromising function of the deaminase (e.g., base editing activity) or the napDNAbp (e.g., ability to bind to target nucleic acid and guide nucleic acid).
[0737] In some embodiments, the deaminase (e.g., adenosine deaminase, cytidine deaminase, or adenosine deaminase and cytidine deaminase) is inserted in regions of the Cas9 polypeptide comprising higher than average B-factors (e.g., higher B factors compared to the total protein or the protein domain comprising the disordered region). Cas9 polypeptide positions comprising a higher than average B-factor can include, for example, residues 768, 792, 1052, 1015, 1022, 1026, 1029, 1067, 1040, 1054, 1068, 1246, 1247, and 1248 as numbered in SEQ ID NO: 197. Cas9 polypeptide regions comprising a higher than average B-factor can include, for example, residues 792-872, 792-906, and 2-791 as numbered in SEQ ID NO: 197. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0738] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0739] In some embodiments, a heterologous polypeptide (e.g., deaminase) is inserted in a flexible loop of a Cas9 polypeptide. The flexible loop portions can be selected from the group consisting of 530-537, 569-570, 686-691, 943-947, 1002-1025, 1052-1077, 1232-1247, or 1298-1300 as numbered in SEQ ID NO: 197, or a corresponding amino acid residue in another Cas9 polypeptide. The flexible loop portions can be selected from the group consisting of: 1-529, 538-568, 580-685, 692-942, 948-1001, 1026-1051, 1078-1231, or 1248-1297 as numbered in SEQ ID NO: 197, or a corresponding amino acid residue in another Cas9 polypeptide.
[0740] A heterologous polypeptide (e.g., adenine deaminase) can be inserted into a Cas9 polypeptide region corresponding to amino acid residues: 1017-1069, 1242-1247, 1052-1056, 1060-1077, 1002 - 1003, 943-947, 530-537, 568-579, 686-691, 1242-1247, 1298 - 1300, 1066-1077, 1052-1056, or 1060-1077 as numbered in SEQ ID NO: 197, or a corresponding amino acid residue in another Cas9 polypeptide.
[0741] A heterologous polypeptide (e.g., adenine deaminase) can be inserted in place of a deleted region of a Cas9 polypeptide. The deleted region can correspond to an N-terminal or C-terminal portion of the Cas9 polypeptide. Exemplary internal fusions base editors are provided in Table 4 below:
[0742] Table 4: Insertion loci in Cas9 proteins
[0743] BE ID Modification Other ID IBE001 Cas9 Tad A ins 1015 ISLAY01 IBE002 Cas9 Tad A ins 1022 ISLAY02 IBE003 Cas9 Tad A ins 1029 ISLAY03 IBE004 Cas9 Tad A ins 1040 ISLAY04 IBE005 Cas9 TadA ins 1068 ISLAY05 IBE006 Cas9 TadA ins 1247 ISLAY06 IBE007 Cas9 TadA ins 1054 ISLAY07 IBE008 Cas9 TadA ins 1026 ISLAY08 IBE009 Cas9 TadA ins 768 ISLAY09 IBE020 delta HNH TadA 792 ISLAY20 IBE021 N-term fusion single TadA helix truncated 165-end ISLAY21 IBE029 TadA-Circular Permutantl 16 insl067 ISLAY29 IBE031 TadA- Circular Permutant 136 insl248 ISLAY31 IBE032 TadA- Circular Permutant 136ins 1052 ISLAY32 IBE035 delta 792-872 TadA ins ISLAY35 IBE036 delta 792-906 TadA ins ISLAY36 IBE043 TadA-Circular Permutant 65 insl246 ISLAY43
[0744]
[0745] IBE044 TadA ins C-term truncate2791 ISLAY44
[0746] A heterologous polypeptide (e.g., deaminase) can be inserted within a structural or functional domain of a Cas9 polypeptide. A heterologous polypeptide (e.g., deaminase) can be ATTORNEY DOCKET NO. 180802-47402 / PCT
[0747] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0748] inserted between two structural or functional domains of a Cas9 polypeptide. A heterologous polypeptide (e.g., deaminase) can be inserted in place of a structural or functional domain of a Cas9 polypeptide, for example, after deleting the domain from the Cas9 polypeptide. The structural or functional domains of a Cas9 polypeptide can include, for example, RuvC I, RuvC II, RuvC III, Reel, Rec2, PI, or HNH.
[0749] A fusion protein can comprise a linker between the deaminase and the napDNAbp polypeptide. The linker can be a peptide or a non-peptide linker. For example, the linker can be an XTEN, (GGGS)n (SEQ ID NO: 246), SGGSSGGS (SEQ ID NO: 330), (GGGGS)n (SEQ ID NO: 247), (G)n, (EAAAK)n (SEQ ID NO: 248), (GGS)n, SGSETPGTSESATPES (SEQ ID NO: 249). In some embodiments, the fusion protein comprises a linker between the N-terminal Cas9 fragment and the deaminase. In some embodiments, the fusion protein comprises a linker between the C-terminal Cas9 fragment and the deaminase. In some embodiments, the N-terminal and C-terminal fragments of napDNAbp are connected to the deaminase with a linker. In some embodiments, the N-terminal and C-terminal fragments are joined to the deaminase domain without a linker. In some embodiments, the fusion protein comprises a linker between the N-terminal Cas9 fragment and the deaminase but does not comprise a linker between the C-terminal Cas9 fragment and the deaminase. In some embodiments, the fusion protein comprises a linker between the C-terminal Cas9 fragment and the deaminase but does not comprise a linker between the N-terminal Cas9 fragment and the deaminase.
[0750] In some embodiments, the napDNAbp in the fusion protein or complex is a Casl2 polypeptide, e.g., Casl2b / C2cl, or a functional fragment thereof capable of associating with a nucleic acid (e.g., a gRNA) that guides the Casl2 to a specific nucleic acid sequence. The Casl2 polypeptide can be a variant Casl2 polypeptide. In other embodiments, the N- or C-terminal fragments of the Casl2 polypeptide comprise a nucleic acid programmable DNA binding domain or a RuvC domain. In other embodiments, the fusion protein contains a linker between the Cast 2 polypeptide and the catalytic domain. In other embodiments, the amino acid sequence of the linker is GGSGGS (SEQ ID NO: 250) or GSSGSETPGTSESATPESSG (SEQ ID NO: 251). In other embodiments, the linker is a rigid linker. In other embodiments of the above aspects, the linker is encoded by GGAGGCTCTGGAGGAAGC (SEQ ID NO: 252) or GGCTCTTCTGGATCTGAAACACCTGGCACAAGCGAGAGCGCCACCCCTGAGAGCTCTGGC (SEQ ID NO: 253).
[0751] In other embodiments, the fusion protein or complex contains a nuclear localization signal (e.g., a bipartite nuclear localization signal). In other embodiments, the amino acid sequence of the nuclear localization signal is MAPKKKRKVGIHGVPAA (SEQ ID NO: 261). In ATTORNEY DOCKET NO. 180802-47402 / PCT
[0752] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0753] other embodiments of the above aspects, the nuclear localization signal is encoded by the following sequence:
[0754] ATGGCCCCAAAGAAGAAGCGGAAGGTCGGTATCCACGGAGTCCCAGCAGCC (SEQ ID NO: 262). In other embodiments, the Casl2b polypeptide contains a mutation that silences the catalytic activity of a RuvC domain. In other embodiments, the Cast 2b polypeptide contains D574A, D829A and / or D952A mutations.
[0755] In some embodiments, the fusion protein or complex comprises a napDNAbp domain (e.g., Casl2-derived domain) with an internally fused nucleobase editing domain (e.g., all or a portion (e.g., a functional portion) of a deaminase domain, e.g, an adenosine deaminase domain). In some embodiments, the napDNAbp is a Casl2b. In some embodiments, the base editor comprises a BhCasl2b domain with an internally fused TadA*8 domain inserted at the loci provided in Table 5 below.
[0756] Table 5: Insertion loci in Casllb proteins
[0757] BhCasl2b Insertion site Inserted between aa
[0758] position 1 153 PS
[0759] position 2 255 KE
[0760] position 3 306 DE
[0761] position 4 980 DG
[0762] position 5 1019 KL
[0763] position 6 534 FP
[0764] position 7 604 KG
[0765] position 8 344 HF
[0766] BvCasl2b Insertion site Inserted between aa
[0767] position 1 147 PD
[0768] position 2 248 GG
[0769] position 3 299 PE
[0770] position 4 991 GE
[0771] position 5 1031 KM
[0772] AaCasl2b Insertion site Inserted between aa
[0773] position 1 157 PG
[0774] position 2 258 VG
[0775] position 3 310 DP
[0776] position 4 1008 GE
[0777]
[0778] position 5 1044 GK
[0779] In some embodiments, the base editing system described herein is an ABE with TadA inserted into a Cas9. Polypeptide sequences of relevant ABEs with TadA inserted into a Cas9 are provided in the attached Sequence Listing as SEQ ID NOs: 263-308. ATTORNEY DOCKET NO. 180802-47402 / PCT
[0780] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0781] Exemplary, yet nonlimiting, fusion proteins are described in International PCT Application Nos. PCT / US2020 / 016285 and U. S. Provisional Application Nos. 62 / 852,228 and 62 / 852,224, the contents of which are incorporated by reference herein in their entireties.
[0782] Cto T Editing
[0783] In some embodiments, a base editor disclosed herein comprises a fusion protein or complex comprising cytidine deaminase capable of deaminating a target cytidine (C) base of a polynucleotide to produce uridine (U), which has the base pairing properties of thymine. In some embodiments, for example where the polynucleotide is double-stranded (e.g., DNA), the uridine base can then be substituted with a thymidine base (e.g., by cellular repair machinery) to give rise to a C: G to a T: A transition. In other embodiments, deamination of a C to U in a nucleic acid by a base editor cannot be accompanied by substitution of the U to a T.
[0784] The deamination of a target C in a polynucleotide to give rise to a U is a non-limiting example of a type of base editing that can be executed by a base editor described herein. In another example, a base editor comprising a cytidine deaminase domain can mediate conversion of a cytosine (C) base to a guanine (G) base. For example, a U of a polynucleotide produced by deamination of a cytidine by a cytidine deaminase domain of a base editor can be excised from the polynucleotide by a base excision repair mechanism e.g., by a uracil DNA glycosylase (UDG) domain), producing an abasic site. The nucleobase opposite the abasic site can then be substituted (e.g., by base repair machinery) with another base, such as a C, by for example a translesion polymerase. Although it is typical for a nucleobase opposite an abasic site to be replaced with a C, other substitutions (e.g., A, G or T) can also occur.
[0785] Accordingly, in some embodiments a base editor described herein comprises a deamination domain (e.g., cytidine deaminase domain) capable of deaminating a target C to a U in a polynucleotide. Further, as described below, the base editor can comprise additional domains which facilitate conversion of the U resulting from deamination to, in some embodiments, a T or a G. For example, a base editor comprising a cytidine deaminase domain can further comprise a uracil glycosylase inhibitor (UGI) domain to mediate substitution of a U by a T, completing a C-to-T base editing event. In another example, the base editor may comprise a uracil stabilizing protein as described herein. In another example, a base editor can incorporate a translesion polymerase to improve the efficiency of C-to-G base editing, since a translesion polymerase can facilitate incorporation of a C opposite an abasic site (i.e., resulting in incorporation of a G at the abasic site, completing the C-to-G base editing event). ATTORNEY DOCKET NO. 180802-47402 / PCT
[0786] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0787] A base editor comprising a cytidine deaminase as a domain can deaminate a target C in any polynucleotide, including DNA, RNA and DNA-RNA hybrids.
[0788] In some embodiments, a cytidine deaminase of a base editor comprises all or a portion (e.g., a functional portion) of an apolipoprotein B mRNA editing complex (APOBEC) family deaminase. APOBEC is a family of evolutionarily conserved cytidine deaminases. Members of this family are C-to-U editing enzymes. The N-terminal domain of APOBEC like proteins is the catalytic domain, while the C-terminal domain is a pseudocatalytic domain. More specifically, the catalytic domain is a zinc dependent cytidine deaminase domain and is important for cytidine deamination. APOBEC family members include APOBEC 1, APOBEC2, APOBEC3A, APOBEC3B, APOBEC3C, APOBEC3D (“APOBEC3E” now refers to this), APOBEC3F, APOBEC3G, APOBEC3H, APOBEC4, and Activation-induced (cytidine) deaminase.
[0789] Other exemplary deaminases that can be fused to Cas9 according to aspects of this disclosure are provided below. In embodiments, the deaminases are activation-induced deaminases (AID). It should be understood that, in some embodiments, the active domain of the respective sequence can be used, e.g., the domain without a localizing signal (nuclear localization sequence, without nuclear export signal, cytoplasmic localizing signal).
[0790] Some aspects of the present disclosure are based on the recognition that modulating the deaminase domain catalytic activity of any of the fusion proteins or complexes described herein, for example by making point mutations in the deaminase domain, affect the processivity of the fusion proteins (e.g., base editors) or complexes. For example, mutations that reduce, but do not eliminate, the catalytic activity of a deaminase domain within a base editing fusion protein or complexes can make it less likely that the deaminase domain will catalyze the deamination of a residue adjacent to a target residue, thereby narrowing the deamination window. The ability to narrow the deamination window can prevent unwanted deamination of residues adjacent to specific target residues, which can reduce or prevent off-target effects.
[0791] In some embodiments, an APOBEC deaminase incorporated into a base editor can comprise one or more mutations selected from the group consisting of R33A, K34A, E63A, H102P, D104N, H121R, H122R, H122L, D124N; R126A, R126E, R118A, W90A, W90Y, and R132E of rAPOBECl; D316R, D317R, R320A, R320E, R313A, W285A, W285Y, and R326E of hAPOBEC3G; and any alternative mutation at the corresponding position, or one or more corresponding mutations in another APOBEC deaminase. In some embodiments, an APOBEC deaminase incorporated into a base editor can comprise one or more combinations of mutations selected from K34A, H122L, and D124N (AALN); H102P and D104N (evoFERNY derived from FERNY); W90Y and R126E (YE1); W90Y and R132E (YE2); R126E and R132E (EE); ATTORNEY DOCKET NO. 180802-47402 / PCT
[0792] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0793] W90Y, R126E, and R132E (YEE), or rAPOBECl; and any alternative mutation at the corresponding positions, or one or more corresponding mutations in another APOBEC deaminase.
[0794] A number of modified cytidine deaminases are commercially available, including, but not limited to, SaBE3, SaKKH-BE3, VQR-BE3, EQR-BE3, VRER-BE3, YE1-BE3, EE-BE3, YE2-BE3, and YEE-BE3, which are available from Addgene (plasmids 85169, 85170, 85171, 85172, 85173, 85174, 85175, 85176, 85177). In some embodiments, a deaminase incorporated into a base editor comprises all or a portion (e.g., a functional portion) of an APOBEC1 deaminase.
[0795] In some embodiments, the fusion proteins or complexes of the disclosure comprise one or more cytidine deaminase domains. In some embodiments, the cytidine deaminases provided herein are capable of deaminating cytosine or 5-methylcytosine to uracil or thymine. In some embodiments, the cytidine deaminases provided herein are capable of deaminating cytosine in DNA. The cytidine deaminase may be derived from any suitable organism. In some embodiments, the cytidine deaminase is a naturally-occurring cytidine deaminase that includes one or more mutations corresponding to any of the mutations provided herein. One of skill in the art will be able to identify the corresponding residue in any homologous protein, e.g., by sequence alignment and determination of homologous residues. Accordingly, one of skill in the art would be able to generate mutations in any naturally-occurring cytidine deaminase that corresponds to any of the mutations described herein. In some embodiments, the cytidine deaminase is from a prokaryote. In some embodiments, the cytidine deaminase is from a bacterium. In some embodiments, the cytidine deaminase is from a mammal (e.g., human).
[0796] In some embodiments, the cytidine deaminase comprises an amino acid sequence that is at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% identical to any one of the cytidine deaminase amino acid sequences set forth herein. It should be appreciated that cytidine deaminases provided herein may include one or more mutations (e.g., any of the mutations provided herein). Some embodiments provide a polynucleotide molecule encoding the cytidine deaminase nucleobase editor polypeptide of any previous aspect or as delineated herein. In some embodiments, the polynucleotide is codon optimized.
[0797] In embodiments, a fusion protein of the disclosure comprises two or more nucleic acid editing domains.
[0798] Details of C to T nucleobase editing proteins are described in International PCT Application No. PCT / US2016 / 058344 (WO2017 / 070632) and Komor, A. C., eta!., “Programmable editing of a target base in genomic DNA without double-stranded DNA ATTORNEY DOCKET NO. 180802-47402 / PCT
[0799] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0800] cleavage” Nature 533, 420-424 (2016), the entire contents of which are hereby incorporated by reference. Further non-limiting examples of C to T nucleobase editing proteins are described in PCT Applications No. PCT / US2020 / 062428 and PCT / US2019 / 033848, the entire contents of which are hereby incorporated by reference.
[0801] Cytidine Adenosine Base Editors (CABEs)
[0802] In some embodiments, a base editor described herein comprises an adenosine deaminase variant that has increased cytidine deaminase activity. Such base editors may be referred to as “cytidine adenosine base editors (CABEs)” or “cytosine base editors derived from TadA* (CBE-Ts),” and their corresponding deaminase domains may be referred to as “TadA* acting on DNA cytosine (TADC)” domains or TadA-derived cytidine deaminases (TadA-CD). Base editors containing adenosine deaminase variants having both cytidine deaminase and adenosine deaminase activity (i.e., TadA-Dual deaminases) may be referred to as TadA-based dual editors (TadDE). In some instances, an adenosine deaminase variant has both adenine and cytosine deaminase activity (i.e., is a dual deaminase). In some embodiments, the adenosine deaminase variants deaminate adenine and cytosine in DNA. In some embodiments, the adenosine deaminase variants deaminate adenine and cytosine in single-stranded DNA. In some embodiments, the adenosine deaminase variants deaminate adenine and cytosine in RNA. In some embodiments, the adenosine deaminase variant predominantly deaminates cytosine in DNA and / or RNA (e.g., greater than 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 99% of all deaminations catalyzed by the adenosine deaminase variant, or the number of cytosine deaminations catalyzed by the variant is about or at least about 2-fold, 3 -fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 25-fold, 50-fold, 75-fold, 100-fold, 500-fold, or 1,000-fold greater than the number adenine deaminations catalyzed by the variant). In some embodiments, the adenosine deaminase variant has approximately equal cytosine and adenosine deaminase activity (e.g., the two activities are within about 10% or 20% of each other). In some embodiments, the adenosine deaminase variant has predominantly cytosine deaminase activity, and little, if any, adenosine deaminase activity. In some embodiments, the adenosine deaminase variant has cytosine deaminase activity, and no significant or no detectable adenosine deaminase activity. In some embodiments, the target polynucleotide is present in a cell in vitro or in vivo. In some embodiments, the cell is a bacteria, yeast, fungi, insect, plant, or mammalian cell.
[0803] Examples of adenosine deaminase variants having increased cytidine deaminase activity include those described in International Patent Application Publications No. WO 2024 / 040083 and WO ATTORNEY DOCKET NO. 180802-47402 / PCT
[0804] ELECTRONIC DEPOSIT DATE: December 10, 2025
[0805] 2022 / 204574, the disclosures of which are hereby incorporated by reference in their entireties for all purposes.
[0806] In some embodiments, the CABE comprises a bacterial TadA deaminase variant (e.g., ecTadA). In some embodiments, the CABE comprises a truncated TadA deaminase variant. In some embodiments, the CABE comprises a fragment of a TadA deaminase variant. In some embodiments, the CABE comprises a TadA*8.20 variant.
[0807] In some embodiments, an adenosine deaminase variant of the disclosure is a TadA adenosine deaminase comprising one or more alterations that increase cytosine deaminase activity (e.g, at least about 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold or more increase) while maintaining adenosine deaminase activity (e.g., at least about 30%, 40%, 50% or more of the activity of a reference adenosine deaminase (e.g., TadA*8.20 or TadA*8.19)). In some instances, the adenosine deaminase variant comprises one or more alterations that increase cytosine deaminase activity (e.g., at least about 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold or more increase) relative to the activity of a reference adenosine deaminase and comprise undetectable adenosine deaminase activity or adenosine deaminase activity that is less than 30%, 20%, 10%, or 5% of that of a reference adenosine deaminase. In some embodiments, the reference adenosine deaminase is Tad A* 8.20 or TadA* 8.19.
[0808] In some embodiments, the adenosine deaminase variant is an adenosine deaminase comprising two or more alterations at an amino acid position selected from the group consisting of 2, 4, 6, 8, 13, 17, 23, 27, 29, 30, 47, 48, 49, 67, 76, 77, 82, 84, 96, 100, 107, 112, 114, 115, 118, 119, 122, 127, 142, 143, 147, 149, 158, 159, 162 165, 166, and 167, of an amino acid sequence having at least about 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or greater identity to SEQ ID NO: 1, or a corresponding alteration in another deaminase.
[0809] In some embodiments, the adenosine deaminase variant is an adenosine deaminase comprising one or more alterations selected from the group consisting of S2H, V4K, V4S, V4T, V4Y, F6G, F6H, F6Y, H8Q, R13G, T17A, T17W, R23Q, E27C, E27G, E27H, E27K, E27Q, E27S, E27G, P29A, P29G, P29K, V30F, V30I, R47G, R47S, A48G, I49K, I49M, I49N, I49Q, I49T, G67W, I76H, I76R, I76W, Y76H, Y76R, Y76W, F84A, F84M, H96N, G100A, G100K, T111H, G112H, Al 14C, G115M, Ml 18L, H122G, H122R, H122T, N127I, N127K, N127P, A142E, R147H, A158V, Q159S, A162C, A162N, A162Q, ...
Claims
ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025CLAIMSWhat is claimed is:
1. A method of editing a nucleobase of a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide, the method comprising contacting the GALT polynucleotide with a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor comprising a deaminase domain capable deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein:a) the deaminase domain comprises an amino acid sequence with at least 85% sequence identity to a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; and / or b) the guide RNA comprises a spacer, wherein the spacer comprises at least 19 contiguous nucleotides of a sequence selected from the group consisting of:UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703),, UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
2. A method for treating galactosemia in a subject in need thereof, wherein the subject comprises a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide encoding a GALT polypeptide having a Q188R mutation, the method comprising: administering to the subject a base editor system comprising a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor comprising a deaminase domain capable of deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein:ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025a) the deaminase domain comprises an amino acid sequence with at least 85% sequence identity to a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; and / or b) the guide RNA comprises a spacer, wherein the spacer comprises at least 19 contiguous nucleotides of a sequence selected from the group consisting of:UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
3. The method of claim 2 further comprising administering to the subject a lipid nanoparticle comprising the base editor system.
4. A method of editing a nucleobase of a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide, the method comprising contacting the GALT polynucleotide with a guide RNA and mRNA encoding a base editor comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52, and a nucleic acid programmable DNA binding protein (napDNAbp) domain, wherein the napDNAbp is an SpCas9, an SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the group consisting of: NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein the guide RNA comprises a spacer, wherein the spacer consists of a sequence selected from the group consisting of:UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 20255. A method of treatment comprising administering to a subject in need thereof, wherein the subject comprises a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide encoding a GALT polypeptide having a Q188R mutation, a lipid nanoparticle (LNP) comprising: a guide RNA and mRNA encoding a base editor comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52, and a nucleic acid programmable DNA binding protein (napDNAbp) domain, wherein the napDNAbp is an SpCas9, an SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the group consisting of: NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein the guide RNA comprises a spacer, wherein the spacer consists of a sequence selected from the group consisting of: UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705);UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793, CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
6. The method of any one of claims 1-5, wherein the guide RNA comprises a sequence selected from the group consisting of: mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791);mC smC smC sUUACCCGGCAGUGGGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 728);ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025mC smC smUsUACCCGGCAGUGGGGGUGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 729); mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 730); mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 731); mUsmAsmC s CCGGCAGUGGGGGUGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 732); mCsmUsmGsACACCCUUACCCGGCAGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 733); mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 734); mGsmAsmCsACCCUUACCCGGCAGUGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 735); mAsmC smAs CCCUUACCCGGCAGUGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 736); mC smC smC sUUACCCGGCAGUGGGGGUGUUUUAGUACUCUGUAAUGAAAAUUACAGAAUCUACU AAAACAAGGCAAAAUGCCGUGUUUAUCUCGUCAACUUGUUGGCGAGAUsmUsmUsmU (SEQ ID NO: 737);mAsmC smC s CGGCAGUGGGGGUGGGGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UCCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCmUsmUsmUsU (SEQ ID NO: 738); mA s mC s mA s C C CUUAC C CGGC AGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 739); mCsmCsmCsUUACCCGGCAGUGGGGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 740); mCsmCsmUsUACCCGGCAGUGGGGGUGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 741);mC s mU s mG s AC AC C CUUAC C CGGC AGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 742);ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025 mCsmUsmUsACCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 743);mG s mA s mC s AC C CUUAC C CGGC AGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 744); and mUsmGsmAsCACCCUUACCCGGCAGUGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 745);wherein “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS).
7. A method for producing a cell, the method comprising editing a GALT polynucleotide in a cell according to the method of claim 1 or claim 4.
8. A lipid nanoparticle (LNP) comprising:A) a guide RNA, or a polynucleotide encoding the guide RNA, wherein the guide RNA comprises a spacer comprising a nucleotide sequence selected from UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), and those listed in Table 1;B) an mRNA molecule, or a polynucleotide encoding the mRNA molecule, wherein the mRNA molecule encodes a base editor polypeptide comprising a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain, wherein the deaminase domain comprises an amino acid sequence with at least 90% sequence identity to a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; andC) an ionizable lipid according to any one of the following formulas, or a pharmaceutically acceptable salt thereof:i) a compound of Formula (I):J3AkR1L2O Y3O.. I Y1JI JL ^X2R^O^Y2O X1X3R1' 1 zL1x>> L2' °(I)wherein:ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025each L1and L1'is independently -C(O)- or -OC(O)-;each L2and L2'is independently an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;L3is a covalent bond, -O-, -C(O)O-, -OC(O)-, or -OC(O)O-;O-Ral_LCyA_cA MR1is optionally substituted C1-20 aliphatic, 1, or O-Ra;LCyAis a covalent bond or an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;CyAis an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, 1-adamantyl, 2-adamantyl, sterolyl, and phenyl;LRais an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;each Raand R1is independently optionally substituted C1-20 aliphatic;Y1is -C(O)- or -C(O)O-;Y2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-6 hydrocarbon chain;Y3is optionally substituted C1-20 aliphatic;X1is a covalent bond, -O-, or -NR-;X2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain, wherein 1-3 methylene units are optionally and independently replaced with -O- or -NR-;X3is hydrogen or an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; andeach R is independently hydrogen or optionally substituted C1-6 aliphatic;ii) a compound of formula A’:. L3O RL^o R / X2L2RV bk ^R"A’ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025or its N-oxide, or a pharmaceutically acceptable salt thereof, whereinL1is absent, C1-6 alkylenyl, or C2-6 heteroalkylenyl;each L2is independently optionally substituted C2-15 alkylenyl, or optionally substituted C3-15 heteroalkyl enyl;L is Ci-10 alkylenyl, or C2-10 heteroalkyl enyl;X2is -OC(O)-, -C(O)O-, or -OC(O)O-;X is absent, -OC(O)-, -C(O)O-, or -OC(O)O-;Oy^oAR6L3a[O'" ^Ra%vkR” is hydrogen,O Ra, O R6, or an optionally substituted group selected from C6-20 aliphatic, 3- to 12-membered cycloaliphatic, 7- to 12-membered bridged bicyclic comprising 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, 1- adamantyl, 2-adamantyl, sterolyl, and phenyl;each of R and Rais independently hydrogen, or an optionally substituted group selected from Ce- 20 aliphatic, 3- to 12-membered cycloaliphatic, 7- to 12-membered bridged bicyclic comprising 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, 1- adamantyl, 2-adamantyl, sterolyl, and phenyleach of L3and L3ais independently absent, optionally substituted Ci-io alkylenyl, or optionally substituted C2-10 heteroalkylenyl;R1is hydrogen, optionally substituted phenyl, optionally substituted 3- to 7-membered cycloaliphatic, optionally substituted 3- to 7-membered heterocyclyl comprising 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, optionally substituted 5- to 6-membered monocyclic heteroaryl comprising 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, optionally substituted 8- to 10-membered bicyclic heteroaryl comprising 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, -OR2, -C(O)OR2, -C(O)SR2, -OC(O)R2, -OC(O)OR2, -CN,-N(R2)2, -C(O)N(R2)2, -S(O)2N(R2)2, -NR2C(O)R2, -OC(O)N(R2)2, -N(R2)C(O)OR2, -NR2S(O)2R2, -NR2C(O)N(R2)2, -NR2C(S)N(R2)2, -NR2C(NR2)N(R2)2, -NR2C(CHR2)N(R2)2, -N(OR2)C(O)R2, -N(OR2)S(O)2R2, -N(OR2)C(O)OR2, -N(OR2)C(O)N(R2)2, -N(OR2)C(S)N(R2)2, -N(OR2)C(NR2)N(R2)2, -N(OR2)C(CHR2)N(R2)2, -C(NR2)N(R2)2, -C(NR2)R2, -C(O)N(R2)OR2, -C(R2)N(R2)2C(O)OR2, -CR2(R3)2, -OP(O)(OR2)2, or -P(O)(OR2)2; orATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025R2R / \2R1is ° O, or a ring selected from 3- to 7-membered cycloaliphatic and 3- to 7- membered heterocyclyl comprising 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, wherein the cycloaliphatic or heterocyclyl ring is optionally substituted with 1-4 R2or R3groups;each R2is independently hydrogen, oxo, -CN, -NO2, -OR4, -S(O)2R4, -S(O)2N(R4)2, -(CH2)n-R4, or an optionally substituted group selected from C1-6 aliphatic, phenyl, 3- to 7-membered cycloaliphatic, 5- to 6-membered monocyclic heteroaryl comprising 1-4 heteroatoms independently selected from nitrogen, oxygen, and sulfur, and 3- to 7-membered heterocyclyl comprising 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; or two occurrences of R2, taken together with the atom(s) to which they are attached, form optionally substituted 4- to 7-membered heterocyclyl comprising 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;each R3is independently -(CH2)n-R4; ortwo occurrences of R3, taken together with the atom(s) to which they are attached, form optionally substituted 5- to 6-membered heterocyclyl comprising 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;each R4is independently hydrogen, -OR5, -N(R5)2, -OC(O)R5, -OC(O)OR5, -CN, -C(O)N(R5)2, -NR5C(O)R5, -OC(O)N(R5)2, -N(R5)C(O)OR5, -NR5S(O)2R5, -NR5C(O)N(R5)2,R5R5 / \R5*-Z —-NR5C(S)N(R5)2, -NR5C(NR5)N(R5)2, or O O ■each R5is independently hydrogen, or optionally substituted C1-6 aliphatic; ortwo occurrences of R5, taken together with the atom(s) to which they are attached, form optionally substituted 4- to 7-membered heterocyclyl comprising 0-1 additional heteroatom selected from nitrogen, oxygen, and sulfur;each R6is independently C4-12 aliphatic; andeach n is independently 0 to 4; oriii) a compound of Formula I:ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025or a pharmaceutically acceptable salt thereof, wherein:L1is a covalent bond, -C(O)-, or -OC(O)-;L2is a covalent bond, an optionally substituted bivalent saturated or unsaturated, straight or HH-cyA-Hbranched C1-C12 hydrocarbon chain, orm m;CyAis an optionally substituted ring selected from phenylene and 3- to 7-membered saturated or partially unsaturated carbocyclene;each m is independently 0, 1, or 2;L3is a covalent bond, -C(O)-, -C(O)O-, -OC(O)-, -O-, or -OC(O)O-;R, is H4p-C', B, an optionally substituted saturated or unsaturated, straight or branchedC1-C20 hydrocarbon chain wherein 1-3 methylene units are optionally and independently, O-A1HL2replaced with -O- or -NR-, or O_A;CyBis an optionally substituted ring selected from 3- to 12-membered saturated or partiallyunsaturated carbocyclyl, 1-adamantyl, 2-adamantyl,, sterolyl, and phenyl; p is 0, 1, 2, or 3;each L4is independently a bivalent saturated or unsaturated, straight or branched Ci-Ce hydrocarbon chain;each A1and A2is independently an optionally substituted C1-C20 aliphatic or -L5-R5;or A1and A2, together with their intervening atoms, may form an optionally substituted ring:#K2) X.wherex is selected from 1 or 2; and# represents the point of attachment to L4;each L5is independently a bivalent saturated or unsaturated, straight or branched C1-C20 hydrocarbon chain, wherein 1-3 methylene units are optionally and independently replaced with -O- or -NR-;ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025each R5is independently an optionally substituted group selected from a 5- to 10-membered aryl ring or a 3- to 8-membered carbocyclic ring;X1is a covalent bond, -O-, or -NR-;X2is a covalent bond or an optionally substituted, bivalent saturated or unsaturated, straight or branched, C1-C12 hydrocarbon chain, wherein 1-3 methylene units are optionally and independently replaced with -O-, -NR-, or -Cyc-;Cycis an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclene, phenylene, 3- to 7-membered saturated or partially unsaturated heterocyclene having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, and 5- to 6-membered heteroarylene having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur;X3is hydrogen or an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered saturated or partially unsaturated heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; andeach R is independently hydrogen or an optionally substituted Ci-Ce aliphatic group;FH-cyBprovided that when L is a covalent bond, then R must bep9. A lipid nanoparticle (LNP) comprising:A) a guide RNA, or a polynucleotide encoding the guide RNA, wherein the guide RNA is capable of directing a base editor polypeptide to alter a nucleotide in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide;B) an mRNA molecule, or a polynucleotide encoding the mRNA molecule, wherein the mRNA molecule encodes a base editor polypeptide comprising a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain, wherein the deaminase domain comprises an amino acid sequence with at least 90% sequence identity to a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; andC) an ionizable lipid according to Formula (I), or a pharmaceutically acceptable salt thereof:ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025wherein:each L1and L1'is independently -C(O)- or -OC(O)-;each L2and L2'is independently an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;L3is a covalent bond, -O-, -C(O)O-, -OC(O)-, or -OC(O)O-;O-Rai_LCyA_cA |“LRa~(R1is optionally substituted C1-20 aliphatic, ‘y, or O-Ra;LCyAis a covalent bond or an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;CyAis an optionally substituted ring selected from 3- to 7-membered saturated or partially unsaturated carbocyclyl, 1-adamantyl, 2-adamantyl, sterolyl, and phenyl;LRais an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain;each Raand R1is independently optionally substituted C1-20 aliphatic;Y1is -C(O)- or -C(O)O-;Y2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-6 hydrocarbon chain;Y3is optionally substituted C1-20 aliphatic;X1is a covalent bond, -O-, or -NR-;X2is an optionally substituted bivalent saturated or unsaturated, straight or branched C1-12 hydrocarbon chain, wherein 1-3 methylene units are optionally and independently replaced with -O- or -NR-;X3is hydrogen or an optionally substituted ring selected from 3- to 7- membered saturated or partially unsaturated carbocyclyl, phenyl, 3- to 7-membered heterocyclyl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur, or 5- to 6-membered heteroaryl having 1-3 heteroatoms independently selected from nitrogen, oxygen, and sulfur; andeach R is independently hydrogen or optionally substituted C1-6 aliphatic.ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 202510. The LNP of claim 8 or claim 9, wherein the lipid nanoparticle comprises an ionizable lipid having the following structure, or a pharmaceutically acceptable salt thereof:
11. The LNP of claim 8 or claim 9, wherein the LNP comprises a molar ratio of ionizable nitrogen atoms in an ionizable lipid to the total negative charge in the payload (N: P ratio) of between about 5: 1 and 7:1.
12. The LNP of claim 11, wherein the LNP comprises an N: P ratio of about 6: 1.
13. The LNP of claim 8 or claim 9, wherein the guide RNA comprises a scaffold with the following nucleotide sequence:GUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCmGmGmCAAGUUAAAAUAAGGCUAGUC CGUU AU C AmAmCmU mU mGmAmAmAm AmAmGmU mGmGmCm AmCmCmGm AmGmU mCmGmGmU mGm CmUsmUsmUsmU (SEQ ID NO: 794), wherein “mN” indicates a 2'-0Me modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS).
14. The LNP of claim 8 or claim 9, wherein the guide RNA comprises a spacer comprising the nucleotide sequence UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705) or UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793).
15. The LNP of claim 8 or claim 9, wherein the guide RNA comprises a nucleotide sequence selected from those listed in Table 2.
16. The LNP of claim 8 or claim 9, wherein the guide RNA comprises one of the following nucleotide sequences: mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746);ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025 mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791);wherein “mN” indicates a 2'-OMe modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS).
17. The LNP of claim 9, wherein the guide RNA comprises a spacer selected from those listed in Table 1.
18. The LNP of claim 8 or claim 9, wherein the napDNAbp is an SpCas9, an SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the group consisting of NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T.
19. The LNP of claim 18, wherein the napDNAbp is a nickase.
20. The LNP of claim 8 or claim 9, wherein the base editor polypeptide comprises an amino acid sequence with at least about 90% identity to a sequence listed in Table 2.
21. A lipid nanoparticle (LNP) comprising:A) a guide RNA having a nucleotide sequence selected from the group consisting of: mUsmUsmAsCCCGGCAGUGGGGGUGGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmC mGmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmC mAmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 746); and mUsmUsmAsCCCGGCAGUGGGGGUGGUUUUAGAmGmCmCmGmGmCmGmGmAmAmAmCmGmCmCm GmGmCAAGUUAAAAUAAGGCUAGUCCGUUAUCAmAmCmUmUmGmAmAmAmAmAmGmUmGmGmCm AmCmCmGmAmGmUmCmGmGmUmGmCmUsmUsmUsmU (SEQ ID NO: 791), wherein “mN” indicates a 2'-0Me modification of the nucleotide “N”, and “Ns” indicates that the nucleotide “N” is linked to the following nucleotide by a phosphorothioate (PS);B) a base editor polypeptide, or a polynucleotide encoding the base editor polypeptide, wherein the base editor polypeptide comprises a nucleic acid programmable DNA binding protein (napDNAbp) domain and a deaminase domain comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; andATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025C) an ionizable lipid having the following structure, or a pharmaceutically acceptable saltthereof:
22. The lipid nanoparticle of claim 21, wherein the base editor polypeptide comprises an amino acid sequence with at least about 90% identity to a sequence listed in Table 2.
23. A base editor system comprising a guide RNA, or a polynucleotide encoding the guide RNA, and a base editor comprising a deaminase domain capable of deaminating cytidine in DNA and a nucleic acid programmable DNA binding protein (napDNAbp) domain, or one or more polynucleotides encoding the base editor, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of a GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein:a) the deaminase domain comprises an amino acid sequence with at least 85% sequence identity to a deaminase selected from the group consisting of 6b, RrA3f, and T1.52; and / or b) the guide RNA comprises a spacer, wherein the spacer comprises at least 19 contiguous nucleotides of a sequence selected from the group consisting of:UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
24. A base editor system comprising a guide RNA and mRNA encoding a base editor comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52, and a nucleic acid programmable DNA binding protein (napDNAbp) domain, wherein the napDNAbp is an SpCas9, an SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the groupATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025consisting of: NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein the guide RNA comprises a spacer, wherein the spacer consists of a sequence selected from the group consisting of: UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705), UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
25. A polynucleotide or set of polynucleotides encoding the base editor system of claim 23 or claim 24, or a component thereof.
26. A lipid nanoparticle comprising the base editor system of claim 23 or claim 24, or the polynucleotide or set of polynucleotides of claim 25.
27. A method for modifying a target nucleobase in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide in a cell, the method comprising contacting the cell with the lipid nanoparticle of any one of claims 8-22 or claim 26, the base editor system of claim 23 or claim 24, or the polynucleotide or set of polynucleotides of claim 25, thereby modifying the target nucleobase in the GALT polynucleotide.
28. The method of claim 27, wherein the cell is in a subject.
29. A method of treating a disease in a subject in need thereof, wherein the disease is associated with a pathogenic mutation in a galactose- 1 -phosphate uridylyltransferase (GALT) polynucleotide in the subject, the method comprising administering to the subject the lipid nanoparticle of any one of claims 8-22 or claim 26, the base editor system of claim 23 or claim 24, or the polynucleotide or set of polynucleotides of claim 25, thereby altering a target nucleobase in the GALT polynucleotide in the subject.ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 202530. The method of claim 29, wherein the disease is galactosemia.
31. A composition comprising: a guide RNA and mRNA encoding a base editor comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52, and a nucleic acid programmable DNA binding protein (napDNAbp) domain, wherein the napDNAbp is an spCas9, SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the group consisting of: NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein the guide RNA comprises a spacer, wherein the spacer consists of a sequence selected from the group consisting of: UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705);UUACCCGGCAGUGGGGGUG (SEQ ID NO: 793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
32. A lipid nanoparticle (LNP) composition comprising: a guide RNA and mRNA encoding a base editor comprising a deaminase domain comprising a deaminase selected from the group consisting of 6b, RrA3f, and T1.52, and a nucleic acid programmable DNA binding protein (napDNAbp) domain, wherein the napDNAbp is an spCas9, SpRY, an SpRYc, an saCas9, or an spCas9 variant having specificity for a protospacer-adjacent motif (PAM) selected from the group consisting of: NG, NRRH, and NRTH, wherein “N” indicates A, C, G, or T, “R” indicates A or G, “Y” indicates C or T, and “H” indicates A, C, or T, wherein said guide RNA targets said base editor to effect an alteration to a nucleobase in codon 188 of the GALT polynucleotide encoding an R amino acid, wherein the alteration to the nucleobase changes codon 188 to a codon encoding a Q amino acid, and wherein the guide RNA comprises a spacer, wherein the spacer consists of a sequence selected from the group consisting of:UUACCCGGCAGUGGGGGUGG (SEQ ID NO: 705); UUACCCGGCAGUGGGGGUG (SEQ ID NO:ATTORNEY DOCKET NO. 180802-47402 / PCTELECTRONIC DEPOSIT DATE: December 10, 2025793), CCCUUACCCGGCAGUGGGGG (SEQ ID NO: 699), CCUUACCCGGCAGUGGGGGU (SEQ ID NO: 701), CUUACCCGGCAGUGGGGGUG (SEQ ID NO: 703), UACCCGGCAGUGGGGGUGGG (SEQ ID NO: 707), CUGACACCCUUACCCGGCAG (SEQ ID NO: 709), UGACACCCUUACCCGGCAGU (SEQ ID NO: 711), GACACCCUUACCCGGCAGUG (SEQ ID NO: 713), ACACCCUUACCCGGCAGUGG (SEQ ID NO: 715), CCCUUACCCGGCAGUGGGGGU (SEQ ID NO: 717), and ACCCGGCAGUGGGGGUGGGG (SEQ ID NO: 719).
33. A guide polynucleotide comprising a spacer or gRNA nucleotide sequence listed in Table 134. A pharmaceutical composition comprising an effective amount of the lipid nanoparticle of any one of claims 8-22 or claim 26, the base editor system of claim 23 or claim 24, the polynucleotide or set of polynucleotides of claim 25, the composition of claim 31 or claim 32, or the guide polynucleotide of claim 33, and a pharmaceutically acceptable excipient.
35. A kit comprising the lipid nanoparticle of any one of claims 8-22 or claim 26, the base editor system of claim 23 or claim 24, the polynucleotide or set of polynucleotides of claim 25, the composition of claim 31 or claim 32, the guide polynucleotide of claim 33, or the pharmaceutical composition of claim 34, disposed within a container.