Methods and systems for identifying crispr/cas off-target sites

a technology of off-target sites and methods, applied in the field of bioinformatics, can solve the problems that none of these bioinformatics search tools has considered off-target sites, nor provide application-specific primers, etc., and achieve the effects of improving the speed of search, facilitating experimental confirmation of off-target activity, and increasing run times

Inactive Publication Date: 2019-09-26
GEORGIA TECH RES CORP
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0011]Methods and systems for searching genomes for potential CRISPR off-target sites are provided. In preferred embodiments, the methods include ranking the potential off-target sites based on the number and location of mismatches, insertions, and / or deletions in the gRNA guide sequence relative to the genomic DNA sequence at a putative target site in the genome, allowing the selection of better target sites and / or experimental confirmation of off-target sites.
[0017]The report can include a score indicating the likelihood that the guide sequence will direct a CRISPR / Cas system to the DNA sequence and facilitate nuclease cleavage. The score can be used to rank the putative target sites in a list. The score can include additional information from experiments and / or databases, such as ENCODE, about the genomic context. For example, data on the histones, protein binding or confirmation of individual chromosomal regions can indicate if there is less or more likelihood of cleavage. In some embodiments, target cleavage locations including genomic sequences with higher sequence identity to the guide sequence receive a lower score relative to target cleavage locations having genomic sequences with lower sequence identity to the guide sequence. Typically, in such embodiments, increasing numbers of substitutions, deletions, and insertions at the target cleavage location increase the score, as do substitutions, deletions, and / or insertions closer to the PAM. The scoring mechanism and position weights can be changed to alter the scoring to better model certain CRISPR / Cas activities. For example, in some embodiments, the score is increased more for deletion(s) in the genomic sequence relative to the guide sequence (RNA bulges) than for insertions in the genomic sequence relative to the guide sequence (DNA bulges). The score can also reflect that sgRNA bulges are less tolerant to additional base mismatches, and vice versa.
[0023]The genomic sequence(s) can be DNA sequence converted into FASTA or similarly formatted files, then transformed into index entries that have all possible 25 bases-long tags in the DNA sequence. In other embodiments, other tagging schemes can be used including longer and shorter tags. The index entries can be sorted and the results stored as a binary main index file. The main index file can be divided into parts, each representing entries having about 12 nucleotides of the first nucleotides identical. In other embodiments, other lengths of index files may be used. A secondary index file can include the position in the main index file where each part starts added to the end of the index file. Searching genome sequence organized and indexed in such a way can improve the speed of the search, while allowing exhaustive searching. Preferred embodiments utilize index files, though other embodiments could use other index methods, similar expedited search strategies, or provide searching without index files, as done with linear searches through the full sequence space, though these would increase run times. A particular embodiment of the disclosed method is referred to herein as COSMID (CRISPR Off-target Sites with Mismatches, Insertions, and Deletions).
[0024]The disclosed methods and systems can aid the design and optimization of CRISPR guide strands by selecting the preferred target sites with minimum Cas-induced off-target cleavage and facilitate the experimental confirmation of off-target activity by providing both putative off-target sites and primer for testing cleavage that the sites in a CRISPR / Cas system. In some embodiments, the disclosed methods are more exhaustive and / or have a higher sensitivity for identifying putative and / or actual off-target sites than previously known methods or programs.

Problems solved by technology

However, none of these bioinformatics search tools has considered the off-target sites due to insertions or deletions between target DNA and guide RNA sequences, nor provide application-specific primers.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods and systems for identifying crispr/cas off-target sites
  • Methods and systems for identifying crispr/cas off-target sites
  • Methods and systems for identifying crispr/cas off-target sites

Examples

Experimental program
Comparison scheme
Effect test

example

Example 1

CRISPR Guide Strands can Exhibit Off-Target Activity at Similar Levels as On-Target Activity, Even with Mismatches within First 12 Nucleotides

Materials and Methods

[0324]CRISPR Design and Testing

[0325]There were no CRISPR target sites in the human HBB gene sequence with their proximal 12 bases unique in the human genome (Cong, et al., Science, 339:819-823 (2013)); therefore, CRISPR / Cas9 guide strands targeting HBB were chosen by comparing the similar regions in the human hemoglobin δ (HBD) gene. Eight 20-base guide strands were designed to target sites near the sickle mutation in the HBB gene (FIG. 1A), each adjacent to a PAM sequence that contains the canonical trinucleotide NGG. Five guide strands were also designed to target two segments in the human CCRS gene (FIG. 2A), and tested the corresponding CRISPR / Cas9 systems to determine their on-target cleavage and potential off-target activity at the human C-C chemokine receptor type 2 (CCR2) gene. Herein the name of the guid...

example 2

CRISPR-Targeted Loci Showed a Wide Variety of Insertions, Deletions and Point Mutations

Materials and Methods

[0339]Chromosomal Deletion Analysis

[0340]To assay for gross chromosomal deletions, genomic DNA from cells transfected with R-03 was amplified using the HBD forward primer and the reverse primer downstream of the HBB site. Genomic DNA from cells transfected with R-25 or R-30 were similarly amplified using the CCR2 forward and the CCRS reverse primers. Agarose gels were used to confirm that the polymerase chain reaction (PCR) product sizes were consistent with chromosomal deletions between these sites. The R-03, R-25 and R-30 PCR products were cloned and the individual colonies Sanger sequenced and aligned.

[0341]Quantitative PCR

[0342]Quantitative PCR determination of the percentage of HBD-HBB chromosomal deletions. HEK-293 cells were transfected in triplicate with CRISPR plasmids containing guide strands R-02 or R-03, or mock transfected cells. Genomic DNA was harvested using Qu...

example 3

sgRNA Variants Containing Single-Base DNA Bulges Induce Cas9 Cleavage

Materials and Methods

[0351]CRISPR / Cas9 Plasmid Assembly

[0352]DNA oligonucleotides containing a G followed by a 19-nt guide sequence (Table 3) were kinased, annealed to create sticky ends and ligated into the pX330 plasmid that contains the +85 chimeric RNA under the U6 promoter and a Cas9 expression cassette under the CBh promoter (available at Addgene) (Hsu, et al., Nat Biotechnol, 31 (2013)).

TABLE 4Protospacer target sites for the sgRNAs used inExamples 3-8 (Table 4 discloses SEQ ID NOS 35-61,respectively, in order of appearance)StorageGeneIndexProtospacer Target (5′ to 3′) PAMHBBR-01GTGAACGTGGATGAAGTTGG TGGHBBR-03GACGTTCACCTTGCCCCACA GGGHBBR-04GCACGTTCACCTTGCCCCAC AGGHBBR-05GGTCTGCCGTTACTGCCCTG TGGHBBR-06GGTTACTGCCCTGTGGGGCA AGGHBBR-07GAGGTGAACGTGGATGAAGT TGGHBBR-08GCTGTGGGGCAAGGTGAACG TGGEGFPR-19GGTGGTGCAGATGAACTTCA GGGEGFPR-20GACCAGGATGGGCACCACCC CGGCCR5R-25GTGTTCATCTTTGGTTTTGT GGGCCR5R-26GCTGCCGCCCAGTGGGACTT ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Methods and systems for searching genomes for potential CRISPR off-target sites are provided. In preferred embodiments, the methods include identifying possible on- and off-target cleavage sites and / or ranking the potential off-target sites based on the number and location of mismatches, insertions, and / or deletions in the gRNA guide sequence relative to the genomic DNA sequence at a putative target site in the genome. These methods allow for the selection of better target sites and / or experimental confirmation of off-target sites and are an improvement over partial search mechanisms that fail to locate every possible target site.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]This application is a continuation application of U.S. patent application Ser. No. 15 / 114,799, filed Jul. 26, 2016, which was a 371 application of International Application No. PCT / US2015 / 013134 filed Jul. 27, 2015, which claims the benefit of and priority to U.S. Ser. No. 61 / 932,003 filed Jan. 27, 2014 and which is incorporated by reference in its entirety.STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH[0002]This invention was made with government support under Grant PN2EY018244 awarded by the National Institutes of Health. The government has certain rights in the invention.SEQUENCE LISTING[0003]The Sequence Listing submitted on May 13, 2019, as a text file named “GTRC_6478_SL_txt.txt” created on May 13, 2019, and having a size of 271,348 bytes is hereby incorporated by reference pursuant to 37 C.F.R. § 1.52(e)(5).FIELD OF THE INVENTION[0004]The invention is generally directed to bioinformatics methods and systems for identifying CRISPR...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G16B30/00G06F16/2458G06N7/00G06F16/22G16B30/10
CPCG06F16/2468G06F16/2228G06N7/005G16B30/00G16B20/20G16B25/20G16B30/10G06N7/01
Inventor CRADICK, THOMAS JAMESBAO, GANGQIU, PENG
Owner GEORGIA TECH RES CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products