Unlock instant, AI-driven research and patent intelligence for your innovation.

Typing and assembling discontinuous genomic elements

A genome and genome sequence technology, applied in the field of typing and assembling discontinuous genomic elements and diploid sequencing, which can solve the problems of lack of haplotypes and typing of discontinuous genomic elements

Pending Publication Date: 2018-06-08
LUDWIG INST FOR CANCER RES LTD
View PDF7 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although there are methods for haplotyping entire diploid genomes (Selvaraj et al., NBT 2013, Dec;31(12):1111-8) or loci of interest (Selvaraj et al., BMC Genomics 2015 Nov5; 16:900) methods, but methods for haplotyping discrete genomic elements into chromosome-spanning haplotypes are still lacking

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Typing and assembling discontinuous genomic elements
  • Typing and assembling discontinuous genomic elements
  • Typing and assembling discontinuous genomic elements

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0107] In this example, it was examined whether whole-exome haplotype phasing could be achieved using data sets obtained from simulated genomic proximity junction assays such as TCC or Hi-C or in situ Hi-C. More specifically, to show that whole-exome haplotype phasing is feasible, data were obtained from Hi-C whole-exome proximity joining experiments performed on chromosome 1 of GM12878 cells. Then, at least one fragment containing the exon region in the two sequence read pairs is retained. Thus, this dataset represents a simulated whole-exome proximity junction dataset.

[0108] The data was then simulated using the algorithm described above and used to examine its ability to phase exonic SNVs to a single haplotype structure. To this end, two metrics were defined - completeness as the length of the haplotype block compared to the length of the chromosome and resolution as the phased exosome in the chromosome The subvariant's score. It was found that complete haplotypes cou...

Embodiment 2

[0111] In this example, it was examined whether whole-exome haplotype phasing could be achieved using real datasets obtained from exome-captured proximity joins.

[0112] More specifically, exome capture was performed using proximity junction data from GM12878 cells, followed by sequencing using the methods described above. The exome capture protocol is optimized in-house for fragment length, blocker primers, and oligonucleotide probe binding. As shown in Figure 4, three whole-exome proximity junction libraries were generated. Two of these libraries were generated using a single enzyme (NcoI or XbaI), while the third was generated using a mixture of 6-base cutting enzymes (HindIII, NcoI, XbaI and BamHI, labeled "multienzyme"). After capture and sequencing, these libraries were found to have a clear enrichment of exon sequences (Fig. 4b). These were then sequenced, generating approximately 50-70 million read pairs per library (Fig. 4b).

[0113] These data sets were first us...

Embodiment 3

[0120] In this example, assays were performed to investigate the effect of restriction enzymes selected based on the number of bases covered and phased. Briefly, three libraries were generated using the exome sequencing protocol and the whole-exome Haploseq method described above. For this, NcoI (6-base cleavage enzyme) and DpnI (4-base cleavage enzyme) were used. The result is as Figure 6 shown in . Results showed that 96% of bases were covered at >10x in whole-exome sequencing samples when the average coverage per library was sequenced at 44x. However, if a 6-base cut is used, only about 30% of the bases are covered at 10x or greater. This increased to 50% with 4-base cleaving enzyme. These results again suggest that multi-enzyme data may be more useful for genotyping and potentially haplotyping or de novo assembly applications than single-enzyme data sets.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

This invention relates to methods and kits for typing and assembling discontinuous genomic elements.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to US Provisional Application No. 62 / 234,329, filed September 29, 2015. The entire content of this application is incorporated into this application by reference. technical field [0003] The present invention relates generally to the fields of genetics, molecular and cell biology, and particularly to methods and kits for typing and assembling discrete genomic elements and diploid sequencing. Background technique [0004] Current short-read sequencing (Short-read sequencing) produces genomic data with poor continuity and thus limits de novo genome assembly and deconvolution of diploid haplotypes. In the context of typing, each organism has a defined set of chromosomes containing all of its genetic information. For example, normal human somatic cells are diploid and have two sets of chromosomes, a paternal set and a maternal set in each nucleus. In each individual, the two sets of chr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): C12Q1/6869G06F19/26G16B20/20G16B20/40G16B30/20G16B45/00
CPCC12Q1/68G16B20/00G16B30/00G16B45/00C12Q1/6841G16B30/20G16B20/20G16B20/40C12Q2521/301C12Q2523/101C12Q2563/131C12Q1/6869C12Q1/6876C12Q2600/172
Inventor B.任J.狄克逊S.塞尔瓦拉
Owner LUDWIG INST FOR CANCER RES LTD