Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof

Inactive Publication Date: 2006-02-02
APPL BIOSYSTEMS INC
View PDF24 Cites 343 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0029] In a further aspect of the invention, a method for identifying nucleic acid sequences that encode at least two interacting proteins is provided, comprising the steps of a) combining (i) a first vector comprising (1) a nucleic acid sequence that encodes a first protein that interacts with a second protein, and (2) a first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, and (ii) a second vector comprising (1) a nucleic acid sequence that encodes the second protein; and (2) a second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, thereby producing a combination, b) optionally maintaining the combination under conditions in which the first protein and the second protein are expressed and interact, c) joining the first vector with the second vector, thereby forming a contiguous nucleic acid sequence that comprises (i) the nucleic acid sequence that encodes a first protein that interacts with a second protein, (ii) the first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, (iii) the nucleic acid sequence that encodes the second protein, and (iv) the second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, d) cleaving the first restriction endonuclease recognition site and the second restriction endonuclease recognition site in the contiguous nucleic acid sequence with restriction endonucleases that cleave the contiguous nucleic acid sequence distally to the restriction endonuclease recognition sites, thereby producing a 5′ end tag and a 3′ end tag of the contiguous nucleic acid sequence, e) joining the 5′ end tag to the 3′ end tag, thereby producing a paired tag, and f) sequencing the paired tag, thereby identifying nucleic acid sequences that encode at least two interacting proteins.
[0030] In an additional embodiment of the invention, provided is a method for identifying nucleic acid sequences that encode at least two interacting proteins comprising the steps of a) combining (i) a first vector comprising (1) a nucleic acid sequence that encodes a first protein that interacts with a second protein, and (2) a first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, and (ii) a second vector comprising (1) a nucleic acid sequence that encodes the second protein, and (2) a second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, thereby producing a combination, b) optionally maintaining the combination under conditions in which the first protein and the second protein are expressed and interact, c) joining the first vector with the second vector, thereby forming a contiguous nucleic acid sequence that comprises (i) the nucleic acid sequence that encodes a first protein that interacts with a second protein, (ii) the first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, (iii) the nucleic acid sequence that encodes the second protein, and (iv) the second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, d) sequencing the contiguous nucleic acid sequence, thereby identifying nucleic acid sequences that encode at least two interacting proteins.
[0031] In another embodiment of the invention, provided is a method for identifying nucleic acid sequences that encode at least two interacting proteins comprising the steps of a) combining, (i) a first vector comprising (1) a nucleic acid sequence that encodes a first protein that interacts with a second protein, and (2) a first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, and (ii) second vector comprising (1) a nucleic acid sequence that encodes the second protein; and (2) a second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, thereby producing a combination, b) optionally maintaining the combination under conditions in which the first protein and the second protein are expressed and interact, c) joining the first vector with the second vector, thereby forming a contiguous nucleic acid sequence that comprises (i) the nucleic acid sequence that encodes a first protein that interacts with a second protein, (ii) the first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, (iii) the nucleic acid sequence that encodes the second protein, and (iv) the second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, d) cleaving the first restriction endonuclease recognition site and the second restriction endonuclease recognition site in the contiguous nucleic acid sequence with restriction endonucleases that cleave the contiguous nucleic acid sequence distally to the restriction endonuclease recognition sites, thereby producing a paired tag comprising a 5′ end tag and a 3′ end tag of the contiguous nucleic acid sequence, e) sequencing the paired tag, thereby identifying nucleic acid sequences that encode at least two interacting proteins.
[0032] In a further embodiment of the invention, provided is a method for identifying a plurality of nucleic acid sequences that encode at least two interacting proteins comprising the steps of a) combining (i) a plurality of first vectors each comprising (1) a nucleic acid sequence that encodes a first protein that interact with a second protein; and (2) a first restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, and (ii) a plurality of second vectors each comprising (1) a nucleic acid sequence that encodes the second protein; and (2) a second restriction endonuclease recognition site specific for a restriction endonuclease that cleaves the nucleic acid sequence fragment distally to the restriction endonuclease recognition site, thereby producing a combination comprising a plurality of first vectors and a plurality of second vectors, b) optionally maintaining the combination under conditions in which the plurality of first vectors encoding a first protein and the p

Problems solved by technology

However, a major problem with the BAC-based approaches is the high cost and operational burden associated with the production of 15,000-25,000 individual BAC subclone libraries, the 15-20% waste associated with re-sequencing the vector, as well as the unavoidable E. coli contamination, the need to deal with transposon and bacteriophage insertions, and the 20-50% waste in redundant sequencing of BAC overlaps.
Although these costs can be reduced by sequencing the BACs at low coverage (using a hybrid BAC / WGS strategy, for example) or by using a pooling strategy, they cannot be eliminated.
The need to generate a physical map by using restriction digest fingerprinting or by complex pooling and sequence based mapping strategies adds additional cost and operational overhead.
However, these methods suffer from the need for large numbers of transformations to be performed: one for each bait to be analyzed against one or more prey molecule.
Therefore, these methods only incrementally increase the efficiency of conventional two-hybrid systems.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof
  • Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof
  • Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof

Examples

Experimental program
Comparison scheme
Effect test

example 1

Whole Genome Assembly

[0098] The whole genome shotgun sequencing, assembly and finishing paradigm is generally the strategy of choice for microbial and fungal genome sequencing. Contrary to the expectations of some, recent advances in the development and application of whole genome assembly (WGA) software, such as Arachne, the Celera Assembler, Phusion, or Jazz, have also demonstrated that it is straightforward to produce a high quality assembly of plant and mammalian genomes using such tools (Mural, Adams et al. 2002; Jaffe, Butler et al. 2003). The cost-advantages and simplicity of the whole genome approach relative to a BAC based or hybrid sequencing strategy argue strongly for its continued development and application in future sequencing projects. One problem with the BAC based approaches is the high cost and operational burden associated with the production of 15,000-25,000 individual BAC subclone libraries, the 15-20% waste associated with re-sequencing the vector as well as ...

example 2

Paired Genome Sequence Tag Method

[0118] Typically, a collection of genomic DNA fragments is produced by fragmentation using one or more restriction endonucleases, or by shearing, with subsequent repair of the ends to produce blunt-ended fragments. Fragments of a specific size range are purified by gel electrophoresis, followed by extraction from a gel slice by electroelution, extraction with chaotropic salts and glass beads, or enzymatic dissolution of the gel followed by phenol extraction and ethanol precipitation. The fragments are ligated to an adapter containing a type IIs restriction enzyme site juxtaposed to a blunt end and the resulting circularized products are subsequently digested with the enzyme to create short tags specific to the ends of the DNA fragment. These tags, which remain connected to each other through the process via the adapter are ligated to form concatemers of paired tags. The concatemers are sequenced and the paired tags extracted from the sequence. The p...

example 3

Paired Tag Two-Hybrid

[0121] This method provides a linked pair of bait and prey molecules that derive from cells that screen positive for an interaction. The bait-prey pairs can be sequenced individually, for example, from plasmids, PCR or rolling circle amplification (RCA) products, or from short sequence paired tags. The paired tags can be catenated into longer molecules and sequenced using conventional unidirectional or paired end sequencing methods to generate approximately a 6-10 tag pair sequences from each sequencing read. The linking of bait and prey tags allows thousands of baits to be screened against thousands of preys (e.g., complete libraries of both) simultaneously in a single transformation and screening experiment. The catenation of tag pairs allows approximately 10 or more pairs to be characterized for the cost of one hi-copy plasmid sequencing read (depending on the read length). By using a paired end sequencing approach, approximately 12-20 tag pairs can be gener...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

PropertyMeasurementUnit
Nucleic acid sequenceaaaaaaaaaa
Login to View More

Abstract

Methods for producing a paired tag from a nucleic acid sequence are provided in which the paired tag comprises the 5′ end tag and 3′ end tag of the nucleic acid sequence. In one embodiment, the nucleic acid sequence comprises two restriction endonuclease recognition sites specific for a restriction endonuclease that cleaves the nucleic acid sequence distally to the restriction endonuclease recognition sites. In another embodiment, the nucleic acid sequence further comprises restriction endonuclease recognition sites specific for a rare cutting restriction endonuclease. Methods of using paired tags are also provided. In one embodiment, paired tags are used to characterize a nucleic acid sequence. In a particular embodiment, the nucleic acid sequence is a genome. In one embodiment, the characterization of a nucleic acid sequence is karyotyping. Alternatively, in another embodiment, the characterization of a nucleic acid sequence is mapping of the sequence. In a further embodiment, a method is provided for identifying nucleic acid sequences that encode at least two interacting proteins.

Description

RELATED APPLICATION [0001] This application claims the benefit of U.S. Provisional Application No. 60 / 516,080, filed Oct. 31, 2003. [0002] The entire teachings of the above application are incorporated herein by reference.BACKGROUND OF THE INVENTION [0003] Whole genome shotgun sequencing, assembly and finishing is typically the strategy of choice for microbial and fungal genome sequencing. The cost-advantages and simplicity of the whole genome approach relative to a BAC based or hybrid sequencing strategy argue strongly for its continued development and application in future sequencing projects. However, a major problem with the BAC-based approaches is the high cost and operational burden associated with the production of 15,000-25,000 individual BAC subclone libraries, the 15-20% waste associated with re-sequencing the vector, as well as the unavoidable E. coli contamination, the need to deal with transposon and bacteriophage insertions, and the 20-50% waste in redundant sequencing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): C12Q1/68C12P19/34
CPCC12P19/34C12Q1/6806C12Q1/6816C12Q2525/191C12Q2521/313C12Q2525/131C12Q2521/501
Inventor SMITH, DOUGLASMALEK, JOELMCKERNAN, KEVIN
Owner APPL BIOSYSTEMS INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products