System and methods for predicting transmembrane domains in membrane proteins and mining the genome for recognizing G-protein coupled receptors

a technology of membrane proteins and genomes, applied in the field of system and methods for predicting transmembrane domains in membrane proteins and mining the genomes for recognizing gprotein coupled receptors, can solve the problems of time-consuming and expensive techniques, undesirable side effects, and difficult determination of tertiary structure of proteins, and achieve the effect of fast and accurate procedure for predicting

Inactive Publication Date: 2006-01-12
CALIFORNIA INST OF TECH
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0013] The invention provides a protocol using a hydrophobic profile based on protein sequence information and / or protein multisequence alignment information to predict transmembrane helical regions in proteins, such as multipass membrane proteins including G-Protein Coupled Receptors (GPCRs). The protocol features a coarse grain sampling methods, such as hydrophobicity analysis, to provide a fast and accurate procedure for predicting TM regions.

Problems solved by technology

Determining a protein's tertiary structure is more difficult.
However, these techniques can be time-consuming and expensive, and not all proteins are equally amenable to structural examination by these methods.
In addition, many different types of GPCRs are similar enough that they are affected by the antagonists or agonists for other types (e.g., among adrenergic, dopamine, serotonin, and histamine receptors), leading often to undesirable side effects.
This makes it difficult to develop drugs to a particular subtype without side effects resulting from cross-reactivity to other subtypes.
However, although much effort has been put into elucidating the structure of GPCRs, only a very small number of complete 3D structures of transmembrane proteins are known from experiments (e.g., bacteriorhodopsin and bovine rhodopsin).
In fact, there is no atomic-level structure available for any human GPCRs.
Consequently, design of subtype-specific drugs for GPCR targets is a very tedious empirical process, often leading to drugs with undesirable side effects.
The difficulty in obtaining three-dimensional structures for GPCRs is obtaining high-quality crystals of these membrane-bound proteins sufficient to obtain high-resolution x-ray diffraction data, and the difficulty of using NMR to determine structure on such membrane-bound systems.
For globular proteins, there have been significant advances in predicting the three-dimensional structures by using sequence homologies to families of known structures (Marti-Renom et al., 2000); however, this is not practical for GPCRs, inasmuch as a high-resolution crystal structure is available for only one GPCR, bovine rhodopsin—which has low homology (<35%) to most GPCRs of pharmacological interest.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and methods for predicting transmembrane domains in membrane proteins and mining the genome for recognizing G-protein coupled receptors
  • System and methods for predicting transmembrane domains in membrane proteins and mining the genome for recognizing G-protein coupled receptors
  • System and methods for predicting transmembrane domains in membrane proteins and mining the genome for recognizing G-protein coupled receptors

Examples

Experimental program
Comparison scheme
Effect test

example 1

Validation of the Method by Predicting TM Helices in Bovine Rhodopsin

[0184] We have used the TM2ndS algorithm for predicting the structure for about 10 very different GPCR classes (Vaidehi, N., W. B. Floriano, R. Trabanino, S. E. Hall, P. Freddolino, E. J. Choi, G. Zamanakos, and W. A. Goddard. 2002. Prediction of structure and function of G-protein-coupled receptors. Proc. Natl. Acad. Sci. USA. 99:12622-12627). In each case the predicted binding site and binding energy agrees well with available experimental data, providing some validation of the TM region prediction. However only for bovine rhodopsin can we make precise comparisons to experimental structures. We have also used TM2ndS to predict the TM regions and length of TM regions for membrane proteins for which crystal structure is available. Our predictions give 100% of the database as membrane proteins. The length of the TM regions is predicted to within 2 residues of the crystal structure on either carboxy or amino terminu...

example 2

GPCR Mining from the Genome

[0195] Using the TM2ndS module without the capping functionality, and in an automated fashion, we have searched the cDNA sequences in databases that are annotated to be proteins in the genome. The flowchart of this automated version of TM2ndS for data mining is shown in FIG. 3. Instead of applying a multiple sequence alignment on various sequences, one sequence was used for TM2ndS prediction (as on left side of flowchart). We used the cDNA sequences from the Riken database (J. Kawai, A. Shinagawa, et al. 2001, Nature, 409, 685-690) for this purpose, and the results for GPCR mining are given below.

[0196] The Riken Mouse Gene Encyclopedia Project was aimed at determining the function of a full cDNA library obtained for the Mouse genome. This is a good validation for the technique of data mining by searching (using TM2ndS without capping), sequences which potentially have the function of a G-protein coupled receptor. I infer function from structure, so I se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

PropertyMeasurementUnit
TMaaaaaaaaaa
hydrophobicaaaaaaaaaa
hydrophobicityaaaaaaaaaa
Login to view more

Abstract

The invention provides computer-implemented methods and apparatus implementing a hierarchical protocol using multiscale molecular dynamics and molecular modeling methods to predict the presence of transmembrane regions in proteins, such as G-Protein Coupled Receptors (GPCR), and protein structural models generated according to the protocol. The protocol features a coarse grain sampling method, such as hydrophobicity analysis, to provide a fast and accurate procedure for predicting transmembrane regions. Methods and apparatus of the invention are useful to screen protein or polynucleotide databases for encoded proteins with transmembrane regions, such as GPCRs.

Description

REFERENCE TO RELATED APPLICATIONS [0001] This application claims the benefit under 35 U.S.C. 119(e) of U.S. Provisional Application No. 60 / 491,334, filed on Jul. 29, 2003, the entire content of which is incorporated herein by reference.GOVERNMENT SUPPORT [0002] The invention described herein was supported, in whole or in part, by Grant Nos. NIHBRGRO1-GM625523, NIH-R29AI40567, NIH-HD36385, and other grants from other U.S. Government Agencies including the Army Research Office (ARO), the Office of Naval Research (ONR), the Department of Energy (DOE), and the National Science Foundation (NSF). The U.S. Government has certain rights in this invention.BACKGROUND OF THE INVENTION [0003] Proteins are linear polymers made up of 20 different naturally-occurring amino acids. The particular linear sequence of amino acid residues in a protein is said to define the protein's primary structure. In its natural environment, a protein folds into a three-dimensional structure determined by its primar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F19/00G01N33/48G16B15/20G16B30/10
CPCG06F19/22G06F19/16G16B15/00G16B30/00G16B15/20G16B30/10
Inventor TRABANINO, RENEVAIDEHI, NAGARAJANHALL, SPENCERGODDARD, WILLIAMFLORIANO, WELY
Owner CALIFORNIA INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products