Analysis of transcriptomic data using similarity based modeling

a transcriptomic data and similarity-based modeling technology, applied in the field of transcriptomic data, can solve the problems of mis-regulation and upset of delicately balanced networks, and achieve the effects of reducing the complexity of the analysis

Inactive Publication Date: 2006-12-28
VENTURE GAIN L L C
View PDF7 Cites 59 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0012] In a first embodiment of the invention, a model is trained with multivariate transcriptome profiles representative of normal health, and the model is then used to detect deviations in transcriptome patterns and dynamics representative of a diseased state, which deviations point to underlying disease mechanisms. An autoassociative model of gene expression data for normal tissue is trained from normal tissue data. A new input vector representing expression level data from diseased tissue is modeled with this model to provide an autoassociative estimate of the expression levels, which is then differenced with the actual measurements to provide residuals. A residual threshold test detects gene expression levels that are abnormal, identifying these genes as potential markers. Furthermore, the residual pattern can be pattern matched against stored patterns for known disease types to classify the input vector vis-à-vis disease.
[0013] In a second embodiment of the invention, a diagnostic classification is made for a dynamic multivariate genomic or proteomic profile. An inferential model is trained to recognize normal and diseased state transcriptome profiles, and new samples are analyzed and classified accordingly.
[0014] The classification capability of the invention can advantageously be extended t

Problems solved by technology

Disease most often occurs when these delicately

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Analysis of transcriptomic data using similarity based modeling
  • Analysis of transcriptomic data using similarity based modeling
  • Analysis of transcriptomic data using similarity based modeling

Examples

Experimental program
Comparison scheme
Effect test

example

[0043] A simulated system comprising 15 constituents was developed whereby the 15 constituents related in their dynamic behavior to one another with varying degrees of linkage, emulating a regulatory network in a metabolic system. A set of reference data for this system, comprising observations of the 15 variables throughout various states of the dynamics of the system, was used to train a kernel-based model. Then, sets of normal and diseased observations, respectively, were generated, wherein one of the 15 constituents was perturbed to be slightly lower than it should be, regardless of its raw value. Turning to FIG. 10, a chart 1004 shows the value (quantity, expression level, etc.) for the suspect constituent, for the set of normal test observations. Each stem is a separate measurement of that constituent from the set of observations of “normal” specimens. Similarly is shown in chart 1005 the values for that same constituent in the set of observations of the “diseased” specimens. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An analytic apparatus and method is provided for diagnosis, prognosis and biomarker discovery using transcriptome data such as mRNA expression levels from microarrays, proteomic data, and metabolomic data. The invention provides for model-based analysis, especially using kernel-based models, and more particularly similarity-based models. Model-derived residuals advantageously provide a unique new tool for insights into disease mechanisms. Localization of models provides for improved model efficacy. The invention is capable of extracting useful information heretofore unavailable by other methods, relating to dynamics in cellular gene regulation, regulatory networks, biological pathways and metabolism.

Description

CROSS REFERENCE TO RELATED APPLICATIONS [0001] This application claims the benefit of priority under 35 U.S.C. § 119(e) to U.S. Provisional application Ser. No. 60 / 670,950 filed Apr. 13, 2005.FIELD OF THE INVENTION [0002] The invention relates to biomarker discovery and disease diagnosis and prognosis, especially based on transcriptomic data such as gene expression levels and proteomics. BACKGROUND OF THE INVENTION [0003] Recent advances in the biological sciences have made it possible to measure thousands of gene expression levels in cells using microarrays, and to quantify the cellular content of thousands of types of proteins using e.g., mass spectrometry, in single experiments with one sample of tissue or serum. Gene expression molecules, i.e., messenger RNA, and the proteins encoded by the mRNA comprise the “transcriptome” of the cell, the components of the cell that are the direct products of the transcription (the genome) and translation (the proteome) of DNA. Analysis of the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F19/00G16B40/10G16B25/10
CPCG06F19/24G06F19/20G16B25/00G16B40/00G16B40/10G16B25/10
Inventor PIPKE, ROBERT MATTHEWMOTT, JACK E.
Owner VENTURE GAIN L L C
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products