Methods, systems, and software for identifying functional biomolecules

a biomolecule and functional technology, applied in the field of molecular biology, molecular evolution, bioinformatics, etc., can solve the problems of inability to explore exhaustively, large protein sequence space, and long time-consuming to achieve protein design

Inactive Publication Date: 2011-10-20
CODEXIS MAYFLOWER HLDG LLC
View PDF1 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0006]One aspect of the present invention pertains to methods, apparatus, and software for identifying amino acid residues for variation in a protein variant library. These residues are then varied in the sequences of protein variants in the library in order to affect a desired activity such as stability, catalytic activity, therapeutic activity, resistance to a pathogen or toxin, toxicity, etc. The method of this aspect may be described by the following sequence of operations: (a) receiving data characterizing a training set of a protein variant library; (b) from the data, developing a sequence activity model that predicts activity as a function of amino acid residue type and corresponding position in the sequence; and (c) using the sequence activity model to identify one or more amino acid residues at specific positions in the systematically varied sequences that are to be varied in order to impact the desired activity. In this method, the protein variants in the library may have systematically varied sequences. Further, the data provides activity and sequence information for each protein variant in the training set.

Problems solved by technology

Protein design has long been known to be a difficult task if for no other reason than the combinatorial explosion of possible molecules that constitute searchable sequence space.
The sequence space of proteins is immense and is impossible to explore exhaustively.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods, systems, and software for identifying functional biomolecules
  • Methods, systems, and software for identifying functional biomolecules
  • Methods, systems, and software for identifying functional biomolecules

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

character string variants that include multiple improved objectives relative to other members of the set of biopolymer character string variants.

[0022]FIG. 6 is a chart that depicts steps performed in one embodiment of a method of evolving libraries for directed evolution.

[0023]FIG. 7 is a chart that depicts certain steps performed in an embodiment of a method of producing a fitter population of character string libraries.

[0024]FIG. 8 is a chart that shows certain steps performed in an embodiment of a method of selecting amino acid positions in a polypeptide variant to artificially evolve.

[0025]FIG. 9 is a chart that shows certain steps performed in another embodiment of a method of selecting amino acid positions in a polypeptide variant to artificially evolve.

[0026]FIG. 10 is a chart that shows certain steps performed in an embodiment of a method of identifying amino acids in polypeptides that are important for a polypeptide sequence-activity relationship.

[0027]FIG. 11 is a chart t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

PropertyMeasurementUnit
physicalaaaaaaaaaa
chemical propertiesaaaaaaaaaa
dimensionaaaaaaaaaa
Login to view more

Abstract

The present invention generally relates to methods of rapidly and efficiently searching biologically-related data space. More specifically, the invention includes methods of identifying bio-molecules with desired properties, or which are most suitable for acquiring such properties, from complex bio-molecule libraries or sets of such libraries. The invention also provides methods of modeling sequence-activity relationships. As many of the methods are computer-implemented, the invention additionally provides digital systems and software for performing these methods.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]This application is a continuation of U.S. patent application Ser. No. 11 / 429,628, filed May 5, 2006, naming Gustafsson et al. as inventors, and titled “Methods, Systems, and Software for Identifying Functional Biomolecules” which is a divisional of U.S. patent application Ser. No. 10 / 629,351, filed Jul. 29, 2003, naming Gustafsson et al. as inventors, and titled “Methods, Systems, and Software for Identifying Functional Bio-Molecules” which is a continuation in part of U.S. patent application Ser. No. 10 / 379,378, filed Mar. 3, 2003, naming Gustafsson et al. as inventors, and titled “Methods, Systems, and Software for Identifying Functional Bio-Molecules.” U.S. patent application Ser. No. 10 / 629,351 is also a continuation in part of International Application No. PCT / US03 / 06551 filed Mar. 3, 2003, naming Gustafsson et al. as inventors. Both U.S. patent application Ser. No. 10 / 379,378 and International Application No. PCT / US03 / 06551 claim t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): C40B30/02G06F19/00C07K1/00G16B20/20G01N33/48G01N33/50G01N33/53G01N33/543G16B20/50G16B30/10G16B35/20G16B40/20
CPCC40B30/02C40B50/02G06F19/22G06F19/24G06F19/18G16B20/00G16B35/00G16C20/60G16B30/00G16B40/00G16B30/10G16B35/20G16B40/20G16B20/50G16B20/20
Inventor GUSTAFSSON, CLAESGOVINDARAJAN, SRIDHAREMIG, ROBIN A.FOX, RICHARD JOHNROY, AJOY K.MINSHULL, JEREMY S.DAVIS, S. CHRISTOPHERCOX, ANTHONY R.PATTEN, PHILLIP A.CASTLE, LINDA A.SIEHL, DANIEL L.GORTON, REBECCA LYNNECHEN, TEDDY
Owner CODEXIS MAYFLOWER HLDG LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products