Check patentability & draft patents in minutes with Patsnap Eureka AI!

Method for constructing prediction model, method and device for predicting difficulty of polypeptide synthesis

A technology for peptide synthesis and prediction model, applied in the computer field, can solve problems such as difficulty in synthesis and difficult to find, and achieve the effect of ensuring accuracy and reducing the cost of repeated synthesis

Active Publication Date: 2020-12-15
JIANGSU GENSCRIPT BIOTECH CO LTD
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The influence of these factors on the difficulty of polypeptides is interactive and non-linear. For example, if the length of the polypeptide is longer than 30aa, it will increase the difficulty of synthesis, but if the length of the polypeptide is too short, less than 5aa, it will also cause difficulties in synthesis.
Therefore, it is difficult to find a set of simple criteria to judge the difficulty of synthesis of peptide sequences

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for constructing prediction model, method and device for predicting difficulty of polypeptide synthesis
  • Method for constructing prediction model, method and device for predicting difficulty of polypeptide synthesis
  • Method for constructing prediction model, method and device for predicting difficulty of polypeptide synthesis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0088] Embodiment 1 (prediction result is the best):

[0089] Take a number of peptides and synthesize them separately in a round of synthesis-purification-intermediate control quality inspection, record relevant synthesis data, including peptide sequence, target purity, target product, characteristic information, and the final number of synthesis rounds, and synthesize them into Polypeptides with more than 3 rounds (including 3) are marked as difficult to synthesize, and the corresponding synthesis difficulty is difficult; the rest of the peptides are marked as common polypeptides, and the corresponding synthesis difficulty is normal. According to the specific sequence of the polypeptide, calculate the isoelectric point (use the isoelectric point calculation function in the module bioperl of perl), hydrophobicity, the number of repeated fragments, the length of repeated fragments, the number of ALIVES within 5aa of the C-terminal, etc. as characteristic information. Use the k...

Embodiment 2

[0090] Embodiment 2 (prediction result is close to best):

[0091] Take a number of peptides and synthesize them separately in a round of synthesis-purification-intermediate control quality inspection, record relevant synthesis data, including peptide sequence, target purity, target product, characteristic information, and the final number of synthesis rounds, and synthesize them into Polypeptides with more than 3 rounds (including 3) are marked as difficult to synthesize, and the corresponding synthesis difficulty is difficult; the rest of the peptides are marked as common polypeptides, and the corresponding synthesis difficulty is normal. According to the specific sequence of the polypeptide, calculate the isoelectric point (use the isoelectric point calculation function in the module bioperl of perl), hydrophobicity, the number of repeated fragments, the length of repeated fragments, the number of ALIVES within 5aa of the C-terminal, etc. as characteristic information. Use th...

Embodiment 3

[0092] Example 3 (largest difference, worst result):

[0093] Take a number of peptides and synthesize them separately in a round of synthesis-purification-intermediate control quality inspection, record relevant synthesis data, including peptide sequence, target purity, target product, characteristic information, and the final number of synthesis rounds, and synthesize them into Polypeptides with more than 3 rounds (including 3) are marked as difficult to synthesize, and the corresponding synthesis difficulty is difficult; the rest of the peptides are marked as common polypeptides, and the corresponding synthesis difficulty is normal. According to the specific sequence of the polypeptide, calculate the isoelectric point (use the isoelectric point calculation function in the module bioperl of perl), hydrophobicity, the number of repeated fragments, the length of repeated fragments, the number of ALIVES within 5aa of the C-terminal, etc. as characteristic information. Use the k...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a peptide synthesis difficulty model constructing method and device, and a peptide synthesis difficulty predicting method and device. The peptide synthesis difficulty model constructing method comprises the steps of firstly, obtaining training data and testing data of a peptide synthesis difficulty predicting model, introducing the training data and the testing data into a plurality of preselected machine learning models, training the plurality of machine learning models for obtaining a training result evaluation index value of each machine learning model, analyzing thepredicting result of each machine learning model according to the training result evaluation index value of each machine learning model, and determining the machine learning model with best predictingresult as the peptide synthesis difficulty predicting model. Then the peptide sequence, the target purity, the target yield and the characteristic information of the to-be-tested peptide are input into the peptide synthesis difficulty model for processing for obtaining the synthesis difficulty of the to-be-tested peptide and realizing peptide synthesis difficulty predicting.

Description

technical field [0001] The present invention relates to the field of computer technology, and more specifically, to a method for constructing a prediction model of polypeptide synthesis difficulty, a method and a device for predicting polypeptide synthesis difficulty. Background technique [0002] A polypeptide is an organic compound in which multiple amino acids are connected by peptide bonds. The synthesis of a polypeptide is the process of using organic chemical means to synthesize a polypeptide. In addition to the correct sequence of the peptide product, industrial peptide synthesis has requirements for the purity and yield of the product. [0003] Polypeptide solid-phase synthesis is composed of a series of complex chemical reactions. Any problem with one or several reactions may cause the sequence, purity or yield of the product to be substandard. The synthesis route needs to be redesigned, which increases the synthesis cost of the peptide. Prolonged the synthesis tim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16C20/70
Inventor 盛夏李英连王卫国
Owner JIANGSU GENSCRIPT BIOTECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More