System for predicting influence of amino acid variation on protein structure stability, and method thereof

A protein structure and amino acid technology, applied in the field of biomedical data analysis, can solve problems such as poor practicability, increased protein stability, and poor quality of training data sets, achieving strong practicability, high prediction accuracy, and strong generalization. Effect

Inactive Publication Date: 2017-11-17
SUZHOU UNIV
View PDF2 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) The accuracy is poor. Currently, there are many data errors and omissions in the general experimental database Protherm, which leads to poor quality of the training data set and seriously affects the accuracy of the prediction results;
[0005] (2) Poor generalization. This method uses a large number of protein structure-related input attributes, but it cannot be predicted when the protein structure is unknown.
[0006] (3) Poor practicability. This method lacks a system that supports single and batch input and can classify prediction results into three categories (variation leads to increased, decreased, and unchanged protein stability)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System for predicting influence of amino acid variation on protein structure stability, and method thereof
  • System for predicting influence of amino acid variation on protein structure stability, and method thereof
  • System for predicting influence of amino acid variation on protein structure stability, and method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be described in detail below with reference to the accompanying drawings and in combination with embodiments.

[0038] see figure 1 As shown, a system for predicting the impact of amino acid variation on protein structure stability consists of an amino acid variation information input module 1, an amino acid variation site attribute calculation module 2, a protein sequence attribute calculation module 3, a stability change prediction module 4, and prediction results Output module 5, wherein, the amino acid variation information input module 1 is respectively connected with the amino acid variation site attribute calculation module 2 and the protein sequence attribute calculation module 3, the amino acid variation site attribute calculation module 2 and the The protein sequence attribute calculation module 3 is simultaneously connected with the predicted stability change module 4, and the predicted stability change module 4 is connected with th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a system for predicting the influence of amino acid variation on protein structure stability, and a method thereof. The system consists of an amino acid variation information input module, an amino acid variation site attribute calculation module, a protein sequence attribute calculation module, a prediction stability change module and a prediction result output module. The method comprises the following steps that: inputting and obtaining variation information; extracting AAindex attribute features, and calculating the micro-electrical physical and chemical attribute features of amino acid; calculating the conservative property and the protein attribute of a protein sequence corresponding to the amino acid variation; adopting a two-layer three-classification random forest algorithm to calculate the influence of the amino acid variation on the protein stability; and storing and outputting a prediction result. By use of the system and the method, on the basis of the amino acid variation provided by a user and the corresponding protein sequence, a situation that the structure stability of protein where the amino acid is positioned is raised and lowered or is invariant due to the amino acid variation is accurately predicted, a corresponding probability is also accurately predicted, and the result is stored and sent to the user to be stored.

Description

technical field [0001] The invention belongs to the technical field of biomedical data analysis, and in particular relates to a system and method for predicting the influence of amino acid variation on protein structure stability. Background technique [0002] An important index to predict the effect of amino acid variation on protein stability is the free energy change value ddG of the wild-type protein and the protein after mutation. At present, the existing prediction methods are divided into two types: one is to calculate directly based on energy and use physical formulas, but due to the unclear physical structure of proteins, the calculation results are not accurate and the generalization is weak; [0003] The other is to use machine learning methods to predict based on existing experimental data, but this method will have the following problems: [0004] (1) The accuracy is poor. Currently, there are many data errors and omissions in the general experimental database ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/24
CPCG16B40/00
Inventor 杨洋朱斐严文颖钱福良郁春江
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products