A Self-learning Method for Protein Structure Prediction Based on Residue Contact Information

A technology for protein structure and contact information, applied in informatics, bioinformatics, and used to analyze two-dimensional or three-dimensional molecular structures, etc., can solve the problem of affecting prediction accuracy, "oversampling, and inability to effectively capture remote forces between residues" and other issues to achieve the effect of improving the prediction accuracy

Active Publication Date: 2021-04-06
ZHEJIANG UNIV OF TECH
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the knowledge energy function of this method cannot effectively capture the long-range interaction between residues, and when predicting a target protein with a long sequence, switching between stages under a fixed cost is likely to cause "oversampling" or insufficient sampling to affect the prediction accuracy.
[0006] Therefore, existing multi-stage protein structure prediction methods are deficient in multi-stage sampling and prediction accuracy, and need to be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Self-learning Method for Protein Structure Prediction Based on Residue Contact Information
  • A Self-learning Method for Protein Structure Prediction Based on Residue Contact Information
  • A Self-learning Method for Protein Structure Prediction Based on Residue Contact Information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention will be further described below in conjunction with the accompanying drawings.

[0043] refer to Figure 1 ~ Figure 3 , a protein structure prediction method based on residue contact information self-learning, including the following steps:

[0044] 1) Given the input sequence information, use the Robetta server to obtain the fragment library of the sequence;

[0045]2) Use RaptorX-Contact to predict the contact map of the sequence, obtain the contact situation of N residue pairs, and make the contact between the kth residue pair in the contact map, the contact refers to the Cα-Cα Euclidean distance less than The exposure probability is denoted as P k , k∈{1,...,N};

[0046] 3) Initialization: population size NP, information entropy threshold α, the maximum number of iterations in the first and second phases of the population are G1 and G2 respectively, according to the input sequence, execute the first and second phases of the Rosetta Abinitio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A protein structure prediction method based on self-learning of residue contact information. First, use Robetta and RaptorX-Contact to obtain fragment library and contact map; secondly, learn residue pair distance distribution and establish information entropy in the first stage of population evolution The index reflects the degree of convergence of the population to achieve the purpose of self-learning; then, in the second stage of the final population evolution, use the learned residues to establish a scoring function for the distance distribution information, and assist the energy function to search the conformation space; finally, through clustering Get the final prediction result. The present invention provides a method for predicting protein structure. On the one hand, it can independently learn residue pair distance information and assist energy function to optimize conformational space; on the other hand, it can construct information entropy index to realize two-stage dynamic switching.

Description

technical field [0001] The invention relates to the fields of biological informatics, intelligent optimization and computer application, and in particular to a protein structure prediction method based on residue contact information self-learning. Background technique [0002] A protein is a biological macromolecule with a certain specific spatial structure formed by a polypeptide chain composed of amino acids in the form of "dehydration condensation" after twists and turns, and thus plays a specific function in an organism. The three-dimensional structure of proteins is of great importance in drug design, protein engineering, and biotechnology. At present, millions of protein sequences have been resolved, but most of the protein structures are unknown. Therefore, protein structure prediction is an important research problem. [0003] The gap between protein sequence and structure is mainly due to the rapid development of sequencing technology and relatively slow progress ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B15/20
Inventor 张贵军谢腾宇马来发周晓根王柳静郝小虎
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products