Unlock instant, AI-driven research and patent intelligence for your innovation.

Ligand binding residue prediction method based on multi-sequence joint matching information

A prediction method and a technology of binding residues, which are applied in the fields of bioinformatics and computer applications, can solve the problems of high calculation cost, optimization, and unguaranteed prediction accuracy, and achieve the effect of improving prediction accuracy and prediction efficiency

Active Publication Date: 2020-04-28
ZHEJIANG UNIV OF TECH
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although existing methods can be used to predict binding residues, a large number of training data sets and machine learning algorithms are commonly used, so the calculation cost is high, and because the noise information in the training set has not received enough attention, the prediction accuracy cannot be guaranteed is optimal
[0004] In summary, the existing methods for predicting ligand-binding residues are still far from the requirements of practical applications in terms of computational cost and prediction accuracy, and urgently need to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Ligand binding residue prediction method based on multi-sequence joint matching information
  • Ligand binding residue prediction method based on multi-sequence joint matching information
  • Ligand binding residue prediction method based on multi-sequence joint matching information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described below in conjunction with the accompanying drawings.

[0024] refer to figure 1 and figure 2 , a method for predicting ligand-binding residues based on multiple sequence alignment information, comprising the following steps:

[0025] 1) Input a protein sequence P to be tested with the number of residues L and the number of binding residues N;

[0026] 2) For the input protein sequence P to be predicted for ligand-binding residues, use the HHblits (https: / / toolkit.tuebingen.mpg.de / # / hhblits) program to search the protein sequence database UniRef90 (ftp: / / ftp .uniprot.org / pub / databases / uniprot / uniref / uniref90 / ) generate a multiple sequence alignment information containing M sequences, denoted as MSA;

[0027] 3) For any residue P in the protein sequence P i ,i=1,2,...,L, calculate the frequency of the same residue type appearing in the corresponding column in the residue alignment information of MSA at this position, de...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A ligand binding residue prediction method based on multi-sequence binding information comprises the following steps: firstly, according to an input protein sequence to be subjected to ligand bindingresidue prediction and the number of binding residues, acquiring the multi-sequence binding information of protein by using an HHbits program; secondly, calculating the frequency of the same residuesappearing at the corresponding positions of the protein sequence to be predicted and the multi-sequence joint information; thirdly, according to the frequency obtained through calculation and the number of input binding residues, acquiring a pseudo-cocorrelation coefficient matrix through calculation; finally, obtaining a maximum value in the pseudo-cocorrelation coefficient matrix, and outputtingresidue information of binding the same ligand in the to-be-predicted protein sequence according to a position corresponding to the maximum value. The invention provides a ligand binding residue prediction method based on multi-sequence joint matching information, which is low in calculation cost and high in prediction precision.

Description

technical field [0001] The invention relates to the fields of bioinformatics and computer applications, in particular to a method for predicting ligand binding residues based on multiple sequence alignment information. Background technique [0002] The interaction between proteins and ligand molecules is realized through the interaction between some amino acid residues and ligand molecules. This interaction is ubiquitous and indispensable in life activities. These amino acid residues are called binding molecules. Determine residues. Therefore, the precise identification of binding residues between proteins and ligands has important guiding significance for understanding protein functions, analyzing the relationship between biomolecules, and designing new drugs. [0003] Research literature found that many methods for predicting binding residues have been proposed, such as: COACH (Yang J, RoyA, Zhang Y. Protein–ligand binding site recognition using complementary binding-spec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G16B15/30
CPCG16B15/30
Inventor 胡俊郑琳琳樊学强白岩松张贵军
Owner ZHEJIANG UNIV OF TECH