Unlock instant, AI-driven research and patent intelligence for your innovation.

A Deep Convolutional Neural Network-Based Method for Predicting DNA-Binding Residues

A technique of binding residues and deep convolution, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problems of insufficient attention to noise information, high cost, and inability to guarantee prediction accuracy. The effect of improving forecast efficiency and accuracy

Active Publication Date: 2022-04-05
深圳新锐基因科技有限公司
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although existing methods can be used to predict DNA-binding residues in protein sequences, a large amount of experimental data and machine learning algorithms are generally used, so the cost is high. The accuracy is not guaranteed to be optimal and needs to be further improved
[0004] In summary, the existing methods for predicting DNA binding residues are still far from the requirements of practical applications in terms of computational cost and prediction accuracy, and urgently need to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Deep Convolutional Neural Network-Based Method for Predicting DNA-Binding Residues
  • A Deep Convolutional Neural Network-Based Method for Predicting DNA-Binding Residues
  • A Deep Convolutional Neural Network-Based Method for Predicting DNA-Binding Residues

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described below in conjunction with the accompanying drawings.

[0024] refer to figure 1 and figure 2 , a method for predicting DNA binding residues based on a deep convolutional neural network, comprising the following steps:

[0025] 1) Input a protein sequence S with the number of residues L to be predicted for DNA binding residues;

[0026] 2) For the protein sequence S, use the psi-blast (https: / / toolkit.tuebingen.mpg.de / tools / psiblast) program to search the protein sequence database swissprot (https: / / ftp.ncbi.nlm.nih.gov / blast / db / FASTA / ) Generate a position-specific scoring matrix of size L×20, denoted as PSSM;

[0027] 3) For the protein sequence S, use the program PSSpred (https: / / zhanglab.ccmb.med.umich.edu / PSSpred) to search the protein sequence database nr (https: / / ftp.ncbi.nlm.nih.gov / blast / db / FASTA / nr) generates a protein secondary structure matrix with a size of L×3, denoted as PSS;

[0028] 4) the two-dimens...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method for predicting DNA binding residues based on a deep convolutional neural network. First, according to the input protein sequence information whose number of residues is L to be predicted for ligand binding residues, use the psi-blast program and the PSSpred program to obtain matrix PSSM and PSS; then, combine the two matrices into a feature matrix F; secondly, we process the protein sequence into residue samples; thirdly, build a deep convolutional neural network, using the protein sequence of known binding residues to construct Data set, and the data set is divided into M groups of data subsets, using these ten sets of data subsets to train M network models; finally, the protein sequence to be predicted is processed into residue samples, and input to the trained In the M network models of , the prediction results of these M models are combined to predict whether the residues in the protein sequence are binding residues. The invention has low calculation cost and high prediction accuracy.

Description

technical field [0001] The invention relates to the fields of bioinformatics, pattern recognition and computer applications, in particular to a method for predicting DNA binding residues based on a deep convolutional neural network. Background technique [0002] Protein-ligand interaction is ubiquitous and indispensable in life processes, and this interaction plays a very important role in the recognition and signal transmission of biomolecules. Among them, DNA molecules belong to the category of ligand molecules. Accurately identifying the binding residues of DNA molecules in protein sequences is helpful for understanding protein functions, analyzing the interaction mechanism between proteins and DNA molecules, and designing drug target proteins. important biological significance. [0003] Research literature found that many methods for predicting DNA-binding residues in protein sequences have been proposed, such as: DISPLAR (Tjong H, Zhou H.an accurate method for predicti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16B20/00G16B50/00G06N3/04G06N3/08
CPCG16B20/00G16B50/00G06N3/08G06N3/048G06N3/045
Inventor 胡俊白岩松樊学强郑琳琳张贵军
Owner 深圳新锐基因科技有限公司