The present invention is applicable to the technical field of bioinformatics and provides a method, apparatus, device, and storage medium for predicting protein binding sites. The method includes: receiving a protein sequence to be predicted; segmenting the sequence using a preset sliding window and sliding step to obtain multiple amino acid subsequences; constructing a word vector of the protein sequence from these amino acid subsequences; performing document feature extraction on the word elements and constructing a document feature vector of the protein sequence from the extracted document features; performing biological feature extraction on the protein chain for these amino acid subsequences and constructing a biological feature vector of the protein sequence from the extracted biological features; and using a preset amino acid residue classification model to classify the amino acid subsequences represented by the document feature vector and the biological feature vector, thereby obtaining the amino acid residue types of the protein sequence and improving the accuracy and generality of protein binding site prediction.
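
As an illustration of the sliding-window segmentation step only, the following is a minimal Python sketch. The function name `split_into_subsequences`, the parameter names `window` and `step`, and the example sequence are hypothetical and do not come from the invention. The choice to drop trailing residues that do not fill a complete window is likewise an assumption, since the abstract does not say how the end of the sequence is handled.

```python
def split_into_subsequences(sequence: str, window: int, step: int) -> list[str]:
    """Cut a protein sequence into overlapping amino acid subsequences.

    Hypothetical helper: `window` is the subsequence length and `step` is the
    number of residues the window advances each time. Trailing residues that
    do not fill a complete window are dropped (an assumption of this sketch).
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive integers")
    # The last valid start index is len(sequence) - window.
    last_start = len(sequence) - window
    return [sequence[i:i + window] for i in range(0, last_start + 1, step)]


# Example with a made-up 10-residue sequence in one-letter amino acid codes.
subsequences = split_into_subsequences("MKTAYIAKQR", window=3, step=2)
print(subsequences)  # ['MKT', 'TAY', 'YIA', 'AKQ']
```

Each resulting subsequence can then be treated as a "word" of the protein "document" when building the word vector and the document feature vector; those later steps, and the classification model, are not sketched here.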