A Protein Subcellular Interval Prediction Method Using Bag-of-Words Model
A technology of subcellular interval and bag of words model, applied in the field of biology, can solve the problem of low accuracy, and achieve the effect of improving the recognition accuracy and improving the recognition accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0049] The present invention will be further described below in conjunction with specific examples.
[0050] Taking the dataset of 317 apoptotic protein sequences obtained from the SWISS-PROT database as an example, the bag-of-words model and the AAC algorithm are used to extract the bag-of-words features of the protein sequence, and sent to the support vector machine multi-class classifier for positioning predict. figure 1 is a schematic diagram of the word bag feature extraction process, such as figure 1 As shown, the specific steps are as follows. In the formula involved in the present invention, the symbol Λ represents the omitted item in the sequence.
[0051] 1. After obtaining the data set from the original database, first use the sliding window method to segment all protein sequences in the data set to generate several sequence words, and then extract the features of all sequence words. The specific steps are as follows:
[0052] First, the protein sequence is segment...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


