The invention relates to a random forest-based sucrose transporter identification method, which comprises the following steps of: firstly, acquiring initial data from a protein database, preprocessing the initial data, deleting sequences containing non-standard letters and sequences with too short lengths, and deleting sequences with the similarity greater than 60%; extracting different features according to physicochemical properties and evolutionary information of the protein, and taking each feature and the combined feature as feature input; then, due to the fact that the difference between the sample numbers of the positive example and the counter example is large, performing oversampling on the data set; and finally, under ten-fold cross validation, performing feature training on the oversampled training set by using a random forest, a support vector machine, stochastic gradient descent and naive Bayes, performing testing by using a test set, and analyzing a result. According to the method, a k-separated-bigrams-PSSM and random forest combined method is used, and a Borderline-SMOTE algorithm is introduced, so that the problem of data imbalance is solved, and the identification accuracy of the sucrose transporter is effectively improved.