The invention discloses a
software defect prediction method based on two-stage wrapping-type
feature selection, and belongs to the field of
software quality assurance. The
software defect prediction method comprises the following steps: (1) mining the version
control system and the
defect tracking system of a software project, extracting a program module from the version
control system and the
defect tracking system, and carrying out type marking and
software measurement on the program module to generate a defect prediction
data set D; (2) carrying out two-stage wrapping-type
feature selection on the defect prediction
data set so as to remove redundant features and irrelevant features in the
data set D as many as possible, and finally, selecting an optimal feature subset FS' from an original
feature set FS; and (3) on the basis of the optimal feature subset FS', preprocessing the data set D, forming a preprocessed data set D', and finally, constructing a defect prediction model in virtue of a
decision tree which is a classification method. By use of the software defect prediction method, on one hand, the redundant features and the irrelevant features in the defect prediction data set can be effectively identified and removed, on the other hand, a
class imbalance problem in the defect prediction data set can be effectively alleviated, and finally, the performance of the defect prediction model can be effectively improved.