Software defect prediction-oriented unbalanced data generation method
A software defect prediction and data generation technology, which is applied in software testing/debugging, electrical digital data processing, computer components, etc., can solve the problems of unresolved intra-class imbalance, performance degradation, and uneven distribution of defective samples. To achieve the effect of improving the prediction accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0012] The present invention is an unbalanced data generation method oriented to software defect prediction, and its purpose is to generate data by adopting different strategies for defective samples with different distributions, so as to make the data set balance between classes and within classes, thereby improving the accuracy of prediction Rate. The specific implementation process of the present invention can be divided into the following stages:
[0013] In the first stage, the distribution is discussed. Discussing the distribution of software defect data sets in the feature space, it is found that there are usually three distributions of the two types of samples: the number of defective samples is greater than the number of non-defective samples, the number of defective samples is smaller than the number of non-defective samples and the number of defective samples Much smaller than the number of defect-free samples.
[0014] In the second stage, the samples are divided...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
