Software defect prediction method based on principal component analysis and combined sampling

A software defect prediction and principal component analysis technology, applied in the field of defect prediction, can solve the problems of uneven distribution of data classes, lack of defect sample information, etc., to reduce the problem of combination omission, improve prediction efficiency, and improve prediction accuracy.
CN109933539AInactive Publication Date: 2019-06-25YANSHAN UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
YANSHAN UNIV
Publication Date
2019-06-25
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
Patent Text Reader

Abstract

The invention discloses a software defect prediction method based on principal component analysis and combined sampling. The software defect prediction method comprises the following steps: S1, dimensionality reduction and denoising are selected for software defect data through fusion characteristics; S2, performing SMOTE oversampling and hierarchical random sampling on the data subjected to dimensionality reduction in combination for sampling, the oversampling means that class samples in a data set are relatively balanced by increasing the number of few class samples, hierarchical random sampling means that classification is performed by dividing classes, and no-replay random sampling is adopted in each layer; and S3, selecting a classifier for the processed data, and optimizing classifier parameters. According to the method, the random forest classifier is selected, and the characteristics of the characteristic subset are randomly selected, so that the purpose of randomizing the treeis further achieved, the overfitting problem of the classifier is avoided, finally, the software defect prediction performance and prediction efficiency are improved, and a good theoretical and experimental basis is provided for predicting defective software in reality.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to a defect prediction method, in particular to a software defect prediction method based on principal component analysis and combined sampling. Background technique

[0002] With the development of Internet technology, the reliability of software product quality has become a concern in the field of software engineering, and software defects will inevitably appear in the process of software development. However, for software with potential threats, once it is put into use, it will cause huge economic losses to companies and individuals. In order to effectively solve this problem, it is necessary to accurately and quickly predict the possible defect modules of the software, so as to improve the reliability of the software system.

[0003] Currently, related software defect prediction methods mainly utilize different types of machine learning techniques. Its main consideration is the prediction accuracy of the overall data. Althou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More