Software defect prediction method based on generative adversarial network and ensemble learning

A software defect prediction and integrated learning technology, applied in neural learning methods, software testing/debugging, biological neural network models, etc., can solve problems such as difficulty in finding rules, poor application effect, etc. The effect of alleviating the class imbalance problem

Pending Publication Date: 2022-02-18
XIAN UNIV OF TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the SMOTE and ADASYN methods adopt the form of artificially synthesizing few-sample data to reduce the problem of data class imbalance. The rules for generating data are artificially stipulated, and it is difficult to find suitable rules in real software defect data sets.
However, the ROS and RUS methods directly reduce the amount of data in the training set of the software defect prediction model, and the actual application effect is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software defect prediction method based on generative adversarial network and ensemble learning
  • Software defect prediction method based on generative adversarial network and ensemble learning
  • Software defect prediction method based on generative adversarial network and ensemble learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0080] Execution of step 1, the software defect data set used in this embodiment comes from the NASA software defect data set, including 12 sub-data sets, and the software defect measurement criteria used include the McCabe measurement method, the HalStead scientific measurement method, and the code line number measurement method and the CK metric. The number of features contained in each sub-dataset varies, see the column of features in Table 1. After the data preprocessing operation was performed on the original NASA software defect data set, the repeated data, repeated attributes and abnormal data were removed. The NASA software defect data set after preprocessing is shown in Table 1.

[0081] The data set in Table 1 is randomly sampled, and the training set and test set are divided according to the ratio of 8:2. Count the number of defective data and non-defective data in the training set data respectively, and then calculate the ratio of the two to obtain the resampling ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a software defect prediction method based on a generative adversarial network and ensemble learning. The method comprises the following steps: 1, carrying out the preprocessing of a software defect data set, dividing the data set into a training set and a test set, and calculating a resampling rate; 2, constructing a generative adversarial network model; 3, inputting the training set into the generative adversarial network for training to obtain a trained generative adversarial network; 4, using the trained generative adversarial network to generate new few-sample defect data according to the resampling rate, and obtaining a resampled training set; and 5, constructing a software defect strong classifier by using an AdaBoost method, and inputting the test set into the trained software defect strong classifier to obtain a software defect prediction result. According to the software defect prediction method, the problem of software defect data imbalance is solved, and the accuracy, the correct rate, the recall rate and the F-measure performance of the software defect prediction method are improved.

Description

technical field [0001] The invention belongs to the technical field of software defect prediction, and in particular relates to a software defect prediction method based on generative confrontation network and integrated learning. Background technique [0002] With the popularization of informatization, there may be software defects in all kinds of software that are in use or in the process of development. Being able to find and locate software defects in a timely manner plays a major role in the normal operation of the software system and the improvement of software functions. . Software Defect Prediction (SDP) aims to find the module most likely to contain defects from a software project, where the module can be a function, a loop body, or a class, etc. Generally, software defect prediction methods are divided into the following three steps. First, according to the vulnerability reports of each module in the historical software, the software modules are marked with defec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/36G06K9/62G06N3/04G06N3/08
CPCG06F11/3672G06N3/08G06N3/045G06F18/285G06F18/24323
Inventor 孟海宁郑毅冯锴朱磊杨哲张嘉薇黑新宏
Owner XIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products