Class imbalance software defect prediction method based on data resampling

A software defect prediction and resampling technology, which is applied in software testing/debugging, electrical digital data processing, computer components, etc., can solve problems such as class imbalance, and achieve the effect of being widely applicable and solving class imbalance problems

Active Publication Date: 2021-08-03
WUHAN UNIV
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to solve the class imbalance problem in software defect prediction and propose a class imbalance problem software defect prediction method, which is generally applicable to software defect prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Class imbalance software defect prediction method based on data resampling
  • Class imbalance software defect prediction method based on data resampling
  • Class imbalance software defect prediction method based on data resampling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0071] In order to make the object, technical scheme and advantages of the present invention clearer, the present invention will be further described below in conjunction with the accompanying drawings and specific implementations. Here, only the suitability examples of the present invention are used to illustrate the present invention, but not as the present invention. limit.

[0072] The overall implementation flow chart of the present invention is as figure 1 As shown, the specific implementation is as follows:

[0073] Step 1: Select any minority class data in the minority class data set to perform Euclidean distance calculation with each minority class data in the minority class data set, and select the minority class with the closest distance to the selected minority class data in the minority class data set Data, select any minority data in the minority data set to perform Euclidean distance calculation with each majority data in the majority data set, and filter out t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a class imbalance software defect prediction method based on data resampling. According to the invention, the minority class data and the majority class data which are closest to the minority class data are screened out by calculating the Euclidean distance between the minority class data set and the majority class elements and the Euclidean distance between the minority class data set and the majority class elements, and distance parameters of the minority class data are obtained through the Euclidean distance; the method also includes marking the minority class data in the minority class data set according to the distance parameter, and obtaining a minority data point type; calculating a K neighbor point set of each few element in the minority class data set, counting the number of majority class data and the number of minority class data in K neighbor points, and obtaining the number of newly generated minority class data; and respectively selecting two classifiers, carrying out confidence evaluation on newly generated software defect prediction minority class data to obtain a training data set, training the selected classifiers, and carrying out weighted voting to obtain a final prediction result. According to the invention, the class imbalance problem in the software defect prediction process can be well solved.

Description

technical field [0001] The invention belongs to the field of software defect prediction, in particular to a class imbalance software defect prediction method based on data resampling. Background technique [0002] With the development of society and the improvement of science and technology, the Internet has been deeply integrated into all aspects of our lives. Whether it is online shopping, going out by car, smart home, restaurant ordering, etc., various activities in our daily life can be done through The software is complete, and the usage scenarios of the software have penetrated into all aspects of our food, clothing, housing and transportation. In the process of software development, the demand for software functions is constantly increasing, the number of people served by software is constantly increasing, and the time of software development is constantly being compressed. Various problems lead to software defects in the process of software development. The occurren...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/36G06K9/62
CPCG06F11/3672G06F18/214G06F18/24147
Inventor 荆晓远孔晓辉陈昊文
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products