A software defect tendency prediction method based on smote+boosting algorithm

A prediction method and technology for software defects, applied in software testing/debugging, calculation, error detection/correction, etc., can solve the problem of low classification accuracy of minority classes, and achieve the effect of avoiding distance calculation and simple calculation

Active Publication Date: 2018-05-18
BEIHANG UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0018] The present invention proposes a method for predicting software defect tendency based on the SMOTE+Boosting algorithm in order to solve the problem of low classification accuracy of the minority class in the case of unbalanced data in the integrated algorithm, including the following steps:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A software defect tendency prediction method based on smote+boosting algorithm
  • A software defect tendency prediction method based on smote+boosting algorithm
  • A software defect tendency prediction method based on smote+boosting algorithm

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0127] The data in the embodiment comes from the public NASA Metrics Data Program (MDP), which is a NASA software for collecting, verifying, organizing, storing and delivering metric data. Select 5 related project data in NASA MDP, and each project contains detailed quantitative values ​​of metrics and error data of software modules and other information. The basic situation of these five related projects is as follows:

[0128] CM1 project: A scientific instrumentation project implemented in C. The total number of modules is 344; the number of non-failure-prone / low-risk modules is 302; the number of fault-prone / high-risk modules is 42.

[0129] MC2 project: Video guide system implementation software implemented in C. The total number of modules is 127; the number of non-failure-prone / low-risk modules is 83; the number of failure-prone / high-risk modules is 44.

[0130] MW1 project: Zero-gravity experiment software implemented in C. The total number of modules is 204; the n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a SMOTE+Boosting algorithm based software defect tendency prediction method. The method comprises the steps of for the non-equilibrium problem of software defect data, firstly synthesizing artificial minority class samples by using an SMOTE algorithm and enabling the minority class samples and majority class samples to be balanced in quantity; secondly, calculating a penalty factor of each artificial minority class sample according to density information of original sample distribution to adjust a weight of the artificial sample so as to enable a basic classifier to distinguish the learning of an original sample and the artificial minority class sample and put more emphasis on the learning of the original sample and the artificial sample with relatively high credibility; continuing to use an original loss function for the original sample, adding a penalty factor for the loss function of the artificial minority class sample to penalize the artificial sample with low credibility, and determining a new loss function; and finally, forming a new Boosting algorithm. According to the method, the classification precision of a minority class and a majority class is improved and the problem of low classification precision of a classification model to the minority class caused by data non-equilibrium in the field of prediction and classification is solved to a certain extent.

Description

technical field [0001] The invention belongs to the technical field of software defect prediction, relates to software defect prediction technology in software quality prediction, and specifically refers to a software defect tendency prediction method based on SMOTE and Boosting algorithms. Background technique [0002] With the continuous development of computer technology and the wide application of software in people's production and life, people's requirements for software quality and reliability are getting higher and higher. People expect that by accurately predicting the software quality, it can be used to guide the resource allocation in the software development process, so as to ensure the delivery of software products on schedule and improve the quality of software products. [0003] At present, the prediction of software defects is to measure the quality of software by predicting the risk or number of defects contained in the software; using pattern recognition al...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/36
CPCG06F11/3668
Inventor 利广玲王世海刘斌
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products