Software defect number predicting method based on feature selection and ensemble learning

A feature selection and software defect technology, applied in software testing/debugging, error detection/correction, electrical digital data processing, etc., can solve problems such as different algorithms, different prediction capabilities, model feature influence, etc., to achieve the effect of improving accuracy

Inactive Publication Date: 2017-10-13
WUHAN UNIV
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] However, these regression algorithms have different predictive capabilities, and the performance of the algorithms will vary due to different data sets. Experiments have proved th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software defect number predicting method based on feature selection and ensemble learning
  • Software defect number predicting method based on feature selection and ensemble learning
  • Software defect number predicting method based on feature selection and ensemble learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The flow chart of the method for predicting the number of software defects based on feature selection and integrated learning designed by the present invention is shown in the appendix figure 1 , all steps can be automatically run by those skilled in the art using computer software technology. The specific implementation process of the embodiment is as follows:

[0047] Step 1, mining software historical data, extracting n useful software modules from it. The granularity of software modules can be set as files, packages, classes or functions according to actual application scenarios. Then mark the number of defects in the software module.

[0048] Step 2, extract the attribute feature of software module, for the convenience of setting forth, assume that 5 attribute features are extracted in the embodiment: A 1 , A 2 , A 3 , A 4 , A 5 .

[0049] In this embodiment, the defect data set D={(x 1 ,y 1 ),(x 2 ,y 2 ),(x 3 ,y 3 ),(x 4 ,y 4 ),(x 5 ,y 5 )}, wher...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of software defect prediction, and particularly relates to a software defect number predicting method based on feature selection and ensemble learning. For solving the problems that in prediction of the software defect number, irrelevant module features damage the performance of a defect prediction model, regression models have different prediction abilities, an optimal regression algorithm cannot be selected and the like, the method comprises the following steps: filtering irrelevant and redundant features by using a feature selecting method based on a package mode; then using six different regression algorithms including a linear regression algorithm, a ridge regression algorithm, a decision tree regression algorithm, a gradient boosting regression algorithm, a nearest neighbor regression algorithm and a multilayered sensor regression algorithm, an ensemble learning technology is used, and a comprehensive regression model is constructed according to data instances of which features are screened. Compared with a single regression model, the software defect number predicting method based on feature selection and ensemble learning has the characteristic that the accuracy of software defect number prediction is improved.

Description

technical field [0001] The invention belongs to the technical field of software defect prediction, in particular to a method for predicting the number of software defects based on feature selection and integrated learning. Background technique [0002] (1) Software defect number prediction technology [0003] Software has become an important factor affecting national economy, military affairs, politics and even social life. Highly reliable and complex software systems depend on the reliability of the software they employ. Software defects are the potential source of related system errors, failures, crashes, and even machine crashes. The so-called defect, so far, there are many related terms and definitions in academia and industry, such as failure, defect, bug, error, error, failure, failure, etc. According to ISO 9000, the definition of a defect is: to meet the requirements related to the intended or specified use. A defect is a part of the software that already exists a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/36
CPCG06F11/3608
Inventor 余啸刘进井溢洋崔晓晖邱昌
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products