Boosting based cost sensitive software defect prediction method

A software defect prediction, cost-sensitive technology, applied in computer parts, instruments, character and pattern recognition, etc.

Inactive Publication Date: 2016-12-07
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to address the deficiencies of the above-mentioned background technology, and to provide a cost-sensitive software defect prediction method based on Boosting, which adopts a combination of Boosting and cost-sensitive software in the process of attribute subset selection and weight

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Boosting based cost sensitive software defect prediction method
  • Boosting based cost sensitive software defect prediction method
  • Boosting based cost sensitive software defect prediction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Combine below figure 1 and figure 2 The technical solution of the invention is described in detail.

[0046] A method of generating a plurality of different k-NN basic predictor sets through Bootstrp iterative sampling and constructing a defect prediction model integrating k-NN predictors is finally used in the field of software defect prediction. In the sampling process, the subset selection method based on cost-sensitive random deletion of attributes one by one is used to find the k value and attribute subset that minimizes the prediction error cost, and the weight update mechanism based on prediction error cost-sensitive is used to resample the different instances of Bootstrp Assign the corresponding weights, and use this to construct the weight vector as the basis for the next sampling. Based on the new sampling set, re-find the k value and attribute subset that minimizes the cost, until the set number of basic predictors is obtained. An integrated predictor with...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a boosting based cost sensitive software defect prediction method, belonging to the technical field of software engineering application. According to the invention, re-sampling is carried out by means of Bootstrap, the erroneous deletion of valuable attributes can be prevented by using a cost-sensitive subclass selection manner of randomly deleting an attribute during attribute selection, and meanwhile, the selected attribute sub-class facilitates the reduction of prediction error costs, when the weight is updated, a cost-sensitive weight updating mechanism is adopted, large weight is endowed to a data set with high cost, so that the data can be guaranteed to be studied for many times to obtain a reasonable integrated prediction model, and the prediction model is applied to small sample data to accurately predict software defects, and therefore, the technical problems that the prediction effects are not satisfactory due to the shortage of training data under the small sample data and the unequal cost of false positives and false negatives in the prediction process are solved.

Description

technical field [0001] The invention discloses a cost-sensitive software defect prediction method based on Boosting, which belongs to the technical field of software engineering applications. Background technique [0002] At present, the classical learning methods used in the field of machine learning for static methods of software defect prediction mainly include naive Bayesian, support vector machines, decision trees, BP neural networks, random forests, or improved methods based on these methods. However, due to these learning Some algorithms have strict restrictions on the tested data, and some processing data is not accurate enough, which leads to unsatisfactory results when applied to the prediction of software defects. Moreover, in the case of small samples, all the above learning methods do not comprehensively consider a series of problems such as insufficient training data, class imbalance of training results, false positives and false negatives in the prediction pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/214G06F18/24
Inventor 燕雪峰杨杰王凯范亚琼张晓策薛参观
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products