Software defect predication method based on cost sensitivity and semi-supervision

A software defect prediction, cost-sensitive technology, applied in software testing/debugging, error detection/correction, instruments, etc., can solve problems such as class imbalance, dimension disaster, low recall rate, etc., to achieve low prediction cost and improve accuracy , the effect of improving the recall rate

Active Publication Date: 2016-12-07
重庆优霓空科技有限公司
View PDF3 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1) Relying on a large amount of historical defect information, it is difficult or even impossible to obtain a certain amount of label data for learning in practical applications, and the prediction accuracy is not high without enough learning samples
[0006] 2) Defect data has obvious

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software defect predication method based on cost sensitivity and semi-supervision
  • Software defect predication method based on cost sensitivity and semi-supervision
  • Software defect predication method based on cost sensitivity and semi-supervision

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The present invention will be described in further detail below.

[0068] Software defect prediction models aim to classify software modules into two categories: defective and non-defective by analyzing static code. However, it is a pity that there is no classifier that can classify all software modules correctly so far. Then, the prediction model will inevitably bring certain costs to the misclassification of software modules, such as testing time, labor costs, losses caused by program crashes, etc. We call this type of cost misclassification costs. Mistakenly classifying a non-defective module as defective wastes manpower, financial resources, and time for detailed testing, while misclassifying a defective module as non-defective may cause program crashes and bring huge losses, such as suspension of transactions on e-commerce websites, and aircraft control The system is out of control, etc. Therefore, for binary classification problems such as software defect predic...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a software defect predication method based on cost sensitivity and semi-supervision. The software defect predication method comprises the following steps: S1, collecting a source code file of software to be predicated through a version control tool; S2, extracting a measurement element value from the source code file, wherein the source code file obtained by S1 is composed of I modules; S3, obtaining a sampling result set through a measurement element value selecting module in a sampling manner; S4, constructing a training set by a labeled sampling result set and a label-free sampling result set; S5, extracting a target function and solving a classification function enabling the target function to be minimum; and S6, predicating a module to be predicated through the classification function and outputting a predicated result. The method provided by the invention fuses semi-supervision and cost sensitivity thoughts to construct a software defect predication model and two problems in software defect predication that defect data is difficult to obtain and types are not balanced are solved; and the accuracy of the predicated result is extremely improved.

Description

technical field [0001] The invention relates to software prediction, in particular to a software defect prediction method based on cost-sensitive semi-supervision. Background technique [0002] However, with the increasing scale and complexity of software, and the market's demand for shortening the software development cycle as much as possible, the difficulty of predicting and controlling software quality is gradually increasing, and the cost is also increasing. Moreover, with the development of software technology so far, whether it is a small program or a large system, defects in software have become an indispensable by-product in the software development process, and there is no inspection or verification method that can find and eliminate all defects. Defects. Worse, the later a software defect is discovered, the more expensive it is to fix. Therefore, how to effectively and fully test the software and find the defects in the software as comprehensively as possible un...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/36
CPCG06F11/3608
Inventor 徐玲廖胜平洪明坚葛永新杨梦宁张小洪杨丹王洪星黄晟周末
Owner 重庆优霓空科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products