Software defect prediction method, system, device, and medium based on balanced subsets

A software defect prediction technology based on balanced subsets, applied in the field of data processing. It addresses problems such as low prediction accuracy and poor efficiency by eliminating data imbalance and improving accuracy.

Status: Inactive. Publication date: 2022-05-27
湖南工商大学
AI Technical Summary

Problems solved by technology

[0008] In view of this, the embodiments of the present invention provide a software defect prediction method, system, device, and medium based on balanced subsets, which at least partially solve the problems of low prediction accuracy and poor efficiency in the prior art.

Examples

Detailed Description of Embodiments

[0045] The embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0046] The embodiments of the present invention are described below through specific examples, and those skilled in the art can readily understand other advantages and effects of the present invention from the contents disclosed in this specification. The described embodiments are only some, not all, of the embodiments of the present invention. The present invention can also be implemented or applied through other specific embodiments, and the details in this specification can be modified or changed for different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the following embodiments, and the features in those embodiments, may be combined with each other where no conflict arises. Based on the embodiments of the present invention, all other emb...

Abstract

An embodiment of the invention provides a software defect prediction method, system, device, and medium based on balanced subsets, belonging to the technical field of data processing. The method comprises: obtaining an unbalanced data set generated during operation of the target software, the unbalanced data set comprising an original multi-class (majority-class) set and an original few-class (minority-class) set; randomly dividing the original multi-class set XN into V subclasses with the same number of samples; selecting a division strategy according to a division instruction input by a user, the division strategy being either a balanced subset construction strategy based on random division or a balanced subset construction strategy based on hierarchical division; and, according to the division strategy, all the subclasses, and the original few-class set, constructing V balanced subsets corresponding to the unbalanced data set and carrying out ensemble learning to obtain the defect class of the target software. Under this scheme all original samples are retained and no new samples are introduced, which improves classification performance and prediction accuracy.
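
The random-division strategy summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the choice of V so that each part matches the size of the few-class set, the nearest-centroid base learner, and the majority vote are all hypothetical, since the abstract only says that V balanced subsets are built and that ensemble learning is applied. The hierarchical-division strategy is not shown.

```python
import random
from collections import Counter


def build_balanced_subsets(majority, minority, v, seed=0):
    """Randomly split `majority` into `v` near-equal parts and pair each
    part with the full `minority` set (hypothetical helper)."""
    if not 1 <= v <= len(majority):
        raise ValueError("v must be between 1 and len(majority)")
    rng = random.Random(seed)
    shuffled = list(majority)
    rng.shuffle(shuffled)
    # Striding keeps every sample exactly once; part sizes differ by at most
    # one (an assumption for the case where len(majority) % v != 0).
    parts = [shuffled[i::v] for i in range(v)]
    return [(part, list(minority)) for part in parts]


def centroid(samples):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(samples)
    return [sum(col) / n for col in zip(*samples)]


def train_ensemble(subsets):
    """Fit one nearest-centroid model per balanced subset.
    The base learner is a stand-in; the source does not name one."""
    return [(centroid(maj), centroid(mino)) for maj, mino in subsets]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict(models, x):
    """Majority vote over the per-subset models.
    Returns 1 for the few-class label, 0 for the multi-class label."""
    votes = Counter(
        1 if sq_dist(x, c_min) < sq_dist(x, c_maj) else 0
        for c_maj, c_min in models
    )
    return votes.most_common(1)[0][0]


# Toy data: 20 multi-class samples near the origin, 4 few-class samples far away.
majority = [[float(i % 5), float(i % 3)] for i in range(20)]
minority = [[10.0, 10.0], [11.0, 9.0], [9.0, 11.0], [10.0, 11.0]]

# Assumed choice: V = 20 / 4 = 5, so each part has as many samples as `minority`.
subsets = build_balanced_subsets(majority, minority, v=5)
models = train_ensemble(subsets)
print(predict(models, [10.0, 10.0]))
```

Because each subset reuses the whole few-class set and the parts of the multi-class set are disjoint, every original sample is used and no synthetic sample is created, which matches the property the abstract emphasizes.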

Description

Technical field

[0001] Embodiments of the present invention relate to the technical field of data processing, and in particular to a method, system, device, and medium for predicting software defects based on balanced subsets.

Background

[0002] Common software defect prediction methods assume that all classes in the dataset are balanced, that is, that their sample sizes are roughly equal. In actual software operation, however, defective samples are often far fewer than non-defective ones. This imbalance is widely regarded as one of the main reasons for the poor performance of software defect prediction models. It is therefore necessary to construct a software defect prediction model designed for imbalanced data.

[0003] In recent years, various data rebalancing methods for software defect prediction have been proposed. Although some of them achieve better performance, they have the following shortcomings:

[0004] For upsampling methods, they need to sy...

Application Information

IPC(8): G06K9/62
CPC: G06F18/23213, G06F18/10, G06F18/24, G06F18/214
Inventors: 张新玉, 余绍黔, 李晓翠, 史庆宇
Owner: 湖南工商大学