Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A software defect prediction method based on less sample data learning

A software defect prediction and sample data technology, applied in software testing/debugging, computer parts, instruments, etc., can solve the problems of model performance dependence, high algorithm complexity, defect historical data, etc., to achieve good prediction results and stable performance Effect

Inactive Publication Date: 2019-06-14
CHONGQING UNIV
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] According to literature analysis, the existing technology has the following problems: 1) The algorithm complexity is high; 2) The performance of the model depends on a large number of training sample data; 3) There is no effective software prediction model that can simultaneously handle limited, high Dimensional and unbalanced software defect history data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A software defect prediction method based on less sample data learning
  • A software defect prediction method based on less sample data learning
  • A software defect prediction method based on less sample data learning

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0052] (1) Experimental data

[0053] The NASA data warehouse contains many different software defect data sets. The experiment extracts 10 data sets from the warehouse for experimental analysis. They are AR1, AR4, AR6, CM1, KC1, KC2, MW1, PC1, PC3 and PC4. The extraction principle is that all experimental data come from public machine learning databases, making the invented method easy to verify and apply; and the selected data sets have the same measurement indicators, and the experimental data can be used directly without losing any measurement information during the experiment .

[0054] Note that the attribute dimensions of these data sets are not uniform, the minimum dimension is 21, the maximum dimension is 57, and their classes are also unbalanced, the minimum imbalance degree is 3, and the maximum imbalance degree is 12. Moreover, the instances of each dataset are limited, with a minimum of 87 and a maximum of 2032. Therefore, it is difficult for conventional machin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a software defect prediction method based on few sample data learning, and belongs to the field of software engineering. The method comprises the following steps of S1, constructing an SDNN based on a twin network, i.e., a twin full connection network; S2, inputting positive sample data and negative sample data, performing few-sample learning through an SDNN network, and extracting high-level depth features of the samples on the data; S3, performing comparative learning and probability output on the high-level deep features extracted in the step S2 by adopting a metriclearning function, adjusting the proportion of positive and negative samples, and setting function learning parameters, so that the metric learning function more pays more attention to learning of defective data features; S4, obtaining a prediction result. Compared with the prior art, the method adopted by the invention has the advantages that a better prediction effect can be obtained on a limited, high-dimensional and unbalanced data set, and the performance is more stable under different unbalance rates; and a better prediction result can be obtained under the conditions of less data, lesstime and the like.

Description

technical field [0001] The invention belongs to the field of software engineering and relates to a software defect prediction method based on few-sample data learning. Background technique [0002] Software defect prediction is to use the existing historical data to predict whether there are defects in software. It is an important task in software maintenance and directly related to software cost and software quality. At present, machine learning algorithms are mainly used to build models, train and evaluate historical data, and these historical data are often limited, high-dimensional and class imbalanced. Traditional machine learning algorithms not only require a large amount of data to model It is difficult to learn effective deep representations from high-dimensional data, especially in the early stages of software testing. [0003] For limited software defect data, LinChen et al. (L.Chen, B.Fang, Z.Shang, Y.Tang, Negative samples reduction in cross-company software def...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F11/36G06K9/62G06K9/66G06N3/04
Inventor 赵林畅尚赵伟赵灵王敏全周晔
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products