A software defect prediction method based on less sample data learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A software defect prediction and sample data technology, applied in software testing/debugging, computer parts, instruments, etc., can solve the problems of model performance dependence, high algorithm complexity, defect historical data, etc., to achieve good prediction results and stable performance Effect

Inactive Publication Date: 2019-06-14

CHONGQING UNIV

View PDF2 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] According to literature analysis, the existing technology has the following problems: 1) The algorithm complexity is high; 2) The performance of the model depends on a large number of training sample data; 3) There is no effective software prediction model that can simultaneously handle limited, high Dimensional and unbalanced software defect history data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

specific Embodiment

[0052] (1) Experimental data

[0053] The NASA data warehouse contains many different software defect data sets. The experiment extracts 10 data sets from the warehouse for experimental analysis. They are AR1, AR4, AR6, CM1, KC1, KC2, MW1, PC1, PC3 and PC4. The extraction principle is that all experimental data come from public machine learning databases, making the invented method easy to verify and apply; and the selected data sets have the same measurement indicators, and the experimental data can be used directly without losing any measurement information during the experiment .

[0054] Note that the attribute dimensions of these data sets are not uniform, the minimum dimension is 21, the maximum dimension is 57, and their classes are also unbalanced, the minimum imbalance degree is 3, and the maximum imbalance degree is 12. Moreover, the instances of each dataset are limited, with a minimum of 87 and a maximum of 2032. Therefore, it is difficult for conventional machin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a software defect prediction method based on few sample data learning, and belongs to the field of software engineering. The method comprises the following steps of S1, constructing an SDNN based on a twin network, i.e., a twin full connection network; S2, inputting positive sample data and negative sample data, performing few-sample learning through an SDNN network, and extracting high-level depth features of the samples on the data; S3, performing comparative learning and probability output on the high-level deep features extracted in the step S2 by adopting a metriclearning function, adjusting the proportion of positive and negative samples, and setting function learning parameters, so that the metric learning function more pays more attention to learning of defective data features; S4, obtaining a prediction result. Compared with the prior art, the method adopted by the invention has the advantages that a better prediction effect can be obtained on a limited, high-dimensional and unbalanced data set, and the performance is more stable under different unbalance rates; and a better prediction result can be obtained under the conditions of less data, lesstime and the like.

Description

technical field [0001] The invention belongs to the field of software engineering and relates to a software defect prediction method based on few-sample data learning. Background technique [0002] Software defect prediction is to use the existing historical data to predict whether there are defects in software. It is an important task in software maintenance and directly related to software cost and software quality. At present, machine learning algorithms are mainly used to build models, train and evaluate historical data, and these historical data are often limited, high-dimensional and class imbalanced. Traditional machine learning algorithms not only require a large amount of data to model It is difficult to learn effective deep representations from high-dimensional data, especially in the early stages of software testing. [0003] For limited software defect data, LinChen et al. (L.Chen, B.Fang, Z.Shang, Y.Tang, Negative samples reduction in cross-company software def...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F11/36G06K9/62G06K9/66G06N3/04

Inventor赵林畅尚赵伟赵灵王敏全周晔

OwnerCHONGQING UNIV

A software defect prediction method based on less sample data learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

specific Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology