Multi-feature software defect comprehensive prediction method based on unbalanced noise set

A software defect and comprehensive prediction technology, applied in software testing/debugging, computer components, error detection/correction, etc., can solve problems such as incomplete feature measurement and high unbalanced data sets, so as to prevent model performance degradation and solve The effect of class imbalance problem, reducing algorithm complexity and resource overhead

Active Publication Date: 2020-10-16
北京高质系统科技有限公司
View PDF7 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to propose a comprehensive prediction method for multi-feature software defects based on an unbalanced noise set, which can solve the problems caused by incomplete feature measurement, high degree of unbalanced data sets, and wrong labels in the data set in the prior art. noise samples etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-feature software defect comprehensive prediction method based on unbalanced noise set
  • Multi-feature software defect comprehensive prediction method based on unbalanced noise set
  • Multi-feature software defect comprehensive prediction method based on unbalanced noise set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] The multi-feature software defect comprehensive prediction method based on unbalanced noise set of the present invention comprises the following steps:

[0073] Step S1, extracting code features, development process features and network structure features from the historical version of the target software to construct an initial data set.

[0074] Traditional defect prediction methods mainly use a single code feature as the measurement unit of the historical version of the software. The present invention integrates software code features, development process features and network structure features as measurement elements for software defect prediction, thereby generating initial data sets.

[0075] Further, in a preferred embodiment provided by this application, step S1 specifically includes:

[0076] Step S101, performing code scanning and code feature extraction on the historical version of the target software.

[0077] This step mainly extracts the CK metrics of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-feature software defect comprehensive prediction method based on an unbalanced noise set. The multi-feature software defect comprehensive prediction method comprises the following steps: constructing an initial data set containing code features, development process features and network structure features; performing preliminary undersampling processing on the data set, and reducing repeated data in most classes; searching a k nearest neighbor sample set for metric elements in the data set through a tendency score matching method; realizing noise reduction processing of the data set through k nearest neighbor sample threshold judgment; performing sample synthesis on the minority class in the data set and the minority class in the k nearest neighbor sample setto eliminate the class imbalance problem of the data set; and adaptively constructing a plurality of machine learning models and selecting the most suitable machine learning model to perform defect prediction on the new version of software. According to the method, the problem of class imbalance generally existing in software defect prediction is solved. And noise samples are removed based on noise discrimination processing of tendency score matching.

Description

technical field [0001] The invention relates to the technical fields of software defect prediction and software reliability, in particular to a comprehensive prediction method for multi-feature software defects based on an unbalanced noise set. Background technique [0002] Since the 21st century, the status of software in social life has been increasing, and it is ubiquitous. It is not only limited to professional fields such as engineering, scientific research, and economics, but has also entered thousands of households and has become an important tool in people's lives. With the increasing scale and complexity of software systems, software defects are also increasing day by day, and software defects have also affected all aspects of people's lives. [0003] Software defect data is the basis of research and application related to software reliability. For a long time, the defect data used in the field of software reliability mainly comes from limited software testing data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/36G06K9/62G06N20/00
CPCG06F11/3684G06F11/3676G06F11/368G06N20/00G06F18/214
Inventor 严亮许嘉熙艾骏
Owner 北京高质系统科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products