Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Comprehensive Prediction Method of Multi-feature Software Defects Based on Unbalanced Noise Set

A software defect and comprehensive prediction technology, applied in software testing/debugging, computer components, error detection/correction, etc., can solve the problems of incomplete feature measurement and high unbalanced data sets, and achieve the removal of noise samples and prevent model The effect of performance degradation and dimensionality reduction

Active Publication Date: 2021-07-09
北京高质系统科技有限公司
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to propose a comprehensive prediction method for multi-feature software defects based on an unbalanced noise set, which can solve the problems caused by incomplete feature measurement, high degree of unbalanced data sets, and wrong labels in the data set in the prior art. noise samples etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Comprehensive Prediction Method of Multi-feature Software Defects Based on Unbalanced Noise Set
  • Comprehensive Prediction Method of Multi-feature Software Defects Based on Unbalanced Noise Set
  • Comprehensive Prediction Method of Multi-feature Software Defects Based on Unbalanced Noise Set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0072] The multi-featured software defect integrated prediction method based on unbalanced noise set, including the steps of:

[0073] Step S1, extract code characteristics, development process features, and network structural features from the historical version of the target software, and build the initial data set.

[0074] The traditional defect prediction method mainly uses a single code characteristic as a metric of the software's historical version. The present invention integrates software code characteristics, development process features, and network structural features as metrics for software defect prediction, thereby performing initial data set generation.

[0075] Further, in a preferred embodiment of the present application, step S1 specifically includes:

[0076] Step S101, the history version of the target software performs code scanning and code feature extraction.

[0077] This step primarily extracts the CK metric element of the target software as the code char...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-feature software defect comprehensive prediction method based on an unbalanced noise set, comprising the following steps: constructing an initial data set including code features, development process features and network structure features. Initially undersample the dataset to reduce duplicates in the majority class. The k-nearest neighbor sample set is found for the metric element in the dataset by means of propensity score matching. Through the k-nearest neighbor sample threshold judgment, the noise reduction processing of the data set is realized. Synthesize the minority class in the data set and the minority class in the k-nearest neighbor sample set to eliminate the class imbalance problem in the data set. Adaptively build a variety of machine learning models and select the most suitable machine learning model to predict defects for new versions of software. The invention solves the class imbalance problem commonly existing in software defect prediction. Noise samples are removed through noise discriminant processing based on propensity score matching.

Description

Technical field [0001] The present invention relates to software defect prediction and software reliability technology, and in particular, to the comprehensive prediction method of multi-featured software defects based on unbalanced noise sets. Background technique [0002] In the 21st century, the status of software in social life has become increasing, everywhere, is not limited to engineering, scientific research, economy and other professional fields, and has entered thousands of households, becoming a pivotable tool in people's lives. With the increasing software system, the complexity is geometric, and the shortcomings of software are also increasing, and software defects have also affected all aspects of the people's lives. [0003] Software defect data is the basis for software reliability related research and application. For a long time, defective data used in software reliability is mainly from limited software test data and analysis data in use. Although this type of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/36G06K9/62G06N20/00
CPCG06F11/3684G06F11/3676G06F11/368G06N20/00G06F18/214
Inventor 严亮许嘉熙艾骏
Owner 北京高质系统科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products