Software defect prediction method and system based on TAN semi-naive Bayesian network

A software defect prediction and Bayesian network technology, which is applied in software testing/debugging, computer parts, character and pattern recognition, etc., can solve the problem of low prediction accuracy, poor computing performance, and fully connected Bayesian network classification prediction model Problems such as large space overhead, to achieve the effect of reducing space-time overhead, repairing labor costs, and improving prediction performance

Active Publication Date: 2022-01-07
EAST CHINA INST OF COMPUTING TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this patent cannot solve the problems of large space overhead, poor computing performance, and low prediction accuracy of the fully connected Bayesian network classification prediction model.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software defect prediction method and system based on TAN semi-naive Bayesian network
  • Software defect prediction method and system based on TAN semi-naive Bayesian network
  • Software defect prediction method and system based on TAN semi-naive Bayesian network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0080] like figure 1 According to a kind of software defect prediction method based on TAN semi-naive Bayesian network provided by the present invention, the specific implementation method includes the following steps:

[0081] Step S1: Collect software defect records composed of software function description, defect description, defect type and other information in historical projects, format and store data after data sorting, cleaning and optimization, and form a training data set that can be used for software defect prediction.

[0082] Step S2: Use the software function descriptions of historical items in the training data set as the training text set D, summarize all defect types in the training data set as the prediction classification set Y, and use the defect types associated with the software function descriptions as the training text set D Classification label c for each training text.

[0083] Specifically include the following steps:

[0084] Step S21: format the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a software defect prediction method and system based on a TAN semi-naive Bayesian network. The method comprises the following steps: collecting historical defect records to form a training data set; summarizing historical items in the training data set; performing word segmentation on a to-be-predicted text to obtain a segmented word set; carrying out merging processing on repeated segmented words in the segmented word set to form a feature word set; calculating a condition mutual information value between any two feature words in the feature word set; constructing an undirected graph by taking each feature word in the feature word set as a node and the mutual information value as an edge; selecting any node as a root node, taking the outward direction of the root node as the circulation direction between the nodes, sequentially setting the circulation direction between the nodes in the undirected graph set in a recursive mode, and forming a directed acyclic graph; constructing a TAN semi-naive Bayesian network, calculating and comparing the posterior probability value of each classification in the prediction classification set based on the word segmentation set, and taking the classification with the maximum posterior probability value as a final software defect prediction result.

Description

technical field [0001] The present invention relates to the technical field of software defect prediction, in particular to a software defect prediction method and system based on TAN semi-naive Bayesian network. Background technique [0002] The application fields of software are very extensive. In many modern large-scale equipment systems, more and more key functions are realized by software. In a software system with a scale of more than one million lines of code, more than 80% of the functions are realized by software. At the same time, the higher the proportion of software, the more serious the problems caused by software defects. In order to effectively avoid the problems caused by software defects, it is necessary to analyze and summarize software defects with representative characteristics from the software defect data of historical projects, and use intelligent methods such as machine learning to train the sorted out software defect data. Research software require...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/36G06K9/62
CPCG06F11/3604G06F18/24155G06F18/214
Inventor 龙刚吴振宇孙佳美
Owner EAST CHINA INST OF COMPUTING TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products