Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Method for Automatic Classification of Software Defects Based on Association Rules

A technology for automatic classification and software defects, applied in neural learning methods, text database clustering/classification, computer components, etc., can solve problems such as applications without associations, inability to represent semantic information, and failure to mine fine-grained associations. Achieve strong scalability, improve accuracy, and improve efficiency

Active Publication Date: 2022-05-13
YANGZHOU UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, none of the above studies have explored the relationship between fine-grained defect appearances and cause categories, nor have they applied these relationships to automatic defect classification.
In addition, there are some works dedicated to the research of automatic defect classification technology, but most of these technologies use artificial feature engineering and shallow neural network classification models, and artificial feature engineering does not have strong versatility, and often needs to be combined with the understanding of different software. Choose different features
In addition, the text context is ignored in the text representation, each word is independent of each other, and semantic information cannot be represented

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Automatic Classification of Software Defects Based on Association Rules
  • A Method for Automatic Classification of Software Defects Based on Association Rules
  • A Method for Automatic Classification of Software Defects Based on Association Rules

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0036] The software defect automatic classification method based on association rules includes the following contents:

[0037] Step 1. Collect defect reports of 2 open source software projects to build a defect data set, and extract the title, description and comments from each defect report. The distribution of the number of reports collected is shown in Table 1 below. Convert the information extracted from the defect report into a txt document, and use the Natural Language Processing Toolkit (NLTK) to perform data cleaning on the defect document, such as deleting links, code snippets and XML tags, etc. The document is further divided into sentences and words, and each document is converted into a series of tokens.

[0038]Table 1 Distribution table of defect data quantity on 2 items

[0039] software Bugsets Document Sentence Token Mozilla 200K 1000 63452 807534 Eclipse 50K 400 21380 249077 Total 250K 1400 84832 1056611

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for automatically classifying software defects based on association rules, which comprises the following steps: first extracting the text content in the defect report, and preprocessing it; then randomly sampling the defects whose state is VERIFIED FIXED, and constructing defects to be classified Then according to the defect appearance and defect occurrence cause, each defect in the to-be-classified defect set is labeled with the defect appearance category and the defect occurrence cause category to obtain the defect appearance category classification set and the defect occurrence cause category classification set; and then excavate different defect appearance categories, The association rules between defect occurrence cause categories are selected, and strong association rules are selected from them and converted into a relationship matrix; finally, the deep learning method is used to train the classification set of defect appearance categories to obtain an automatic defect classifier. The present invention simultaneously performs fine-grained automatic classification of defects from two dimensions of defect appearance and cause, which solves the shortcoming of inability to accurately classify defects due to a small amount of defect information in the prior art.

Description

technical field [0001] The invention belongs to the field of software maintenance, in particular to an automatic software defect classification method based on association rules. Background technique [0002] Software defects are one of the most serious problems to be solved in the process of software quality maintenance. With the continuous expansion of software scale, a large number of new defects are introduced, and intelligent defect repair has become the focus of industry research. To fix defects in a timely and effective manner, you first need to identify the appearance of the defect and the cause of the defect. However, the defect reports submitted by users are often incomplete and inaccurate. They only include the steps to reproduce the defect, the actual results and the expected results. Developers can relatively easily However, it takes a lot of time to analyze the cause of the defect. Accurate defect classification is an important means to improve the efficiency ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06K9/62G06N3/08
CPCG06N3/08G06F18/214
Inventor 李斌周澄孙小兵
Owner YANGZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products