Method for detecting repeated software defect reports

A defect reporting and software technology, applied in software testing/debugging and related fields

Active Publication Date: 2014-08-06
重庆优霓空科技有限公司
Cites: 2; Cited by: 19

AI Technical Summary

Problems solved by technology

Compared with the method of Runeson P., the accuracy has increased by about 20%, but it remains lower than the results reported by Wang X. Y. et al.

Examples

Embodiment

[0137] To evaluate the results of the above detection methods, we use recall and precision as evaluation criteria.

[0138] $$\text{Recall Rate} = \frac{N_{detected}}{N_{total}} \qquad (8)$$

[0139] $$\text{Precision Rate} = \frac{N_{detected}}{N_{detectedall}} \qquad (9)$$

[0140] where $N_{detected}$ is the number of duplicate reports correctly detected by the experiment for the test sample, $N_{total}$ is the total number of duplicate reports the test sample actually has, and $N_{detectedall}$ refers to th...
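The two evaluation criteria can be illustrated with a minimal Python sketch. The definition of the precision denominator is truncated in the text above, so the sketch assumes it is the total number of reports the method flagged as duplicates (the usual precision denominator). The function names and the example counts are hypothetical and do not come from the patent.

```python
def recall_rate(n_detected: int, n_total: int) -> float:
    """Equation (8): correctly detected duplicates / duplicates actually present."""
    return n_detected / n_total if n_total else 0.0


def precision_rate(n_detected: int, n_detected_all: int) -> float:
    """Equation (9): correctly detected duplicates / all reports flagged.

    Assumption: n_detected_all counts every report flagged as a duplicate,
    whether correctly or not; the source definition is truncated.
    """
    return n_detected / n_detected_all if n_detected_all else 0.0


# Hypothetical counts, for illustration only.
n_detected, n_total, n_detected_all = 30, 50, 40
recall = recall_rate(n_detected, n_total)
precision = precision_rate(n_detected, n_detected_all)
print(f"recall={recall:.2f} precision={precision:.2f}")
```

Returning 0.0 for an empty denominator is a convenience choice in this sketch, not something the text specifies.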

Abstract

The invention relates to a method for detecting repeated (duplicate) software defect reports. First, a training sample set and a test sample set are extracted from a software defect report database, and a topic model is built on the training sample set. The topic model is then applied to the test samples to obtain a document-topic matrix, from which the topic similarity between two test samples is calculated. Classification information is extracted from the test samples to calculate the classification information similarity, and the classification information similarity is multiplied by the topic similarity to obtain the LDA (Latent Dirichlet Allocation) similarity between the two test samples. Second, the N-gram sequences of the test samples are extracted to calculate the N-gram similarity, and a weighted sum of the N-gram similarity and the LDA similarity gives the overall similarity between the two test samples. Finally, if the overall similarity is greater than or equal to a preset threshold, the two test samples are judged to be duplicate defect reports. The method greatly improves the accuracy of the detection result, keeps duplicate defect reports from being dispatched to developers as far as possible, and saves human resources.
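The combination step described in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract does not say how each similarity is computed, so the sketch assumes cosine similarity over per-report topic distributions, the fraction of matching fields for classification information, and Jaccard overlap of character trigrams for the N-gram similarity. The weight `alpha`, the threshold value, and all function names are hypothetical. The topic distributions are taken as given inputs (rows of a document-topic matrix) rather than trained here.

```python
import math


def cosine(u, v):
    """Cosine similarity of two topic-distribution vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def category_similarity(a: dict, b: dict) -> float:
    """Fraction of classification fields with equal values (assumed measure)."""
    keys = set(a) | set(b)
    if not keys:
        return 0.0
    matching = sum(1 for k in keys if k in a and k in b and a[k] == b[k])
    return matching / len(keys)


def char_ngrams(text: str, n: int = 3) -> set:
    """Set of character n-grams of the lowercased text."""
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def ngram_similarity(a: str, b: str, n: int = 3) -> float:
    """Jaccard overlap of character n-gram sets (assumed measure)."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga or not gb:
        return 0.0
    return len(ga & gb) / len(ga | gb)


def overall_similarity(topic_a, topic_b, cat_a, cat_b, text_a, text_b,
                       alpha: float = 0.5) -> float:
    """Weighted sum of N-gram similarity and LDA similarity.

    LDA similarity = topic similarity * classification similarity,
    as the abstract describes; alpha is a hypothetical weight.
    """
    lda_sim = cosine(topic_a, topic_b) * category_similarity(cat_a, cat_b)
    return alpha * ngram_similarity(text_a, text_b) + (1 - alpha) * lda_sim


def is_duplicate(topic_a, topic_b, cat_a, cat_b, text_a, text_b,
                 alpha: float = 0.5, threshold: float = 0.6) -> bool:
    """Two reports count as duplicates when overall similarity >= threshold."""
    score = overall_similarity(topic_a, topic_b, cat_a, cat_b,
                               text_a, text_b, alpha)
    return score >= threshold
```

Because the classification similarity multiplies the topic similarity, two reports filed against entirely different classification fields get an LDA similarity of zero regardless of how close their topics are, and only the N-gram term can then contribute to the overall score.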

Description

Technical field

[0001] The invention belongs to the technical field of software maintenance, and in particular relates to a method for detecting duplicate defect reports.

Background

[0002] As software projects grow in size, software becomes increasingly complex. The software maintenance phase now accounts for 2/3 of the cost of the software life cycle. A software defect report is a document in which software testers or users describe possible defects and errors in the software during maintenance. For open source software such as Eclipse, Firefox, and Open Office, as the scale grows and versions are updated, different users submit defect reports to the defect tracking management system every day, and the system accumulates a large number of duplicate defect reports. For example, in the defect database of the open source software Firefox, the proportion of duplicate defects is as high as 30%. In 2005, Mozilla programmers reported that "everyd...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F11/36
Inventor: 徐玲, 邹杰, 葛永新, 杨梦宁, 洪明坚, 张小洪, 蒋欣志
Owner: 重庆优霓空科技有限公司