A method for enhancing automatic bug report distribution by adopting a weighted optimization training set

A training set and report technology, applied in special data processing applications, unstructured text data retrieval, text database clustering/classification, etc., can solve the problem that the classification effect is not particularly good, and reduce noise words and redundancy. Instances, saving time and labor costs, and improving work efficiency

Inactive Publication Date: 2019-01-22
DALIAN MARITIME UNIVERSITY
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If the natural language description of the bug report contains a lot of noise, no matter how optimized the classification algorithm is, the classification effect will not be particularly good

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for enhancing automatic bug report distribution by adopting a weighted optimization training set
  • A method for enhancing automatic bug report distribution by adopting a weighted optimization training set
  • A method for enhancing automatic bug report distribution by adopting a weighted optimization training set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] In order to make the technical solutions and advantages of the present invention more clear, the technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the drawings in the embodiments of the present invention:

[0016] Such as figure 1 Shown is a method of using weighted optimization training set to enhance automatic bug report distribution, which specifically adopts the following steps:

[0017] S1: Obtain the original training set data from the bug warehouse, and preprocess the original training set: first filter out the bug reports handled by inefficient developers from the original training set, that is, the inefficient developers are those who deal with a small number of bugs, If there are too few training examples belonging to a certain category in the training set, the classifier cannot learn the characteristics of this category. For each bug report in the screened data set, the short descri...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for enhancing automatic bug report distribution by adopting a weighted optimization training set. By weighting the bug report dataset, the information frequency in theshort description is improved. Combined with feature selection algorithm and instance selection algorithm, a smaller and higher quality training set is obtained by reducing noise words and redundantinstances, which improves the accuracy of bug classification, saves the time cost and labor cost of bug assignment, and improves the work efficiency.

Description

technical field [0001] The invention relates to the technical field of data processing and classification, in particular to a method for enhancing automatic bug report distribution by adopting a weighted optimization training set. Background technique [0002] Currently, some researchers try to solve the bug report classification problem. In the document [1], G.C. Murphy et al. first proposed to convert the bug assignment problem into a text classification problem to solve it, that is, to apply text classification technology to the bug warehouse. Anvik semi-automated the problem of bug assignment in the document [2] and others, first using text classification technology to train and predict multiple developers, and then use these developers as candidates for experts to choose. Jeong et al. proposed the concept of tossing graph in document [3], and improved classification accuracy by filtering the classification results through tossing graph. Xuan et al. used a semi-supervi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/332
Inventor 魏苗苗陈荣李辉郭世凯唐文君
Owner DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products