Deep learning-based automatic generation method for single sentence abstract defect report title

A defect reporting and deep learning technology, applied in the computer field, which can solve problems such as data set usage, inability to model low-frequency vocabulary, and inability to exhaust the spelling forms of artificially named vocabulary.

Pending Publication Date: 2020-11-06
WUHAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] There is no ready-made high-quality data set, and the quality of data obtained from open source channels varies and cannot be directly used as a data set
The neural network model based on big data learning always needs high-quality data samples that meet the task requirements for training, otherwise problems such as low training efficiency or wrong learning direction may occur during model training
The unfiltered data set obtained directly from the open source community may contain a large number of invalid defect report samples with poor title quality and non-single-sentence summary, which cannot be directly used as a training data set
[0008] Cannot effectively deal with artificially named words with low word frequency. Artificially named words such as identifiers and version numbers are often specific to defect reports and their software warehouses. Different projects usually contain different artificially named words. Therefore, defect reports There are inexhaustible spelling forms of artificially named words in , and the word frequency of each artificially named word is relatively low
However, since the neural summarization model needs to learn, understand and generate target word examples by repeatedly adjusting word embedding vectors and other related parameters recorded in the fixed word example table of the model, the low-frequency words in the corpus are usually not well understood. processed by the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning-based automatic generation method for single sentence abstract defect report title
  • Deep learning-based automatic generation method for single sentence abstract defect report title
  • Deep learning-based automatic generation method for single sentence abstract defect report title

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are For some embodiments of the present invention, those skilled in the art can also obtain other drawings based on these drawings without creative work.

[0079] The purpose of the present invention is to provide an automatic method for automatically generating defect report titles, which can automatically generate high-quality titles for the content of defect reports, so as to alleviate the problem caused by the limited writing time or writing level, only by stating the writing requirements and Explain the problem that the quality of defect report titles that cannot be effectively alleviated is difficult to guarantee. The present invention builds a defect rep...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a deep learning-based automatic generation method for a single sentence abstract defect report title. The method comprises the following steps: acquiring an open source defect report sample as an original data set, constructing three discrimination models to evaluate the sample in the original data set, and selecting a qualified sample to construct a formal data set; introducing a plurality of artificially named vocabulary types and constructing a corresponding regular expression for extracting and positioning artificially named vocabularies in the main body content of the formal data set sample; inserting type marks in front of and behind each artificial named vocabulary, and performing word segmentation and minimization processing on titles of the samples and the marked main content to construct a training data set; building a coding and decoding recurrent neural network model with a copying mechanism, and training on the training data set to obtain an optimized model; and inputting the defect report main body content of the title to be drafted into the optimized model, so that the model can automatically generate the corresponding title. According to the invention, the title writing quality and efficiency of the user are improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method for automatically generating a single-sentence summary defect report title based on deep learning. Background technique [0002] Software defect reports play a vital role in software development and are included in a large number of software repositories. Defect reports with high-quality content help to understand, reproduce, locate and fix software defects. As one of the required contents of a software defect report, a high-quality defect report title can convey the core idea of ​​the specific details of the defect report, help project maintainers quickly understand the defect report, and assign appropriate labels to it more effectively. prioritization, or develop an appropriate defect triage strategy. However, due to inconsistent levels, limited writing time, etc., it is difficult to guarantee the quality of manually written defect report titles in the course of pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/117G06F40/258G06F40/289G06N3/04G06N3/08G06F16/34
CPCG06F40/289G06F40/117G06F40/258G06N3/084G06F16/345G06N3/045
Inventor 谢晓园陈崧强姬渊翔晋硕尹邦国
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products