CNN-based bug positioning method combining source code semantics and grammatical features

A positioning method and source code technology, applied in the direction of code compilation, program code conversion, neural learning method, etc., can solve the problem of high cost of manual bug positioning, and achieve the effect of promoting the overall maintenance level, improving the maintenance efficiency, and reducing the time cost.

Inactive Publication Date: 2020-02-21
NANJING UNIV
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The problem to be solved by the present invention is: in the software system maintenance process, the maintenance team will locate the relevant source code files according to the bug report submitted by the user, and assign the repair task to the corresponding developer. However, the cost of manual bug location is very high. Big

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • CNN-based bug positioning method combining source code semantics and grammatical features
  • CNN-based bug positioning method combining source code semantics and grammatical features
  • CNN-based bug positioning method combining source code semantics and grammatical features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Several key technologies involved in the present invention are related knowledge and technologies of word2vec word vector model, convolutional neural network, natural language processing technology, gradient descent algorithm and abstract syntax tree extraction.

[0020] 1. word2vec word embedding model

[0021] Word vectors are vectors used to represent words, and can also be considered as feature vectors of words. The technique of mapping words into real number field vectors is also called word embedding. word2vec represents each word as a fixed-length vector, and enables these vectors to better express the similarity and analogy between different words. word2vec contains two models: Skip-gram and CBOW (continuousbagofwords). In the present invention, mainly use the Skip-gram model in word2vec to carry out word embedding to bug report and source code.

[0022] 2. Natural language processing tools and technologies

[0023] Natural Language Processing (NLP) is a subd...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a convolutional neural network-based bug positioning method combining source code semantics and grammatical features. The method is characterized by providing the method for positioning a source code file for generating a bug according to a bug report submitted by a user; according to the method, a CNN is used for carrying out feature extraction on a bug report, source codesemantics and source code syntax, then the features are fused, unified features are extracted, finally, the CNN is used for carrying out correlation prediction on the bug report and source codes, andTopK source code files related to the bug report are obtained. Therefore, when a user submits a new bug report, a maintainer can perform positioning of the related source code file in time and notifya developer of repairing, and the bug repairing efficiency and the project maintenance efficiency are improved. The whole process of the method is shown in the attached drawing of the abstract.

Description

technical field [0001] The invention belongs to the field of software maintenance, especially the field of testing for locating and repairing software defects, and is used for locating relevant source code files in time when a user submits a bug report, so that developers or maintenance personnel can repair the bug. Background technique [0002] Software quality assurance is crucial to the success of a software system. However, often due to tight development plans and limited testing resources, it is basically impossible to find every existing bug in an increasingly large and complex software system before the software system is officially released. Therefore, software systems are always released with potential bugs, which means that the later maintenance of software systems is also very important, and bug location is an important part of the software system maintenance process. [0003] During the use of the software system, after the user finds a bug, a corresponding bug ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/41G06F11/36G06N3/04G06N3/08
CPCG06F11/3628G06F8/42G06N3/08G06N3/045G06N3/044
Inventor 房春荣蒋燕史洋洋陈振宇李玉莹
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products