Unlock instant, AI-driven research and patent intelligence for your innovation.

Software defect detection model establishment and detection method and system based on Github

A technology for detecting model and software defects. It is applied in software testing/debugging, error detection/correction, instruments, etc. It can solve problems such as low coverage, inability to detect complex and diverse models, and unbalanced samples, and achieve high detection accuracy. rate effect

Active Publication Date: 2020-07-28
NORTHWEST UNIV
View PDF7 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the deficiencies in the prior art, the present invention provides a Github-based software defect detection model establishment, detection method and system to solve the existing method loopholes The library is updated slowly, there are few types of defects, and the defect coverage rate is low. As a result, the defect static detection technology must face the problems of sample imbalance, the model cannot be used to detect complex and diverse defects, and the model fails quickly.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software defect detection model establishment and detection method and system based on Github
  • Software defect detection model establishment and detection method and system based on Github
  • Software defect detection model establishment and detection method and system based on Github

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] "Slice" refers to a multi-line code fragment obtained by cutting the code according to certain rules. The cutting method can follow data flow or control flow and other custom methods. In the present invention, the segmentation is based on two parts of information, data flow and control flow, with the purpose of extracting multi-line codes with semantic relevance and highlighting defect information.

[0066] "Change record" refers to every time the warehouse manager on Github changes, corrects and adds codes in the warehouse, it is called the change record of the warehouse, that is, commit.

[0067] "Mixed change records" means that a commit contains many reasons for modification, and it is impossible to intuitively see which part of the code is modified for what reason.

[0068] "Verb / direct object mode" means that the change record description in the change record conforms to the verb + direct object rule, such as the change record description "fix a bug", which has a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a software defect detection model establishment and detection method and system based on Github, and the method comprises the steps: firstly carrying out the preprocessing of adata set in a Github platform, and obtaining a change record meeting the requirements and a Bug-Fix file pair corresponding to the change record; processing the change records meeting the requirements to generate vectors and labels of the slices; and finally, inputting the vector and the label of the slice into a bidirectional LSTM model for training and learning to obtain a trained detection model. And for a to-be-detected target file, processing the to-be-detected target file to obtain a vector of the target file, and inputting the vector into the detection model to obtain a detection result. The method provided by the invention solves the problems of data imbalance, insufficient data diversity and poor model generalization ability which must be faced by a too small data set in currentdefect detection based on source code learning; and higher detection accuracy can be achieved.

Description

technical field [0001] The invention belongs to the technical field of code auditing, and in particular relates to a Github-based software defect detection model establishment, detection method and system. Background technique [0002] A variety of defect analysis tools already exist in the field of code auditing, which attempt to detect common defects in software. Static detection tools such as Clang can do this without executing the program. Dynamic inspection tools detect defects by repeatedly executing many test cases on real or virtual processors. Both static and dynamic detection tools are based on manual definition of defect rules, so they are limited to manual design rules and cannot guarantee complete testing of the code base. Symbolic execution replaces input data with symbolic values, and performs analysis and diagnosis on the control flow diagram of the program. Although it can probe all feasible program paths, symbolic execution is expensive and does not scal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F11/36
CPCG06F11/3688G06F11/3684Y02D10/00
Inventor 柯鑫叶贵鑫汤战勇尹小燕龚晓庆房鼎益
Owner NORTHWEST UNIV