Unlock instant, AI-driven research and patent intelligence for your innovation.

Warning classification method based on cost-sensitive neural network with oversampling operation

A cost-sensitive, neural network technology, applied in the field of software static analysis, can solve problems such as data imbalance, software crash, and cost inequality, and achieve the effect of stable recall rate and low classification cost

Active Publication Date: 2018-10-02
重庆优霓空科技有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, there are two types of problems in the warning classification process: the first type is the problem of unequal costs caused by misclassification, classifying a false positive warning as a valid warning (misclassification I), and classifying a valid warning as a false positive warning. Report a warning (misclassification II), and the resulting cost is not equal. It is different from the software defect prediction category. Misclassification II may cause software crashes, while misclassification I only requires developers to spend 5 minutes for review, that is, misclassification The cost of II will be much higher than that of misclassification I; the second type of problem is a class imbalance problem. As mentioned above, false positive warnings may account for the vast majority of overall warnings, and the experimental data of the present invention is also extremely unbalanced
The purpose of previous warning classification techniques is often to reduce the error rate of classification, while ignoring the different costs caused by the above-mentioned different classification errors and the imbalance of data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Warning classification method based on cost-sensitive neural network with oversampling operation
  • Warning classification method based on cost-sensitive neural network with oversampling operation
  • Warning classification method based on cost-sensitive neural network with oversampling operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be described in further detail below.

[0033] The warning classification method of the cost-sensitive neural network based on the oversampling operation comprises the following steps:

[0034] S1: Use the FindBugs tool to analyze the jar files of a series of versions of the target software to obtain static warnings of the target software; the target software in the present invention refers to software that is ready to classify its warnings;

[0035]FindBugs is an open source project created by Bill Pugh and David Hovemeyer to find programming errors in Java code by manipulating Java bytecode. It identifies more than 400 different types of source code warnings in Java programs by matching defect patterns. These source code warnings can be divided into four levels, which are scary (scariest), scary (scary), troubling (troubling) and concern (of concern), which are used to indicate the existence of a warning The degree of harm to the software ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an oversampling operation-based warning classification method for a cost-sensitive neural network. The method comprises the steps of S1, analyzing jar files with a series of versions in target software by using a FindBugs tool to obtain a static warning of the target software; S2, annotating the static warning obtained in the step S1; and S3, changing distribution of samples in a sample set in an oversampling manner by adopting the BP neural network to obtain a new sample set, training a classifier by using samples in the new sample set, performing prediction classification on all the samples in the sample set by using the classifier, and predicting all the samples in the sample set to be effective or false warnings . According to the method, an effective warning recall ratio is averagely increased by 44.07%, a relatively high and stable recall ratio can be quickly reached, and lower classification cost can be achieved in comparison with that in a conventional neural network method.

Description

technical field [0001] The invention relates to software static analysis, in particular to the classification of software static warnings, in particular to a warning classification method based on a cost-sensitive neural network based on oversampling operations. Background technique [0002] During the software development process, a lot of resources are consumed in finding and resolving bugs in software. Except for some large companies that use automated testing methods during the development process, in most small and medium-sized software companies or project teams, software testing is still in the manual processing stage. Its testing tasks are often heavy and inefficient. Smaller functional changes require regression testing of other functions. In the case of limited resources and time constraints, software testing work is usually not completed smoothly, resulting in many undiscovered errors remaining in the software system. The lack of testing methods has become a bot...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/36G06N3/08
CPCG06F11/3684G06N3/084
Inventor 徐玲潘志辉洪明坚葛永新杨梦宁张小洪杨丹王洪星黄晟
Owner 重庆优霓空科技有限公司