Test program plagiarism detection method based on support vector machine

A support vector machine and testing program technology, applied in software testing/debugging, error detection/correction, computer components, etc., can solve the problem of affecting the accuracy of code cloning analysis, without relatively mature applications or tools, and cannot truly reflect Test code plagiarism and other issues to achieve the effect of filling the gap in detection technology, fair competition, and eliminating the need for manual detection

Pending Publication Date: 2020-07-28
NANJING UNIV
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] For unit test test case code similarity detection, there are currently no relatively mature applications or tools in academia and industry
Moreover, because contestants often only copy and paste the code of a few test cases, different unit test cases are completely independent, unlike source codes that have dependencies; and contestants will also perform a certain amount of test case code Modifications (for example, modifying literals, inserting and deleting one or several statements), so that the code similarity detection of test cases directly performed by existing source code similarity detection tools often cannot truly reflect the plagiarism of test codes, affecting Analysis accuracy for tools like code cloning, plagiarism checker, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Test program plagiarism detection method based on support vector machine
  • Test program plagiarism detection method based on support vector machine
  • Test program plagiarism detection method based on support vector machine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] In order to better understand the technical content of the present invention, specific examples are given and described as follows in conjunction with the accompanying drawings.

[0077] figure 1 It is an overall frame diagram of a test program plagiarism detection method based on a support vector machine implemented in the present invention.

[0078] A test program plagiarism detection method based on a support vector machine is characterized in that it includes the following steps.

[0079] Step 1 Program cutting and analysis: Given the source program of the project in the developer test competition, that is, the program to be tested, and the test program submitted by the contestants participating in the developer test competition, perform static analysis on the files in each program to be tested, and obtain the The test class and the method to be tested. Analyzing each test method, you can know the class name, method name, and type sequence of each parameter of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a test program plagiarism detection method based on a support vector machine. The method comprises: performing cutting and static analysis on a to-be-tested program and a testprogram to obtain a to-be-tested method mapping set and a test method mapping set; secondly, traversing the players pairwise, calculating the similarity of the test fragments, and summarizing to obtain a similarity set; secondly, selecting an appropriate kernel function and a reference point to establish a support vector machine model, and optimizing the support vector machine model; and finally,for other test programs, calculating a similarity set and inputting the similarity set into the support vector machine to judge plagiarism conditions among the test programs. The invention aims to fill the blank of a test program code similarity detection technology. The code plagiarism detection accuracy and precision of the test program are improved, so that developers are helped to test code plagiarism behaviors of competitors for competition automation detection, the manual detection link is omitted, the labor cost and the time cost are saved, and it is guaranteed that the competition ismore fair and justice.

Description

technical field [0001] The invention belongs to the field of plagiarism detection of software test codes, and especially solves the phenomenon of plagiarism in test codes submitted by players in current developer test competitions more accurately. After analyzing the program to be tested and the test program, model training, testing, verification and optimization are carried out on the existing test code data set by introducing the support vector machine method. Then, classify the unlabeled test code data. By analyzing the category of the test codes, it is possible to determine whether there is plagiarism between the test codes, thereby eliminating the need for manual inspection, saving labor costs and time costs, and ensuring a fairer competition. Background technique [0002] In the process of writing code, it has become a common behavior for software developers to copy and paste code from different sources and modify it to achieve their own goals, which improves the effi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/36G06K9/62
CPCG06F11/3604G06F18/2411G06F18/214
Inventor 陈振宇孙伟松孙泽嵩王兴亚段定
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products