
SQL code plagiarism detection method and system based on coding characteristics

A detection technique based on coding features, applied in relational databases, structured data retrieval, and special data processing applications. It addresses the problems that existing methods are prone to misjudgment, rely on overly simple coding features, and cannot make full use of the rich coding features of SQL code, with the effect of improving detection accuracy and efficiency.

Active Publication Date: 2021-05-07
GUANGXI UNIV
Cites: 5; Cited by: 1

AI Technical Summary

Problems solved by technology

Because they take the writer's individual coding habits into account, SQL plagiarism detection methods based on coding feature matching can reduce the misjudgment rate compared with methods based on string matching. However, on the one hand, such methods judge plagiarism only by comparing changes in each writer's own coding features. They do not examine the coding correlation between the SQL code of different writers, so they remain prone to misjudgment. On the other hand, the coding features they consider are still too simple: they do not make full use of the rich coding features of SQL code and cannot fully characterize the relationship between SQL code and coding features, which still limits the accuracy of plagiarism detection.
[0006] To sum up, automatic plagiarism detection of SQL code in the prior art either performs simple similarity matching on the SQL code or relies on simple differences in writers' SQL coding habits. Neither approach makes full use of the rich coding features that writers exhibit in their SQL code, so neither achieves high-precision plagiarism detection.



Embodiment Construction

[0040] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.

[0041] As shown in Figure 1, the SQL code plagiarism detection method based on coding features in this embodiment includes the following steps:

[0042] S1. Coding feature extraction: Input all the SQL codes to be detected under the specified coding task. From each SQL code to be detected, extract the specific coding features, which characterize the writer's coding characteristics under the current SQL coding task, and the generalized coding features, which characterize the generalized coding characteristics exhibited by the writer under the current SQL coding task;

[0043] S2. Plagiarism group detection: Cluster all the extracted specific coding features and determine the plagiarism groups according to the clustering results, and the s...
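Steps S1 and S2 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the visible text does not list the actual specific coding features or name the clustering algorithm, so the four features in `specific_features`, the distance-threshold grouping in `plagiarism_groups`, and the rule that any cluster with more than one member counts as a suspected plagiarism group are all assumptions made for the example.

```python
import math
import re


def specific_features(sql: str) -> list[float]:
    """Hypothetical task-specific features (the source does not enumerate them)."""
    lowered = sql.lower()
    tokens = re.findall(r"[a-z_]+", lowered)
    return [
        float(tokens.count("join")),      # number of JOIN keywords
        float(lowered.count("(select")),  # number of nested subqueries
        float(tokens.count("where")),     # number of WHERE clauses
        float(len(tokens)),               # rough size of the statement
    ]


def plagiarism_groups(vectors: list[list[float]], threshold: float) -> list[list[int]]:
    """Assumed clustering: link submissions whose feature vectors lie within
    `threshold` of each other (single linkage via union-find) and report
    every cluster that contains more than one submission."""
    parent = list(range(len(vectors)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(vectors)):
        for j in range(i + 1, len(vectors)):
            if math.dist(vectors[i], vectors[j]) <= threshold:
                parent[find(j)] = find(i)

    clusters: dict[int, list[int]] = {}
    for i in range(len(vectors)):
        clusters.setdefault(find(i), []).append(i)
    return [members for members in clusters.values() if len(members) > 1]


submissions = [
    "SELECT s.name FROM student s JOIN score c ON s.id = c.sid WHERE c.mark > 60",
    "select s.name from student s join score c on s.id = c.sid where c.mark > 60",
    "SELECT name FROM student WHERE id IN (SELECT sid FROM score WHERE mark > 60)",
]
vectors = [specific_features(sql) for sql in submissions]
groups = plagiarism_groups(vectors, threshold=1.0)
print(groups)
```

In this sketch the first two submissions differ only in keyword case, so they map to the same feature vector and form one suspected group, while the subquery-based third submission stays outside it. The threshold value is arbitrary and would need tuning on real data.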


Abstract

The invention discloses an SQL code plagiarism detection method and system based on coding features. The method comprises the following steps: S1, inputting all SQL codes to be detected under a specified coding task, and extracting the specific coding features and the generalized coding features from each SQL code to be detected; S2, clustering all the extracted specific coding features and determining plagiarism groups according to the clustering result; S3, judging each SQL code to be detected in a plagiarism group in turn by comparing its generalized coding features with the historical generalized coding features, and determining from the comparison whether the SQL code is the plagiarizing code or the plagiarized code; and S4, outputting the plagiarism judgment result for each SQL code to be detected in each plagiarism group. The method is simple to implement, realizes automatic plagiarism detection for SQL code, and achieves high detection efficiency and precision.
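The judgment in step S3 might look something like the sketch below. It is an illustration under stated assumptions rather than the method claimed in the patent: the two style features in `generalized_features` (keyword capitalization and line indentation), the `history` mapping from writer to a stored historical feature vector, and the decision rule (a writer whose current style drifts from their own history by more than `threshold` is labeled the plagiarizer, otherwise the plagiarized party) are hypothetical and do not come from the source.

```python
import math
import re

# Assumed keyword list, used only to measure capitalization habits.
KEYWORD_PATTERN = re.compile(
    r"\b(select|from|where|join|on|and|or|group|order|by)\b", re.IGNORECASE
)


def generalized_features(sql: str) -> list[float]:
    """Hypothetical task-independent style features of one SQL submission."""
    keywords = KEYWORD_PATTERN.findall(sql)
    upper_ratio = (
        sum(k.isupper() for k in keywords) / len(keywords) if keywords else 0.0
    )
    lines = sql.splitlines() or [""]
    indented_ratio = sum(line.startswith((" ", "\t")) for line in lines) / len(lines)
    return [upper_ratio, indented_ratio]


def judge_group(
    group: dict[str, str],
    history: dict[str, list[float]],
    threshold: float,
) -> dict[str, str]:
    """Assumed rule: compare each writer's current style with their own
    historical style and label large deviations as plagiarizing."""
    verdicts = {}
    for writer, sql in group.items():
        drift = math.dist(generalized_features(sql), history[writer])
        verdicts[writer] = "plagiarizer" if drift > threshold else "plagiarized"
    return verdicts


shared_sql = "SELECT name\n  FROM student\n  WHERE id = 1"
group = {"alice": shared_sql, "bob": shared_sql}
history = {
    "alice": [1.0, 2 / 3],  # historically uppercase keywords, indented clauses
    "bob": [0.0, 0.0],      # historically lowercase keywords, no indentation
}
print(judge_group(group, history, threshold=0.5))
```

Here both writers submitted the same code, but it matches only alice's historical style, so the sketch labels bob as the plagiarizer. A real system would need richer features and a calibrated threshold.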

Description

Technical field

[0001] The invention relates to the technical field of automatic plagiarism detection, and in particular to a method and system for detecting plagiarism in SQL code based on coding features.

Background

[0002] Structured Query Language (SQL) is central to database technology. In practice, it is often necessary to evaluate SQL code written by many writers for the same task, and some of that code may be plagiarized. For example, in teaching, students complete SQL programming exercises so that their learning can be assessed, and plagiarism in the submitted SQL code undermines the accuracy of that assessment. At present, plagiarism among many SQL codes is usually judged manually based on experience, which is labor-intensive and not very accurate. If intelligent plagiarism detection can be realized, and automatic pl...


Application Information

IPC(8): G06F8/75, G06F16/28, G06F16/36, G06F40/279, G06F40/30, G06F40/253, G06K9/62
CPC: G06F8/751, G06F16/285, G06F16/36, G06F40/279, G06F40/30, G06F40/253, G06F18/22, Y02D10/00
Inventors: 许嘉, 莫晓琨, 吕品
Owner: GUANGXI UNIV