Unlock instant, AI-driven research and patent intelligence for your innovation.

Code similarity detection method and device based on code warehouse process management

A detection method and process management technology, which are applied in the field of code similarity detection based on code warehouse process management, can solve the problems of inability to detect whether there is plagiarism in code changes, inability to calculate the similarity of warehouse change processes, and ignoring transition trajectories.

Active Publication Date: 2019-11-12
NANJING UNIV OF POSTS & TELECOMM
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the past, most of the research on code plagiarism detection focused on measuring the similarity between two separate codes, without considering the change trajectory of the two pieces of code, so that it was impossible to detect whether there was plagiarism in the process of code changes
In code similarity detection, methods such as attribute counting or structure measurement are to extract the key information between codes with historical version information, and then compare the similarity between the two warehouses to determine whether plagiarism, but can only get The similarity of two warehouses at a certain point in time, but the similarity of the change process of two warehouses cannot be calculated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Code similarity detection method and device based on code warehouse process management
  • Code similarity detection method and device based on code warehouse process management
  • Code similarity detection method and device based on code warehouse process management

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0081] Such as Figure 1-5 As shown, the code similarity detection method based on the code warehouse process management, the first explanation is that,

[0082] Git repository: The code repository generated using the Git tool is a comparison entity, including submission information.

[0083] Submit (Gitcommit): It is an information block, including: project file content, submitter's email address, submission time, and modified file information. Each commit represents a repository version.

[0084] Extraction: refers to the action of extracting and converting the content in the submitted information block into usable format data.

[0085] In this embodiment, the Git warehouse is selected as the code version control tool, and the homework code check is performed on the Git code warehouse that the students finally completed, including the following steps:

[0086] 1) Acquiring configuration information, the configuration information includes filtering rules, weight distributi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a code similarity detection method based on code warehouse process management, and the method comprises the steps: obtaining configuration information which comprises a filtering rule, weight distribution and an execution strategy; extracting warehouse information to be detected; filtering an abnormal warehouse according to the filtering rule; comparing warehouse submissioninformation in a normal warehouse, and carrying out similarity calculation according to weights obtained through weight distribution and the execution strategy; counting the similarity of the warehouses. The device comprises a configuration information acquisition module, a to-be-detected warehouse information extraction module, an abnormal warehouse filtering module, a warehouse comparison module and a similarity information statistical output module. The code similarity detection method is based on code warehouse process management, and duplicate checking of programming jobs in computer practice courses in a version management mode is achieved.

Description

technical field [0001] The invention belongs to the technical field of code similarity detection, and in particular relates to a code similarity detection method and device based on code warehouse process management. Background technique [0002] With the rapid development of Internet technology, the method of obtaining electronic data from the Internet has become the choice of the vast majority of Internet users, and this method is also the main reason why electronic data can be disseminated and reused. However, the characteristics of easy duplication and low modification cost of electronic data make the abuse of electronic data more and more widespread. In the field of computer programming, this kind of abuse also occurs frequently. For example, if you search for software with disk partition management functions in a search engine, you can find at least 10 products with such functions. In fact, among these products, the core functions The code implementations are the same...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/22G06F8/71
CPCG06F8/71
Inventor 张卫丰蒋臣威蒋进文王庭
Owner NANJING UNIV OF POSTS & TELECOMM