Detection method of code similarity based on digital fingerprints

A detection technology based on digital fingerprints, applied in the fields of electronic digital data processing, instruments, and platform integrity maintenance. It addresses poor feature selection, reduces the probability of misjudgment, offers high reliability, and improves calculation speed.

Inactive Publication Date: 2011-02-16
BEIJING INSTITUTE OF TECHNOLOGY
2 Cites, 35 Cited by

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem that, during code similarity detection, the features of the two parties' source code are not distinctive, which leads to poor feature selection. It proposes a code similarity detection method based on digital fingerprints that can identify a relatively large number of techniques for concealing code plagiarism, select code features and fingerprints effectively, and improve the accuracy of similarity detection.

Examples

Embodiment

[0043] A code similarity detection method based on digital fingerprints, as shown in Figure 1. Similarity detection is performed on the two pieces of C source code shown in Figure 2, and the steps are as follows:

[0044] 1. Preprocess the source code of both parties. The purpose is to reduce the influence of superficial, irrelevant information in the code on the similarity determination. Specifically, the following operations are performed on the source code of both parties: delete comments, delete macro commands, and delete content unrelated to code semantics. The results are shown in Figure 3;
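
The preprocessing step can be sketched roughly as follows. This is an illustration in Python under stated assumptions, not the patent's implementation: the names `strip_comments` and `preprocess` are hypothetical, "macro commands" is read here as C preprocessor directives, and "content unrelated to code semantics" is approximated by blank lines and trailing whitespace.

```python
import re

# One pattern for literals and comments, so that comment markers inside
# string or character literals are not mistaken for real comments.
_COMMENT_OR_LITERAL = re.compile(
    r'"(?:\\.|[^"\\])*"'      # string literal (kept)
    r"|'(?:\\.|[^'\\])*'"     # character literal (kept)
    r"|/\*.*?\*/"             # block comment (dropped)
    r"|//[^\n]*",             # line comment (dropped)
    re.DOTALL,
)

# A preprocessor directive: '#' at the start of a line, including any
# lines joined to it by a trailing backslash.
_DIRECTIVE = re.compile(r"^[ \t]*#(?:\\.|[^\n\\])*", re.MULTILINE | re.DOTALL)


def strip_comments(source: str) -> str:
    """Replace each comment with one space; leave literals untouched."""
    def keep_literals(match):
        text = match.group(0)
        return " " if text.startswith("/") else text
    return _COMMENT_OR_LITERAL.sub(keep_literals, source)


def preprocess(source: str) -> str:
    """Assumed reading of step 1: drop comments, directives, blank lines."""
    without_comments = strip_comments(source)
    without_directives = _DIRECTIVE.sub("", without_comments)
    lines = (line.rstrip() for line in without_directives.splitlines())
    return "\n".join(line for line in lines if line.strip())
```

Comments are removed before directives so that a comment trailing a directive does not interfere with the directive pattern. A production tool would more likely rely on a real C lexer than on regular expressions.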

[0045] 2. Segment the code of both parties as processed in step 1. Specifically, this includes adding separators and spaces between the different types of words contained in the code while retaining the original spaces. In the course of writing and modifying code, there may be no separator between words of different types. In...
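
One way to picture the segmentation step is a small tokenizer that inserts a single space wherever two tokens touch, leaving existing whitespace as it is. The sketch below is in Python and rests on assumptions: the function name `segment`, the token classes, and the operator list are all hypothetical choices for illustration, since the visible text does not define the word types.

```python
import re

# Token classes are an assumption; alternatives are tried left to right.
_TOKEN = re.compile(
    r"(?P<space>\s+)"
    r"|(?P<word>[A-Za-z_]\w*)"                      # identifiers, keywords
    r"|(?P<number>\d[\w.]*)"                        # 10, 0x1F, 1.5f
    r'|(?P<string>"(?:\\.|[^"\\])*"|\'(?:\\.|[^\'\\])*\')'
    r"|(?P<operator><<=|>>=|->|\+\+|--|<<|>>|&&|\|\|"
    r"|[-+*/%&|^<>=!]=|[-+*/%&|^<>=!~?:.])"
    r"|(?P<punct>.)",                               # ( ) { } [ ] ; , ...
    re.DOTALL,
)


def segment(code: str) -> str:
    """Insert one space between adjacent tokens; keep original spaces."""
    pieces = []
    previous_kind = "space"  # nothing precedes the first token
    for match in _TOKEN.finditer(code):
        kind = match.lastgroup
        # Two tokens touching each other: add the missing separator.
        if kind != "space" and previous_kind != "space":
            pieces.append(" ")
        pieces.append(match.group(0))
        previous_kind = kind
    return "".join(pieces)


print(segment("for(i=0;i<n;i++)sum+=a[i];"))
# for ( i = 0 ; i < n ; i ++ ) sum += a [ i ] ;
```

After this step, `sum+=a[i];` and `sum += a[i] ;` yield the same token sequence when split on whitespace, so differences in spacing alone no longer separate the two texts.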

Abstract

The invention relates to a method for detecting code similarity based on digital fingerprints, belonging to the technical field of computer applications. The detection method comprises the following steps: preprocessing the code, segmenting it, formatting it, converting it to numerical form using digital fingerprint technology, calculating the digital fingerprint sequences, and calculating the similarity. The method has a simple computational process and yields highly credible similarity results. It lowers the probability of misjudgment by selecting more effective code features, identifies a variety of code copying and concealment techniques through digital fingerprint technology, and effectively improves overall computation speed.
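
The last two steps, computing fingerprint sequences and a similarity score, might look like the Python sketch below. The visible text does not say which hash or which similarity measure the invention uses, so this substitutes a common stand-in: hashing every run of `k` consecutive tokens (k-grams) with SHA-1 and comparing the resulting sets by Jaccard overlap. The function names and the default `k=4` are hypothetical.

```python
import hashlib


def kgram_fingerprints(tokens, k=4):
    """Hash each run of k consecutive tokens to a 32-bit integer."""
    fingerprints = []
    for start in range(len(tokens) - k + 1):
        gram = " ".join(tokens[start:start + k])
        digest = hashlib.sha1(gram.encode("utf-8")).digest()
        # Keep the first 4 bytes as a compact, deterministic fingerprint.
        fingerprints.append(int.from_bytes(digest[:4], "big"))
    return fingerprints


def similarity(tokens_a, tokens_b, k=4):
    """Jaccard overlap of the two fingerprint sets, in the range [0, 1]."""
    set_a = set(kgram_fingerprints(tokens_a, k))
    set_b = set(kgram_fingerprints(tokens_b, k))
    union = set_a | set_b
    if not union:
        return 0.0  # both inputs shorter than k tokens: nothing to compare
    return len(set_a & set_b) / len(union)


original = "int x = 0 ; x = x + 1 ;".split()
extended = original + ["return", "x", ";"]
print(similarity(original, original))  # identical input gives 1.0
print(similarity(original, extended))  # partial overlap, between 0 and 1
```

Hashing token runs rather than single tokens makes the score sensitive to local structure, and `hashlib` is used instead of the built-in `hash` because the latter is randomized per process for strings.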

Description

Technical field

[0001] The invention relates to a code similarity detection method based on digital fingerprints, belonging to the technical field of computer applications.

Background

[0002] With the advancement of science and technology, writing software code has become widespread, and the falling cost of copying software code has led to a large number of software infringement cases. It is not only application software code that needs protection: the code in almost all high-tech products is also easy to crack and imitate. It is therefore necessary to detect the similarity of the two parties' code when handling infringement cases, as an important basis for the judgment. In addition, with the continuous improvement of the level of automation in education, many courses in colleges and universities also involve writing code. In order to effectively test students' attitudes and gains to these c...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/00, G06F21/56
Inventors: 史树敏, 黄河燕, 黄柳柳
Owner: BEIJING INSTITUTE OF TECHNOLOGY