Software homology detection method and device based on abstract syntax tree and semantic matching

A semantic matching and syntax tree technology, applied in computer security devices, platform integrity maintenance, instruments, etc., can solve problems such as inability to deal with plagiarism in the order of software source codes, and achieve the effect of effective software homology detection

Inactive Publication Date: 2010-11-24
北京华夏信安科技有限公司 +1
View PDF2 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This technical solution considers the language characteristics to a certain extent, but its principle is to find the longest similar substring in the software source code, so it cannot deal with plagiarism such as the order of the software source

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software homology detection method and device based on abstract syntax tree and semantic matching
  • Software homology detection method and device based on abstract syntax tree and semantic matching
  • Software homology detection method and device based on abstract syntax tree and semantic matching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The embodiment of the present invention provides a software homology detection method based on summary syntax tree and semantic matching, as shown in the attached figure 1 shown, including:

[0044] Step 11, generating an abstract syntax tree corresponding to the software source code, and adjusting the subtrees in the abstract syntax tree that match the same semantic feature rules and have the same semantics into a unified structure;

[0045] Step 12, calculating the hash (Hash) value of the subtree in the summary syntax tree;

[0046] In step 13, software homology detection is performed by judging whether the hash values ​​of subtrees with the same number of nodes are consistent.

[0047] Through the implementation of the software homology detection method based on abstract syntax tree and semantic matching provided by the embodiment of the present invention, accurate and effective software homology detection can be performed on the grammatical level combined with sem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a software homology detection method and a device based on an abstract syntax tree and semantic matching. The method comprises the following steps: generating an abstract syntax tree corresponding to software source codes, and regulating subtrees which match with the same semantic feature rule and have the same semantics in the abstract syntax tree to a unified structure; calculating the hash values of subtrees in the abstract syntax tree; and performing software homology detection through judging whether the hash values of subtrees with the same number of nodes are consistent, thereby performing accurate and effective software homology detection by combining semantics in grammatical level.

Description

technical field [0001] The invention relates to the technical field of software security in information security, in particular to a software homology detection method and device based on abstract syntax tree and semantic matching. Background technique [0002] Software homology detection technology is an important aspect of computer programming language research. According to different detection methods, there are the following mainstream research fields in this field: text-based software homology detection and token-based software homology Sex detection. [0003] The text-based software homology detection technical solution is to detect software homology based on the text level, and the current software source code plagiarism process is generally copying the whole block, or making changes on this basis, such as replacing variable names, Disrupt the sequence of statements, change the function name or function location, etc. without affecting the function of the program. s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/00G06F21/56
Inventor 崔宝江吴世忠郭涛
Owner 北京华夏信安科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products