Source code fragment pairwise comparison method based on coding sequence representation
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications (China)
- Current Assignee / Owner
- NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
- Publication Date
- 2021-02-26
Abstract
Description
Technical Field
[0001] The invention belongs to the technical field of computer programs and relates to a method for pairwise comparison of source code fragments based on coding sequence representation.
Background Art
[0002] Source code similarity detection is widely used in many software development tasks, for example, detecting code plagiarism and duplicate code through clone detection, locating software faults through similarity matching, and recommending code or generating fixes. These tasks require a source code similarity matching algorithm to retrieve similar code and analyze it quantitatively.
[0003] Commonly used code similarity calculation methods usually represent the source code as text, symbols (tokens), tree structures, or graph structures, and then apply the corresponding similarity definition to calculate the similarity between two pieces of source code. The text-based method treats the source code text as a string sequence or set for text matchin...
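
As an illustration of the text-based family of methods mentioned in the background, the following is a minimal sketch and not the method claimed by the invention. It assumes a simple regular-expression split of the source text into symbols, and it compares two fragments once as an ordered sequence and once as a set. The function names `split_symbols`, `sequence_similarity`, and `set_similarity` are hypothetical and chosen only for this example.

```python
import difflib
import re


def split_symbols(code: str) -> list[str]:
    """Split source text into identifiers, integer literals, and single
    non-space characters. This regex is an illustrative assumption, not a
    real lexer for any particular programming language."""
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", code)


def sequence_similarity(a: str, b: str) -> float:
    """Order-sensitive similarity: ratio of matching symbols found by
    difflib.SequenceMatcher over the two symbol sequences."""
    matcher = difflib.SequenceMatcher(None, split_symbols(a), split_symbols(b))
    return matcher.ratio()


def set_similarity(a: str, b: str) -> float:
    """Order-insensitive similarity: Jaccard index of the two symbol sets."""
    sa, sb = set(split_symbols(a)), set(split_symbols(b))
    if not sa and not sb:
        return 1.0  # two empty fragments are treated as identical
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    left = "int sum = a + b;"
    right = "int total = a + b;"
    print(sequence_similarity(left, right))
    print(set_similarity(left, right))
```

Both scores lie in the range 0 to 1. The sequence variant is sensitive to reordering of statements, while the set variant ignores order entirely, which is one reason text-level matching alone may miss or over-report similar fragments.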