Method and device for identifying multiplexing relationship between binary code and source code

A technology relating binary code to source code, applied in the fields of creating/generating source code, software reuse and code compilation. It addresses the problems that related technologies do not target reuse relationship identification, do not offer a suitable solution, and incur high development costs.

Active Publication Date: 2020-04-21
INST OF INFORMATION ENG CAS
Cites: 3 | Cited by: 5
AI Technical Summary

Problems solved by technology

However, constructing source code reuse relationships incurs a high development cost, and the related technologies do not address the target problem, so they are not the best solution.

Examples

Detailed Description of the Embodiments

[0040] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below through specific embodiments and accompanying drawings.

[0041] Most existing code reuse detection techniques do not distinguish between types of code reuse; they treat the reuse relationship simply as complete reuse of the target code. In fact, code reuse is a complex many-to-many relationship. The present invention divides code reuse (also rendered as "multiplexing") into four types: single reuse, mixed reuse, partial reuse and nested reuse.
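The many-to-many relationship and the four reuse (multiplexing) types named above can be pictured with a small data model. This is only an illustrative sketch, not the patent's data structure: the names `ReuseType`, `ReuseRelation`, `components_of` and `binaries_using` are hypothetical, and so are the example files other than `zlib1.dll`.

```python
from dataclasses import dataclass
from enum import Enum


class ReuseType(Enum):
    """The four reuse types named in paragraph [0041]."""
    SINGLE = "single"
    MIXED = "mixed"
    PARTIAL = "partial"
    NESTED = "nested"


@dataclass(frozen=True)
class ReuseRelation:
    """One edge of the many-to-many relation: a binary reuses a component."""
    binary: str
    component: str
    reuse_type: ReuseType


# Hypothetical example data: one binary may reuse several components,
# and one component may be reused by several binaries.
relations = [
    ReuseRelation("app.exe", "zlib", ReuseType.MIXED),
    ReuseRelation("app.exe", "libpng", ReuseType.MIXED),
    ReuseRelation("zlib1.dll", "zlib", ReuseType.SINGLE),
]


def components_of(binary, rels):
    """Return the set of components that the given binary reuses."""
    return {r.component for r in rels if r.binary == binary}


def binaries_using(component, rels):
    """Return the set of binaries that reuse the given component."""
    return {r.binary for r in rels if r.component == component}


print(components_of("app.exe", relations))
print(binaries_using("zlib", relations))
```

Storing each relation as a separate edge keeps both lookup directions cheap to express, which is what a many-to-many relationship needs.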

[0042] Single reuse means that the binary code contains the complete code of one open source component and only that code; that is, the binary file is compiled from the complete code of the target open source component. For example, binary files named zlib.dll or zlib1.dll exist in a large number of software...

Abstract

The invention relates to a method and a device for identifying a reuse (multiplexing) relationship between binary code and source code. The method comprises the following steps: constructing a compilation dependency graph that shows the mapping relationship between the library files and the source files of an open source component; calculating the similarity between the source code and the binary code by using the compilation dependency graph, and identifying single reuse, mixed (hybrid) reuse and partial reuse; identifying nested reuse by comparing the binary code with the matched feature set of the open source component; and generating a reuse relation graph from the identified reuse relationships. The method distinguishes four reuse types among the code reuse relationships commonly found in real software, judges according to these four types whether a reuse relationship really exists between the binary code and the source code, and eliminates the interference of the reuse type with similarity detection, so that the reuse relationship between binary code and source code can be identified effectively.
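The first two steps of the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: features are modelled as plain sets of strings, similarity is taken to be the fraction of a library's features found in the binary, the thresholds `FULL` and `PARTIAL` are invented values, and the rules that separate single, mixed and partial reuse are guesses based on the type names. All function and file names are hypothetical. Nested reuse identification is left out.

```python
from collections import defaultdict

# Assumed thresholds; the source does not state numeric values.
FULL = 0.9
PARTIAL = 0.3


def build_dependency_graph(build_records):
    """Map each library file to the source files compiled into it.

    build_records is an iterable of (library_file, source_file) pairs.
    """
    graph = defaultdict(set)
    for library, source in build_records:
        graph[library].add(source)
    return graph


def library_features(graph, library, source_features):
    """Union of the features of every source file that builds the library."""
    feats = set()
    for source in graph.get(library, ()):
        feats |= source_features.get(source, set())
    return feats


def similarity(binary_features, lib_features):
    """Fraction of the library's features that also occur in the binary."""
    if not lib_features:
        return 0.0
    return len(binary_features & lib_features) / len(lib_features)


def identify_reuse(binary_features, graph, source_features):
    """Label each sufficiently matched library as single, mixed or partial."""
    scores = {
        lib: similarity(binary_features,
                        library_features(graph, lib, source_features))
        for lib in graph
    }
    matched = {lib: s for lib, s in scores.items() if s >= PARTIAL}
    labels = {}
    for lib, score in matched.items():
        if score < FULL:
            labels[lib] = "partial"   # only part of the library is present
        elif len(matched) == 1:
            labels[lib] = "single"    # one complete library and nothing else
        else:
            labels[lib] = "mixed"     # complete library alongside others
    return labels


# Hypothetical build records and features (here, exported function names).
graph = build_dependency_graph([
    ("libz.a", "inflate.c"),
    ("libz.a", "deflate.c"),
    ("libpng.a", "png.c"),
])
source_features = {
    "inflate.c": {"inflate", "inflateInit_"},
    "deflate.c": {"deflate", "deflateInit_"},
    "png.c": {"png_read_info", "png_create_read_struct"},
}
binary_features = {"inflate", "inflateInit_", "deflate", "deflateInit_",
                   "png_read_info", "main"}

result = identify_reuse(binary_features, graph, source_features)
print(result)
```

Grouping source features by library through the graph means the comparison is made against what a library is actually built from, rather than against the whole source tree of the component.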

Description

Technical field

[0001] The invention belongs to the field of program analysis and focuses on static code analysis. Specifically, it classifies and identifies multiple types of inclusion relationships between binary code and source code.

Background technique

[0002] Reuse relationship identification is used to accurately identify and describe the inclusion relationship between pieces of code. Although there are many studies related to code reuse, reuse usually serves only as the research background for code similarity detection, and few studies discuss and handle the different types of reuse relationship. In most studies, reuse detection is treated as equivalent to code similarity detection, and only complete clones of open source components are identified. However, the reuse relationship between pieces of code is actually a complex many-to-many relationship. A binary file may contain the code of multiple open source components, or may only reuse part of the code of an open source component...

Application Information

IPC(8): G06F8/36, G06F8/41
CPC: G06F8/36, G06F8/41
Inventor: 袁子牧, 冯牧玥, 霍玮, 李丰, 肖扬, 班固
Owner: INST OF INFORMATION ENG CAS