Sub-graph semantic isomorphism based instruction idiom identification method

An identification method and instruction technology, applied in the field of decompilation, can solve problems such as the inability to completely detect program instruction idioms, and achieve the effect of simplifying data flow analysis and control flow analysis, eliminating influence, and improving accuracy

Active Publication Date: 2016-05-18
弘毅视界(北京)科技有限公司
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, mainstream decompilation software includes Hex-Rays, RetargetableDecompiler, SecondWrite, etc. Their basic technology is to search for the corresponding subsequence in the assembly instruction sequence flow. This method is affected by factors such as code scheduling and code optimization in the compilation optimization stage. , cannot fully detect instruction idioms in the program

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sub-graph semantic isomorphism based instruction idiom identification method
  • Sub-graph semantic isomorphism based instruction idiom identification method
  • Sub-graph semantic isomorphism based instruction idiom identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0040] The present invention is an instruction idiom recognition method based on subgraph semantic isomorphism, the working process is as follows figure 1 As shown, the specific steps are as follows:

[0041] Step 1: According to the construction algorithm of the assembly program control flow graph, construct the assembly instruction control flow graph of the process body;

[0042] Step 2: Analyze instruction idioms on the assembly instruction control flow graph obtained in step 1, analyze the instruction sequence flows corresponding to different instruction idioms through the frequent sequence mining algorithm, and combine all instruction idioms and instruction sequence flows Constitute a data dependency graph;

[0043] Specifically include the following steps:

[0044] a) Select the source program containing the instruction idiom to be m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an analysis method of instruction idioms in reverse analysis. The analysis method of the instruction idioms in reverse analysis comprises the steps of performing data stream analysis for an assembly instruction sequence stream in a basic block; constructing a data dependence graph of the instruction sequence stream based on a definition-use relationship; then performing sub-graph semantic isomorphism matching for each connection sub-graph in the data dependence graph and a template in a template library; discovering a semantic isomorphism sub-graph; performing regulated reduction for the isomorphism sub-graph; and analyzing abstract operation of the sub-graph and corresponding operands. The analysis method of the instruction idioms in reverse analysis can efficiently increase the abstraction level of intermediate codes in the reverse analysis and improve code analysis efficiency.

Description

technical field [0001] The invention belongs to the technical field of decompilation, and in particular relates to an instruction idiom recognition method based on subgraph semantic isomorphism. Background technique [0002] With the advancement of science and technology, most software is currently written in high-level languages ​​and provided to users in the form of machine codes. However, the readability of machine codes is poor. People need to translate machine code programs into easy-to-understand high-level language programs to achieve software analysis, for understanding and maintenance purposes. Decompilation technology, also known as reverse compilation technology, refers to the process of turning executable files into high-level language source programs. As a tool for software analysis and understanding, decompilation has been widely used in program understanding, program debugging, and source code various aspects of recovery. The process of decompilation usually...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/45
CPCG06F8/53
Inventor 李浩高寅生李向军李立崔啸华
Owner 弘毅视界(北京)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products