Intrinsic function identification method based on subgraph isomorphism matching algorithm in decompilation

A subgraph isomorphism and matching algorithm technology, applied in the field of decompilation, can solve the problems of ineffective representation of the control flow relationship of instruction statements, incomplete decompilation results of intrinsic functions, and affecting the readability of high-level codes, etc., to achieve strong functions. Extensibility, enhanced readability and accuracy, and the effect of increasing the level of abstraction

Inactive Publication Date: 2018-04-17
XI AN JIAOTONG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, the common library functions strlen, strcpy, strcmp, and memcmp in C language are also used as intrinsic functions of the compiler. Under the compiler optimization option, the function body statement is expanded inline at the function call point, and the Flirt algorithm is constructed. The signature of the byte stream function cannot effectively represent the control flow relationship between instruction statements, and cannot efficiently identify such functions, resulting in incomplete decompilation results of intrinsic functions, which affects the readability of the final high-level code sex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intrinsic function identification method based on subgraph isomorphism matching algorithm in decompilation
  • Intrinsic function identification method based on subgraph isomorphism matching algorithm in decompilation
  • Intrinsic function identification method based on subgraph isomorphism matching algorithm in decompilation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0034] The intrinsic function identification method based on the subgraph isomorphism matching algorithm in the decompilation provided by the present invention comprises the following steps:

[0035] First, for a specific compiler, build the compiler’s intrinsic function template library: select a program with intrinsic function calls as a sample program, compile the sample program under the compiler optimization option to generate an executable file; use the IDA disassembler to disassemble Assemble the executable file, and use the generated assembly file as the input of the Boomerang decompiler. The Boomerang decompiler loads and decodes the assembly file. The decoding module uses continuous assembly instructions as basic blocks, and the control flow relationship as directed edges. Construct the assembly instruction control flow graph of the assembly file; e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an intrinsic function identification method based on a subgraph isomorphism matching algorithm in decompilation, and belongs to the technical field of decompilation. By establishing an intrinsic function template library, the present invention performs subgraph isomorphism matching on the intrinsic function template and the target assembly file based on the control flow graph generated by decompilation, and locates the compiled optimized and inline expanded objects in the target assembly file in the target program. intrinsic function. The invention realizes the automatic identification of the inline intrinsic function in the process of decompilation, and at the same time, by analyzing the template and prototype of the intrinsic function, the function name, return value, return value type and function parameters of the intrinsic function are restored, so as to achieve the inline function. Purpose of intrinsic function semantic promotion. The improved inline intrinsic function provides more type information for type analysis in decompilation, reduces the complexity of data flow analysis and control flow analysis, improves the abstraction level of intermediate code, and enhances the reliability of decompilation results. readability.

Description

technical field [0001] The invention belongs to the technical field of decompilation, and relates to a method for identifying inline intrinsic functions in decompilation, in particular to a method for identifying intrinsic functions in decompilation based on a subgraph isomorphism matching algorithm. Background technique [0002] Decompilation technology first appeared in the 1960s, mainly to realize cross-platform porting of code, and has been widely used in various aspects such as program understanding, source code recovery, program debugging, and security analysis. Decompilation software includes front-end, middle-end and back-end. The front end includes loader, software parsing unit and decoder. The loader loads the executable file, disassembles to obtain the assembly code, and then the decompilation software organizes the assembly program into corresponding data structures, such as symbol table, symbol address table, process body entry address table, instruction chain ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F8/53
Inventor 赵银亮张磊刘凯刘延昭
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products