
A method and system for identifying homologous binary files

A technique for identifying homologous binary files, applicable to text database indexing, unstructured text data retrieval, instruments, etc. It addresses the inaccuracy and large errors of existing approaches to identifying homologous binary files.

Inactive Publication Date: 2020-09-11
INST OF INFORMATION ENG CHINESE ACAD OF SCI +1
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0005] To overcome, or at least partially solve, the problems of large errors and inaccuracy in identifying homologous binary files, the present invention provides a method and system for identifying homologous binary files.



Examples


Embodiment Construction

[0050] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0051] One embodiment of the present invention provides a method for identifying homologous binary files. Figure 1 is a schematic diagram of the overall flow of the method. The method includes: S1, using the minimum hash (MinHash) algorithm to obtain the signature of the binary file to be identified and the signature of each original binary file; S2, using a bucketing method to divide each signature into buckets and obtain the string of each signature that falls into each bucket, wherein the strings in the same bucket have the same number of characte...
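Step S1 can be illustrated with a minimal MinHash sketch. The source does not state which features of a binary file are hashed, so the choice of 4-byte shingles, the 64 hash functions, and every function name below are assumptions made for illustration, not the patented implementation.

```python
import hashlib
import random

_PRIME = (1 << 61) - 1  # modulus for the affine hash family (assumed choice)


def shingles(data: bytes, k: int = 4) -> set[bytes]:
    """Return the set of k-byte shingles of a file (assumed feature choice)."""
    if len(data) < k:
        return {data}
    return {data[i:i + k] for i in range(len(data) - k + 1)}


def make_hash_params(num_hashes: int = 64, seed: int = 0) -> list[tuple[int, int]]:
    """Draw (a, b) pairs defining hash functions h(x) = (a*x + b) mod _PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, _PRIME), rng.randrange(0, _PRIME))
            for _ in range(num_hashes)]


def minhash_signature(data: bytes, params: list[tuple[int, int]], k: int = 4) -> list[int]:
    """S1: for each hash function, keep the minimum value over all shingles."""
    base = [int.from_bytes(hashlib.blake2b(s, digest_size=8).digest(), "big")
            for s in shingles(data, k)]
    return [min((a * x + b) % _PRIME for x in base) for a, b in params]


def estimate_jaccard(sig_a: list[int], sig_b: list[int]) -> float:
    """Fraction of matching signature positions, an estimate of Jaccard similarity."""
    return sum(x == y for x, y in zip(sig_a, sig_b)) / len(sig_a)
```

Two files sharing most of their shingles produce signatures that agree in most positions, which is the property the later bucketing step relies on.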


Abstract

The invention provides a method and system for identifying homologous binary files. The method comprises: S1, using a minimum hash (MinHash) algorithm to obtain the signature of the to-be-identified binary file and the signature of each original binary file; S2, using a bucketing method to divide each signature into buckets and obtain the character string of the signature that falls into each bucket, wherein the character strings in the same bucket have the same number of characters; S3, according to the character strings corresponding to the signatures of the original binary files, using an inverted index method to obtain one dictionary for each bucket; S4, according to the character string of the to-be-identified binary file's signature in each bucket, obtaining from the dictionary of that bucket the original binary files that are homologous with the to-be-identified binary file. The method reduces the amount of calculation, improves the speed and accuracy of identifying homologous binary files, and is suitable for identifying various kinds of homologous binary files.
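Steps S2 to S4 can be sketched as follows, taking already-computed signatures (lists of non-negative integers) as input. The function names, the fixed-width hexadecimal encoding of each band, and the rule that a match in any single bucket makes a file a candidate are assumptions for illustration; the abstract does not specify them.

```python
from collections import defaultdict


def split_into_buckets(signature: list[int], num_buckets: int) -> list[str]:
    """S2: cut a signature into equal-width bands, one string per bucket.

    Each value is rendered as 16 hex digits (assumes values below 2**64),
    so all strings in the same bucket have the same number of characters.
    """
    if len(signature) % num_buckets != 0:
        raise ValueError("signature length must be a multiple of num_buckets")
    rows = len(signature) // num_buckets
    return ["".join(f"{v:016x}" for v in signature[b * rows:(b + 1) * rows])
            for b in range(num_buckets)]


def build_index(originals: dict[str, list[int]], num_buckets: int) -> list[dict[str, set[str]]]:
    """S3: build one inverted-index dictionary per bucket (string -> file names)."""
    index = [defaultdict(set) for _ in range(num_buckets)]
    for name, sig in originals.items():
        for b, key in enumerate(split_into_buckets(sig, num_buckets)):
            index[b][key].add(name)
    return index


def find_homologous(query_sig: list[int], index: list[dict[str, set[str]]]) -> set[str]:
    """S4: look up the query's string in each bucket's dictionary.

    Assumed rule: a file is a candidate if it matches in at least one bucket.
    """
    hits: set[str] = set()
    for b, key in enumerate(split_into_buckets(query_sig, len(index))):
        hits |= index[b].get(key, set())
    return hits


# Hypothetical toy signatures; real ones would come from step S1.
originals = {"fw_a.bin": [1, 2, 3, 4, 5, 6], "fw_b.bin": [9, 9, 9, 9, 9, 9]}
index = build_index(originals, num_buckets=3)
candidates = find_homologous([1, 2, 3, 4, 0, 0], index)
```

Because each lookup is a dictionary access, the query file is compared only against originals that share a bucket string with it, rather than against every original file, which is consistent with the reduced amount of calculation the abstract claims.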

Description

Technical field

[0001] The invention belongs to the field of vulnerability mining, and more specifically relates to a method and system for identifying homologous binary files.

Background

[0002] In recent years, more and more IoT devices have been connected to the network, which greatly facilitates people's lives. However, because manufacturers lack security awareness, shared code modules and third-party SDKs are widely used in these smart devices, exposing many of them to the risk of attack.

[0003] In the prior art, to protect smart devices from attack, the related firmware is generally repaired to improve the devices' resistance to attacks. To learn about vulnerable firmware in advance, homologous binary files must be identified. The existing binary file comparison method reads in the binary data directly for comparison. This method is relatively direct, but it does not take into account th...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/31, G06F21/56, G06F21/57
CPC: G06F16/319, G06F16/325, G06F21/562, G06F21/572, G06F21/577
Inventor: 石志强, 陈昱, 孙利民, 朱红松, 赵威威, 马原
Owner: INST OF INFORMATION ENG CHINESE ACAD OF SCI