A method and system for identifying homologous binary files
A binary file, to-be-identified technology, used in text database indexing, unstructured text data retrieval, instruments, etc., can solve the problems of inaccuracy and large errors in identifying homologous binary files.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0050] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.
[0051] In one embodiment of the present invention, a method for identifying homologous binary files is provided, figure 1 It is a schematic diagram of the overall flow of the method for identifying homologous binary files provided by the embodiment of the present invention. The method includes: S1, using the minimum hash algorithm to obtain the signatures of the binary files to be identified and the original binary files respectively; S2, using the bucketing method to Each of the signatures is divided into buckets, and the strings of each of the signatures divided into each bucket are obtained; wherein, each of the strings in the same bucket has the same number of characte...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


