Method and system for identifying homologous binary files

A binary file and file technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as lack of program logic understanding, and achieve the effect of increasing speed and reducing the amount of calculation

Active Publication Date: 2018-02-16
INST OF INFORMATION ENG CAS
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method lacks the understanding of program l

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for identifying homologous binary files
  • Method and system for identifying homologous binary files
  • Method and system for identifying homologous binary files

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0042] In order to overcome the above-mentioned problems in the prior art, the present invention provides a method for identifying homologous binary files from a database, wherein the database contains a plurality of binary basic files. For this method, see figure 1 , the method includes:

[0043] S1. According to the minimum hash algorithm, respectively obtain the signature of the file to be identified and the signature of each basic file;

[0044] For step S1, the present invention first needs to obtain the signature of the file to be identified and all basic files in the database according to the minimum hash algorithm, and the minimum hash algorithm is also LSH (Engli...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a system for identifying homologous binary files in a database. The database comprises multiple binary basic files. The method comprises the steps of obtaining signatures of to-be-identified files and signatures of the basic files according to a min-hash algorithm; for any signature, performing bucket dividing processing on the signature according to a bucket dividing method; according to a reverse indexing method and the signatures, subjected to bucket dividing, of all the basic files, obtaining dictionaries in one-to-one correspondence with buckets, wherein each dictionary comprises at least one key value pair; according to character strings in the buckets of the to-be-identified files, traversing the corresponding dictionaries, and according to valuescorresponding to matching keys, obtaining the homologous binary files of the to-be-identified files. According to the method and the system, the signatures are obtained by adopting the min-hash algorithm and the bucket dividing is performed by adopting a local sensitive hash algorithm, so that the calculation amount can be remarkably reduced; and by adopting the reverse indexing method, an indextable is established for all the signatures, so that the speed of identifying the homologous binary files is increased.

Description

technical field [0001] The present invention relates to the technical field of vulnerability mining for embedded devices, and more specifically, to a method and system for identifying homologous binary files. Background technique [0002] In recent years, IoT devices have been connected to the network at a nearly doubled speed, which greatly facilitates people's lives. However, these smart devices face the risk of being attacked due to the lack of security awareness of the manufacturers and the extensive use of shared code modules and third-party SDKs. [0003] For example, on October 21, 2016, hackers used network cameras and other equipment to attack DNS provider Dyn in a distributed denial-of-service attack (DDoS), destroying a series of large Internet websites, including Spotify, Twitter, PayPal, Netflix, etc. The media called this attack "the worst DDos attack in history", and the security of IoT devices received more attention. When an emergency security event occurs...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/2255G06F16/24556
Inventor 石志强陈昱赵威威李红孙利民
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products