Sample homology analysis method based on data slice and image hash combination

A technology of data slicing and analysis methods, applied in the field of homologous analysis of malicious samples, which can solve problems such as overfitting of training results, sample lag, and insufficient number of analysts, and achieve the effect of reducing labor and low false alarm rate

Active Publication Date: 2019-01-11
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Manual identification of homologous samples has high requirements for analysts, who need to be familiar with the characteristics of known APT samples. Faced with today's massive high-risk samples, the number of analysts is far from enough, and it is impossible to efficiently analyze recent samples, which is prone to lag problems, increasing the difficulty of tracing the source of attack events
[0005] 2. The method of using machine learning algorithms to train a model through a large number of similar samples for homologous sample identification has become the mainstream in recent years. However, in the real environment of APT sample identification, due to the limited known samples of each APT organization, training The quality of the set is often not guaranteed, and the final training result is prone to overfitting (Overfit), which has limitations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sample homology analysis method based on data slice and image hash combination
  • Sample homology analysis method based on data slice and image hash combination

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] In order to solve the shortcomings of the sample homology analysis scheme provided by the prior art, that is, the requirements for analysts are relatively high, overfitting is prone to occur when the sample set is small, and there is a lag period, etc., the method of the present invention provides a data slice based And the sample homology analysis method of image hash combination, by extracting sample data slices to form a grayscale atlas, using the image hash algorithm to generate fingerprint results to establish a fingerprint database, which has the effect of identifying homologous samples.

[0045] In order to make the purpose and technical solution of the method of the present invention clearer, further detailed description will be given below in conjunction with the accompanying drawings.

[0046] see figure 1 , is a schematic flow diagram of the establishment of the method of the present invention, a sample homology analysis method based on data slicing and image h...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a sample homology analysis method based on data slice and image hash combination, which comprises the following steps: 1, collecting a malicious sample of known APT organization; 2, filtering and restoring the sample of that training data set; 3, carrying out static analysis on that sample and extracting data slice; 4, dynamically analyzing samples and other training data sets, and extracting data slices; 5, filtering the white list data slices and manually reviewing and arranging the slice format for all the data slices; 6, formatting all data slices into grayscale image form and classify according to functions; 7, calculating all gray images and classify and storing that calculated results to a fingerprint database; 8, testing organization of the sample in the testdataset. Through the above steps, a sample homology analysis method based on data slice and image hash combination is realized, the labor and time cost are reduced, and the problem of lag and highlydependent on manual analysis in the existing APT homology sample analysis is solved.

Description

1. Technical field [0001] The invention provides a sample homology analysis method based on data slice and image hash combination, which relates to a malicious sample homology analysis method and belongs to the technical field of network security. 2. Background technology [0002] In recent years, the network security situation has become more and more serious. APT-Advanced Persistent Threat (APT-Advanced Persistent Threat) incidents targeting the government, military industry, education departments, scientific research institutions and enterprises have continued to increase. Malicious sample variants and new malicious Samples emerge in endlessly. By studying the correlation and homology analysis between malicious samples, revealing the relationship between developers or attacking organizations behind malicious code attacks can provide more comprehensive data support for network attack traceability. [0003] In the face of more and more APT incidents, attacker source tracing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/22G06F18/241
Inventor 韩志辉吕志泉梅瑞严寒冰丁丽李佳沈元张帅李志辉张腾陈阳王适文马莉雅高川周昊周彧何永强袁伟华吕承琨李骏杰卞玉捷
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products