Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Constraint limited clustering and information measuring software birthmark feature selection method and computer

A feature selection method and software feature technology, applied in software engineering design, calculation, software maintenance/management, etc., can solve the problems of lack of dynamic feature selection, low software recognition rate, no feature group, etc., to improve practicability and wide application. The effect of ensuring anti-attack and ensuring uniqueness

Active Publication Date: 2018-06-15
XIAN UNIV OF FINANCE & ECONOMICS
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] To sum up, the existing problems in the prior art are as follows: First, the traditional software feature acquisition method without clustering and screening is characterized by a large feature library, which contains relatively large feature redundant information, and feature-based software identification The rate is low; or there is no contextual semantic analysis feature group combined with software, lack of dynamic feature selection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Constraint limited clustering and information measuring software birthmark feature selection method and computer
  • Constraint limited clustering and information measuring software birthmark feature selection method and computer
  • Constraint limited clustering and information measuring software birthmark feature selection method and computer

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0104] Before feature extraction, equivalent semantic transformation is performed on the original software, and then the sub-behavioral features of the original software, similar software (software after equivalent semantic transformation) and heterogeneous software (reference software) are analyzed. The analysis and identification process is divided into two parts, such as Figure 4 , the first is the construction of birthmark features, and the second is the detection of unknown software. The specific implementation process is as follows:

[0105] The process of "construction of birthmark feature software recognition scheme" includes:

[0106] a) Extract the sub-behavior feature set of each type of software, and count the frequency of the sub-behavior features.

[0107] b) Measure each sub-line feature set of the original software, and compare its similarity with the same sub-behavioral features in similar software and heterogeneous software.

[0108] c) Calculate the simil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of birthmark-based software recognition, discloses a constraint limited clustering and information measuring software birthmark feature selection method, and aims at measuring distances between same-class features and distances between different-class features on the basis of mutual information by adoption of constraint limited clustering analysis. Themethod comprises the following steps of: during software feature selection, carrying out equivalent semantic transformation on software; carrying out feature segmentation; combining a program slice technology to carry out limited group classification on features; constructing gain function and penalty function evaluations for segmented feature fragment sets; and carrying out different constructiongroup-based hierarchical clustering selection so as to screen invariable features in same classes and get rid of common features in different classes. According to the method, relevancy between features is considered, and in a set formed by the screened birthmark features, the differentiation information amount is the maximum and the redundancy is the minimum, so that the anti-attack performanceof the birthmark features is ensured and the uniqueness of the birthmark features is ensured. According to the method, the robustness and credibility of the software birthmark features are improved, and the feature-based software recognition rate is greatly improved.

Description

technical field [0001] The invention belongs to the technical field of software birthmarks, and in particular relates to a feature selection method of constraint-defined clustering and information measurement software birthmarks and a computer. Background technique [0002] Traditional software feature extraction techniques are summarized into two categories: one is segmental extraction, which is a fixed-length or variable-length segmentation mode combined with grammar. After feature extraction, it can be cut into segments by static word segmentation. Segmented feature selection, or selection based on variable-length P-gram segmentation; without combining semantic analysis and analyzing the correlation between features, the obtained birthmark features are less robust to software encryption, deformation, polymorphic attacks, etc. ; The other is slicing extraction, combined with the selection method of software semantic structure, the program slicing is based on the program de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F8/75
CPCG06F8/751
Inventor 罗养霞
Owner XIAN UNIV OF FINANCE & ECONOMICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products