Unlock instant, AI-driven research and patent intelligence for your innovation.

Novel Bayesian weighting method based on CFS _ KL

A Bayesian algorithm, a new type of technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve problems such as unrealistic, limited naive Bayesian classification effect, and data not so strong independence, to achieve blunt Improve the recognition degree, improve the classification effect, and improve the effect of classification accuracy

Active Publication Date: 2020-06-05
XI AN JIAOTONG UNIV
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since learning the optimal Bayesian classifier is just like learning the Bayesian network, it is an NP-hard problem, so learning the Naive Bayesian classifier has been favored by many scholars, and Naive Bayesian is often based on a simple But unrealistic assumption: the features of the training data are independent of each other. This strong condition is difficult to achieve in real life. Even in reality, it has logically shown that the features are independent of each other. In the actual data Not so independent, which greatly limits the classification effect of Naive Bayes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel Bayesian weighting method based on CFS _ KL
  • Novel Bayesian weighting method based on CFS _ KL
  • Novel Bayesian weighting method based on CFS _ KL

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] A kind of novel Bayesian weighting method based on CFS_KL of the present invention comprises the following steps:

[0060] S1. In the data collection stage, disassemble the nmap fingerprint library, obtain training data, and simulate test data;

[0061] Analyze the operating system identification rules in the nmap fingerprint library. The nmap fingerprint library will send 16 data packets to generate a corresponding response sequence, and each response sequence will correspond to some flag bits. The fingerprint library of nmap contains the operating system's fingerprint information contained in the response data packet of the operating system known to nmap to the 16 probe packets of nmap. Therefore, the fingerprint name in the fingerprint library is used as the tag data of the model, and the flag bits of the response sequence under the fingerprint name are used to form the training data. The following is a fingerprint of the nmap fingerprint library:

[0062] Fingerpr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a novel Bayesian weighting method based on CFS _ KL, and the method comprises the steps: taking a fingerprint name in a fingerprint library as the mark data of a model, and forming training data through a response sequence flag bit under the fingerprint name; performing box sealing preprocessing operation on the training data; calculating the correlation degree between theattributes and the classes by using KL divergence to serve as the weight of each attribute; selecting 42 dimensions by using a feature selection method; correcting the weight calculated by the KL divergence by using the dimension selected by the CFS; performing training by using a weighted Bayesian algorithm; inputting the vector into a trained fingerprint model through box sealing operation, calculating the maximum posterior probability of each flow based on a CFS _ KL weighted Bayesian algorithm, and completing simulation data testing; collecting real traffic in a mode of sending a packet tothe target network segment, inputting the real traffic into the fingerprint model, and predicting a result; and calculating the test precision of the real flow. According to the method, the requirement of the Bayesian algorithm for feature independence is relieved, and the recognition precision of the Bayesian algorithm is improved.

Description

technical field [0001] The invention belongs to the technical field of machine learning, and in particular relates to a novel Bayesian weighting method based on CFS_KL (correlation-based feature selection_Kullback-Leibler). Background technique [0002] As one of the ten classic algorithms of machine learning, Bayesian algorithm has many applications in many fields, and all of them have shown very good results. For example, judging whether an email is spam or not based on its title and content. However, since learning the optimal Bayesian classifier is just like learning the Bayesian network, it is an NP-hard problem, so learning the Naive Bayesian classifier has been favored by many scholars, and Naive Bayesian is often based on a simple But unrealistic assumption: The features of the training data are independent of each other. This strong condition is difficult to achieve in real life. Even in reality, it has logically shown that the features are independent of each other...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/24155G06F18/214
Inventor 桂小林安迪
Owner XI AN JIAOTONG UNIV