Multi-classifier integrating method based on increment native Bayes network

A Bayesian network, multi-classifier technology, applied in the fields of instrumentation, computing, electrical digital data processing, etc., can solve the problems of affecting classification prediction results, inability to discard useless classifiers in time, concept interference, etc., to improve classification prediction results. , to avoid the effect of catastrophic forgetting

Inactive Publication Date: 2008-08-27
JILIN UNIV
View PDF0 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing ensemble-based methods cannot discard useless classifiers in time, causing interference of wrong concepts and affecting classification prediction results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-classifier integrating method based on increment native Bayes network
  • Multi-classifier integrating method based on increment native Bayes network
  • Multi-classifier integrating method based on increment native Bayes network

Examples

Experimental program
Comparison scheme
Effect test

experiment example

[0060] The present invention adopts STAGGER, a classic data set of concept drift problem, to analyze the performance of DynamicAddExp (multi-classifier integration method based on incremental naive Bayesian network). The instance space of the STAGGER dataset is described by three attributes: size = {small, medium, large}, color = {red, green, blue}, and shape = {square, circular, triangular}. Class labels class ∈ {-1, +1}. Three target concepts are defined as follows: (1) size=small and color=red; (2) color=green or shape=circular; (3) size=(medium or large). 120 training instances are randomly generated, and each instance is assigned a category according to the current concept. Every 40 training examples belong to a concept, and the concept sequence is: (1)-(2)-(3). At each time step, the classifier learns from one instance and is tested for predictive accuracy on a test set of 100 instances. Test instances are also randomly generated according to the current concept. All...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an increment-based naive Bayesian network multiple classifier integration method, comprising the following steps that: a integration classifier and various key parameters are initialized; if no novel data exists, the process is ended; a category of a novel data item is forecasted by utilization of the prior integration classifier; parameter values of all the individual classifiers are dynamically updated; the weighing of all the individual classifiers is updated; if no error of the category forecast of novel data by the integration classifier is generated, all the individual classifiers in the integration classifier are trained by utilization of the novel data item; redundant individual classifiers are deleted according to the KL pruning strategy; a novel individual classifier is increased; all the individual classifiers are trained by utilization of the novel data item. The increment-based naive Bayesian network multiple classifier integration method can effectively improve the classification forecast result when concept shift is generated, and is particularly suitable for processing the concept shift problem.

Description

technical field [0001] The invention belongs to the field of data mining and machine learning, and relates to an incremental naive Bayesian network-based multi-classifier integration method for conceptual drift data modeling. Background technique [0002] The data in many application fields is constantly increasing, and the patterns contained in it will change with time and application environment, which is called "concept drift". At present, a lot of research has been done on dealing with concept drift at home and abroad, and a variety of model learning methods have been proposed. They can be classified into two categories: methods based on instance selection, such as the FLORA series of algorithms proposed by Widmer and Kubat et al., the algorithm for adaptively adjusting the window size proposed by Lazarescu et al., and the TMF (Time-Windowed Forgetting) algorithm of Salganicoff. The method based on instance selection cannot deal with the problem of concept drift well be...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 刘大有关菁华黄晶齐红
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products