Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-label stream data classification method based on Hough-ding tree

A classification method and multi-label technology, applied in the field of pattern recognition and data mining, can solve the problems of high time complexity and poor prediction performance, and achieve the effect of wide application and improved classification and prediction performance.

Pending Publication Date: 2022-03-01
YANSHAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Due to the limitations and one-sidedness of the technology for classification and label prediction of multi-label and streaming data under the existing big data background, as well as the problems of poor prediction performance and high time complexity
In order to solve this problem, the present invention provides a multi-label flow data classification method based on Houghting tree, in order to solve the label prediction problem of multi-label and flow data at the same time, and further improve the algorithm by using the cascade structure and hierarchical model performance, which can effectively improve the real-time performance of dynamic data flow label prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-label stream data classification method based on Hough-ding tree
  • Multi-label stream data classification method based on Hough-ding tree
  • Multi-label stream data classification method based on Hough-ding tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Below in conjunction with accompanying drawing and specific embodiment, working principle of the present invention and working procedure are further explained:

[0052] In order to solve the problem of classification and multi-label label prediction of massive data in the background of big data, the present invention provides a Houghting tree-based multi-label data classification method applied to data flow scenarios. The samples are assigned to a group of base classifiers through feature division, and the label prediction vector of the current basic unit for the sample is obtained through the voting method during prediction. This method of predicting value decision-making takes into account the possible correlation between feature values.

[0053] Such as figure 1 As shown, the present invention provides a kind of multi-label stream data classification method based on Hofferding tree, which comprises the following steps:

[0054] S01. Define sample set Indicates d-d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-label stream data classification method based on a Hough-ding tree, and belongs to the technical field of pattern recognition and data mining. A training stage step: enabling new examples to flow into the model one by one; constructing a base classifier according to the number of instance features, respectively distributing a plurality of features, and carrying out incremental learning; the structures are used as basic units for hierarchical cascading, and each layer is provided with a plurality of different basic units; performing three-fold cross validation during training of each layer to obtain the weight of each unit, and verifying the classification performance of the current layer; a prediction stage step: enabling the new examples to flow into the trained model one by one, and obtaining the feature vector of the current example; feature division; the weight determined when training is applied to the predicted value of the current instance by each basic unit is spliced with the original feature to serve as the input feature of the next layer; and performing statistics on the prediction precision of each layer for the current test instance, if the prediction precision does not obviously increase, stopping model increase, and taking the prediction value of the last layer as a final prediction result.

Description

technical field [0001] The invention belongs to the technical field of pattern recognition and data mining, and in particular relates to a method for classifying multi-label flow data based on a Hofferding tree. Background technique [0002] With the popularization and vigorous development of computer technology, data information is becoming more and more complex and changeable. How to filter useful information from massive data information is a key problem that we need to solve urgently in the era of big data. Feature selection, as a feature dimensionality reduction technique that obtains an "optimal feature subset" by eliminating irrelevant and redundant features, has received extensive attention from researchers in the context of big data. [0003] Classification and regression problems are hot topics in machine learning. Existing multi-label classification methods usually deal with existing complete static data and put them into the model in batches in the form of featur...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/21G06F16/22G06F16/2458
CPCG06F16/212G06F16/2246G06F16/2465
Inventor 梁顺攀潘维维
Owner YANSHAN UNIV