Unlock instant, AI-driven research and patent intelligence for your innovation.

Stratified decision tree constructing method

A construction method and decision tree technology, applied in special data processing applications, instruments, electrical and digital data processing, etc., can solve problems such as difficulty in obtaining ideal results, and achieve the effect of strong flexibility

Inactive Publication Date: 2011-09-14
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to solve the problem that the existing decision tree method is difficult to achieve ideal results in the presence of nonlinear conditional attribute relations in view of the deficiencies in the prior art, and propose a new decision tree construction method for predicting data Classification and Data Mining

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Stratified decision tree constructing method
  • Stratified decision tree constructing method
  • Stratified decision tree constructing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be further described below in conjunction with accompanying drawing.

[0034] figure 1 is a flowchart of the cascaded decision tree method. The concrete steps of implementation mode include:

[0035] 1. Enumerate all non-category attributes, assuming that the total number is n, calculate the correlation coefficient between them and category attributes;

[0036] 2. Take out m attributes whose correlation coefficient is less than the threshold, and put them into the hierarchical attribute cluster to be selected;

[0037] Sort the calculated correlation coefficients, and take out m attributes whose correlation coefficients are less than the threshold value. The threshold value here is the control of the correlation coefficient. If it is set too low, hidden conditional attributes may be lost, and if it is set too high, too many attributes will be lost. The unconditional attributes of the unconditional attributes are put into the hierarchical at...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a stratified decision tree constructing method comprising the following steps: 1. calculating related coefficients of non-categorical attributes and categorical attributes; 2. taking out attributes with a related coefficient smaller than a thresh value to put in a stratified attribute cluster to be selected; 3. taking out one attribute and stratifying a data set according to the attribute value; 4. recalculating the related coefficients of all the attributes in data subsets, judging whether the attributes are real stratified attributes according to the related coefficient lifting condition of the attributes; 5. repeating the steps 3 and 4 to obtain a real stratified attribute cluster; 6. sequencing according to contribution degree in the stratified attribute cluster; 7. sequentially taking out the stratified attributes, carrying out data set stratification to form a stratified decision tree; and 8. nesting and applying a known excavation method in the data subsets to obtain a complete stratified decision tree. By introduction of stratified attributes, the stratified decision tree constructing method solves the problem that an ideal effect is difficult to obtain by the traditional method under the condition of nonlinear condition attribute relationship; moreover the stratified decision tree constructing method has stronger flexibility.

Description

technical field [0001] The present invention relates to a new data mining method—a method for constructing a stratified decision tree (Stratified Decision Tree, referred to as SDT), which belongs to the field of data mining. Background technique [0002] The decision tree method is one of the most widely used and practical data mining prediction techniques. The principle of the decision tree method is to find out the splitting attribute with the most ability to distinguish different data in the current data set, and then divide the data according to the value of the splitting attribute. The set is divided into multiple subsets, each subset corresponds to a branch of the bifurcation tree, and then this process is called recursively for each subset until all subsets contain the same type of data, thus obtaining a decision tree model, and Based on this, the new data set to be processed is predicted and classified. [0003] Among the decision tree methods, the selection method ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 牛振东赵育民王维强
Owner BEIJING INSTITUTE OF TECHNOLOGYGY