Supercharge Your Innovation With Domain-Expert AI Agents!

Novel decision tree classification method based on J divergence

A decision tree classification and decision tree technology, applied in the intersection of information theory and data mining, can solve problems such as poor classification and prediction accuracy, and achieve good overall performance.

Inactive Publication Date: 2019-05-14
SHANGHAI JIAO TONG UNIV
View PDF1 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the defects in the prior art, the purpose of the present invention is to provide a novel decision tree classification method based on J-divergence, which is applied to the classification of data sample sets, so as to solve the problem of poor classification prediction accuracy of the existing decision tree classification method. good technical questions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel decision tree classification method based on J divergence
  • Novel decision tree classification method based on J divergence
  • Novel decision tree classification method based on J divergence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several changes and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0042] In the present embodiment, a novel decision tree classification method based on J-divergence of the present invention is introduced as follows:

[0043] S1. Normalized input sample data set D={X j (i) ;C (i)},i=1,2,...,M,j=1,2,...,N, where X j (i) Denotes sample X (i) in feature A j The eigenvalues ​​on C (i) ∈{c 1 ,c 2 ,...,c K} represents sample X (i) The corresponding category label value;

[0044] S2. Set the division termination condition of the sample data set or data subset, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a novel decision tree classification method based on J divergence. The method comprises the following steps of S1, standardizing an input sample data set; S2, setting a divisiontermination condition of the sample data set or the data subset, i.e., generating a condition that the leaf node does not continue to be divided according to the state of the current node by the decision tree; S3, preparing to divide the original sample data set, and creating a root node of the decision tree; S4, dividing the sample data set, splitting nodes of the decision tree and generating adecision rule according to a division criterion; S5, continuously dividing the sample subset obtained by dividing each time according to the steps S2, S3 and S4 by adopting a recursive mode to obtaina new node; and S6, discriminating and predicting the category value of the sample X * with the unknown category label value. According to the method, J divergence information measurement is innovatively used for generation of a division criterion, so that the prediction accuracy of the decision tree classification algorithm is improved.

Description

technical field [0001] The present invention relates to the cross technical field of information theory and data mining, in particular to a novel decision tree classification method based on J-divergence. Background technique [0002] With the development and progress of technologies such as wireless mobile communications, the Internet, and various smart terminal devices, massive amounts of data are being generated and collected exponentially and explosively. How to discover and extract useful knowledge or rules from massive big data is a series of issues worth considering. These problems all involve a key technology in big data technology, that is, data mining technology. The classification problem is a typical problem to be solved by data mining, and it is widely used in practical application scenarios such as spam recognition, text and image recognition, financial risk control and credit card fraud, online advertisement placement, and recommendation systems. Most of the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
Inventor 杨云帆陈文
Owner SHANGHAI JIAO TONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More