Incremental gradient improving decision-making tree updating method

An update method and decision tree technology, applied in character and pattern recognition, instruments, computer parts, etc., can solve the problem of not fully utilizing the data classification model

Active Publication Date: 2017-02-22
HENAN UNIVERSITY
View PDF2 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, when new data arrives, the existing gradient boosting decision tree needs to retrain a data classification model on all data sets, and does not make full use of the data classification model established on t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Incremental gradient improving decision-making tree updating method
  • Incremental gradient improving decision-making tree updating method
  • Incremental gradient improving decision-making tree updating method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] Below in conjunction with accompanying drawing and embodiment the present invention is described in detail:

[0045] like figure 1 As shown, the incremental gradient lifting decision tree update method of the present invention comprises the following steps:

[0046] A: Sorting and merging the incremental data set and the original data set to form the current latest ordered data set after merging new data blocks.

[0047] Step A includes the following specific steps:

[0048] A1: After one or more new data blocks arrive in real time, first sort each attribute on the new data block separately, and then independently generate a sorted data set for each attribute;

[0049] A2: Use the sorting and merging strategy to sequentially merge the sorted data set independently generated for each attribute on the new data block with the original data set sorted based on the original data set for the attribute. The merging method is as follows:

[0050] for each attribute T i The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an incremental gradient improving decision-making tree updating method. The incremental gradient improving decision-making tree updating method comprises the following steps: A, sequencing and merging an incremental data set and an original data set to acquire the current newest sequential data set; B, calculating the newest optimal splitting attribute and splitting value of each node on the current newest sequential data set separately; and C, updating data classification models by utilizing the comparison result of the newest optimal splitting attribute and the optimal splitting attribute of the node before a new data block arrives. By the incremental gradient improving decision-making tree updating method, the existing data classification models can be quickly updated by an incremental method before new data sets arrive in batches, a new gradient improving decision-making tree classification model does not need to be trained again, and updating is conducted on the basis of the existing models, so that the time of establishing the data classification models is greatly reduced, the training speed of the models is increased and a large amount of time cost is saved.

Description

technical field [0001] The invention relates to an updating method of a data classification model, in particular to an incremental gradient-boosting decision tree updating method. Background technique [0002] In recent years, people have gradually realized the importance of data analysis, and began to analyze and mine data to discover the potential value of data. More and more fields such as finance, e-commerce, medical care and education have begun to use data mining technology to obtain the potential value of data. [0003] Among these applications and services, the more common requirement is data classification. In the data classification problem, how to quickly update the existing data classification model is an important research topic, and it is also an important challenge faced by many applications that need to process data in real time. Because, after the arrival of new data, it will take a lot of time to rebuild a new data classification model. [0004] Gradient...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/24323
Inventor 张重生凡高娟张愿
Owner HENAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products