Method and device for constructing GBDT model and prediction method and device

A model and algorithm technology, applied in the field of building a gradient boosting decision tree GBDT model, can solve the problem of low accuracy of the decision tree model

Active Publication Date: 2019-10-18
THE FOURTH PARADIGM BEIJING TECH CO LTD
View PDF12 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the above problems, the present invention proposes a method and device for constructing a gradient boosting decision tree GBDT model, the main purpose of which is to solve the problem of low accuracy of the existing trained decision tree model and improve the accuracy of the trained model. Rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for constructing GBDT model and prediction method and device
  • Method and device for constructing GBDT model and prediction method and device
  • Method and device for constructing GBDT model and prediction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0090] Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present invention are shown in the drawings, it should be understood that the invention may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present invention and to fully convey the scope of the present invention to those skilled in the art.

[0091] The embodiment of the present invention provides a method for constructing a GBDT model of a gradient boosting decision tree, which can be applied to processes such as bank card leakage detection, recommendation of goods and services, classification of images or texts, and detection of malicious traffic. In the above scenarios , the labeled positive sample data is a small part, and most of the sample data is unlabeled data. The method de...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for constructing a GBDT (Gradient Boost Decision Tree) model, relates to the technical field of machine learning, and mainly aims to solve the problem oflow accuracy of the existing trained GBDT model. According to the main technical scheme, the method comprises the steps of obtaining a sample data set, wherein the sample data set comprises positive sample data with positive labels and unlabeled sample data without labels; training each regression tree of a GBDT model; constructing a positive sample training subset based on positive sample data inthe sample data set, sampling unmarked sample data in the sample data set to construct a negative sample training subset, and combining the positive sample training subset with the plurality of negative sample training subsets to obtain a training set of a current regression tree, training the current regression tree based on the training set of the current regression tree, and constructing a GBDT model according to each regression tree. The invention is used in the construction process of the gradient boosting decision tree.

Description

technical field [0001] The present invention relates to the technical field of machine learning, in particular to a method and device for constructing a gradient boosting decision tree GBDT model and a method and device for predicting using the model. Background technique [0002] With the continuous advancement of technology, artificial intelligence technology is also gradually developing. Among them, machine learning is an inevitable product of the development of artificial intelligence research to a certain stage. It is committed to improving the performance of the system itself by means of computing and using experience. In computer systems, "experience" usually exists in the form of "data". Through machine learning algorithms, "models" can be generated from data. Model, when faced with a new situation, the model will provide the corresponding judgment, that is, predict the result. Whether training a machine learning model or making predictions using a trained machine ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
CPCG06N20/00
Inventor 王海涂威威
Owner THE FOURTH PARADIGM BEIJING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products