A prediction method and device based on tree model

A tree model and model technology are applied in the field of prediction methods and devices based on tree models, and can solve problems such as undesired data provision.

Active Publication Date: 2021-02-19
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the case of use, the following problems are often faced: multiple data parties have their own data, and they want to use each other's data to unify the modeling tree model, such as Xgboos t or GBDT model, but they do not want to provide their own data to each other

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A prediction method and device based on tree model
  • A prediction method and device based on tree model
  • A prediction method and device based on tree model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Embodiments of this specification will be described below with reference to the accompanying drawings.

[0058] figure 1 A schematic diagram showing a system for constructing and using a privacy-preserving tree model according to an embodiment of the present specification. The system includes at least two data cubes, three data cubes are schematically shown in the figure, data cube A, data cube B and data cube C. The three data parties use their own data (shown as data 1 to data 3 in the figure) to jointly construct and use a tree model. For example, data parties A, B, and C are banks, insurance institutions, and platforms respectively, and their users may have multiple common users (such as users a, b, c, d, and e). The difference is that data parties A (such as a bank) has the characteristic value of each user's characteristic 1 (such as deposit balance), data party B (insurance institution) has the characteristic value of each user's characteristic 2 (such as the i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of this specification provides a privacy-protecting tree model construction method and device and a prediction method and device based on the tree model. The construction method includes: obtaining M groups of split results from the respective devices of at least two data parties, M Group splitting results correspond to M features respectively; record the corresponding data cubes of M group splitting results; calculate the splitting gain of each splitting result based on the respective label values ​​of N samples; obtain the splitting result with the largest splitting gain as the optimal splitting result; in the case where the splitting gain of the optimal splitting result is positive, determine the data cube corresponding to the optimal splitting result; in the case that the corresponding data cube is the second data cube, send the optimal splitting result to the second The device of the second data cube, and record the corresponding relationship between the first node and the second data cube; mark the first node to indicate that there is no node data of the first node locally, and update the tree structure of the first tree accordingly .

Description

technical field [0001] The embodiments of this specification relate to the technical field of machine learning, and more specifically, relate to a method and device for constructing a tree model, and a method and device for predicting based on a tree model. Background technique [0002] Machine learning that can protect data security is currently a technical issue of widespread concern. That is, multiple data parties have their own data, and under the condition of ensuring the data security of all parties, all parties cooperate to train the machine learning model for multiple parties to use. The linear regression / logistic regression tree model is a regression / classification model widely used in the industry. In the case of use, we often face the following problems: multiple data parties have their own data, and they want to use each other's data to unify the modeling tree model, such as Xgboos t or GBDT model, but they do not want to provide their own data to each other . ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/901G06F21/62G06Q10/04
CPCG06F21/6245G06Q10/04G06F16/9027
Inventor 陈超超王力周俊
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products