Binning data processing method, device and equipment, storage medium and program product

A processing method and binning technology, applied in the field of data processing, can solve problems such as data leakage and poor security, and achieve the effect of increasing security and improving the overall interaction security

Pending Publication Date: 2021-11-26
WEBANK (CHINA)
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, if the party holding the target variable encodes the target variable in a special way, it is possible to extract the information of the characteristic variable from the other party, resulting in data leakage. Therefore, the existing calculation process of WOE and IV has poor security.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Binning data processing method, device and equipment, storage medium and program product
  • Binning data processing method, device and equipment, storage medium and program product
  • Binning data processing method, device and equipment, storage medium and program product

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] An exemplary embodiment of the present invention will be described in more detail below with reference to the accompanying drawings. While the exemplary embodiments of the present invention are shown in the drawings, it is understood that the present invention can be implemented in various forms and is not intended to be illustrated herein. Instead, these embodiments are provided in order to be more specifically understood, and the scope of the invention can be communicated to those skilled in the art.

[0065] Federal learning can be trained in models in combination with multiple institutions without local conditions. Since the sample data may contain multiple types of feature variables, which feature variables are selected are a key issue for model training. WOE and IV can reflect the prediction capability of features, which effectively measures its contribution to model prediction results, which can be used to assist in achieving screening of feature variables.

[0066] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a binning data processing method, device, equipment, a storage medium and a program product, and the method comprises the steps: obtaining a plurality of data IDs sent by a target variable provider, an encrypted target variable corresponding to each data ID, and an encrypted opposite variable, and calculating an encrypted positive sample proportion and an encrypted negative sample proportion corresponding to each sub-box, multiplying the encrypted positive sample proportion and the encrypted negative sample proportion by corresponding random numbers, and sending multiplication results to a target variable provider, and obtaining an intermediate result corresponding to each sub-box determined by the target variable provider according to the multiplication result, and sending the encrypted information value and / or a result obtained by adding the random number to the evidence weight to the target variable provider according to the intermediate results corresponding to the plurality of sub-boxes and the corresponding random number. According to the method, the security of solving the information value and the evidence weight can be improved.

Description

Technical field [0001] The present invention relates to data processing technology, and in particular, to a method of processing data binning, apparatus, equipment, storage medium, and a program product. Background technique [0002] Learning can combine multiple federal agencies, while meeting user privacy and data security, machine learning model. [0003] During the federal study, WOE (Weight of Evidence, weight of evidence) and IV (InformationValue, the value of information) is a very important indicator, can be used to assess the predictive power characteristic variables. In practice, one of the target holders of variable, other variables held features, both of the interaction may be implemented WOE calculated and IV, thereby completing the filter characteristic variables and so on. [0004] However, if the objective variable target variable will hold one special encoding, it is possible characteristic variable taking information from the other, leading to leakage of data, s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/60
Inventor 谭明超马国强范涛杨强
Owner WEBANK (CHINA)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products