Unlock instant, AI-driven research and patent intelligence for your innovation.

Efficient Pearson coefficient calculation method based on third party in federated learning environment

A technology of Pearson coefficient and learning environment, which is applied in the field of high-efficiency Pearson coefficient calculation based on third parties, which can solve problems such as impracticality, impracticality of data correlation calculation, and inability to complete calculation tasks, and achieve the effect of improving calculation efficiency.

Pending Publication Date: 2022-04-15
深圳前海新心数字科技有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0019] However, this method still uses a large number of Paillier homomorphic public key encryption operations to generate BeaverTriplets (triplets), resulting in inefficient calculation of the overall Pearson correlation coefficient of FATE, which is not practical in the case of large-scale data
For example, when there are tens of millions of data, the calculation task cannot be completed in one day, which makes FATE's data correlation calculation very impractical

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Efficient Pearson coefficient calculation method based on third party in federated learning environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] Example 1, such as figure 1 As shown, it is a third-party efficient Pearson coefficient calculation method based on a federated learning environment. In this method, the open source FATE is selected as the overall calculation and communication framework for calculating the Pearson coefficient. The two parties involved in the calculation of the feature correlation coefficient are respectively Party A and Party B, and the semi-honest third party is Party C. Specifically include the following steps:

[0044] (1) First, select an overall computing and communication framework that needs to calculate the Pearson coefficient. Here, choose the open source FATE.

[0045] (2) Assume that the two parties involved in the calculation of the characteristic correlation coefficient are party A and party B, and the semi-honest third party is party C.

[0046] (3) Both parties A and B need to perform sample alignment according to their respective input data (filter data with the same i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a third-party-based efficient Pearson coefficient calculation method in a federated learning environment, in the method, an open-source FATE is selected as an overall calculation communication framework for calculating a Pearson coefficient, two parties participating in feature correlation coefficient calculation are a party A and a party B respectively, and a semi-honest third party is a party C; according to an existing scheme, a large number of homomorphic encryption operations need to be used in order to safely generate Bever Triplets, and in a new scheme, the homomorphic encryption operations can be removed and (a, b, c) Bever Triplets can be safely generated through a semi-honest third party without sacrificing safety, and meanwhile, the two parties respectively obtain an addition secret sharing share of (a, b, c). Due to the fact that a large amount of large integer modular exponentiation operation in Paillier homomorphic encryption in an original scheme does not exist, only tensor dot product and addition and subtraction operation exist, efficiency is greatly improved.

Description

technical field [0001] The invention relates to a third-party efficient Pearson coefficient calculation method in a federated learning environment. Background technique [0002] Here are some basics in this field: [0003] Federated machine learning: The application of privacy computing in the field of machine learning can integrate data from multiple parties without revealing the private data of all parties, and use machine learning algorithms to train models and make predictions. [0004] The security model of federated learning is mostly a semi-honest model. [0005] Feature engineering: A data preprocessing method in machine learning engineering, which screens the feature data of the sample and discretizes the feature data so that a better machine learning model can be trained later. [0006] Pearson coefficient: A method of calculating data correlation, which can be used in feature engineering to calculate the correlation between sample data features, so as to filter ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/20G06F21/60G06F17/18G06F7/50
CPCG06N20/20G06F21/602G06F17/18G06F7/50
Inventor 谈扬
Owner 深圳前海新心数字科技有限公司