Federated learning system based on heterogeneous data

A heterogeneous data and federation technology, applied in the computer field, can solve the problems of global model parameter update direction deviation, high communication cost, and slow convergence speed, so as to improve convergence speed and convergence stability, widely use value, and reduce communication cost Effect

Active Publication Date: 2021-06-22
上海嗨普智能信息科技股份有限公司 +1
View PDF7 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The existing federated averaging algorithm is suitable for independent and identical distribution of client data sets. However, on data sets with strong heterogeneity, the federated averaging algorithm will have slow convergence speed and unstable convergence due to differences in client data distribution. A series of technical problems such as high communication costs and even inability to converge are mainly due to the fact that the federated averaging algorithm has been updated locally many times, and this method will cause the parameter update direction of the global model to deviate from the ideal update direction, which will lead to slow convergence

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Federated learning system based on heterogeneous data
  • Federated learning system based on heterogeneous data
  • Federated learning system based on heterogeneous data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] In order to further explain the technical means and effects of the present invention to achieve the intended purpose of the invention, the following is a specific implementation of a federated learning system based on heterogeneous data proposed in accordance with the present invention. Its effect is described in detail below.

[0020] Federated learning is mainly divided into horizontal federated learning and vertical federated learning. Horizontal federated learning is suitable for situations where user features overlap more but users overlap less. The present invention is an improvement for horizontal federated learning. Assuming that there are clients A and B, their data set distributions are very different. The cost function F of A and B k (w) The graphs about the model parameter w are very different. Taking the following two functions as an example (ie w=(x,y)), the contour graph of client A f(x,y)=x 2 +y 2 The change of +10x is relatively smooth, such as figu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a federated learning system based on heterogeneous data, which comprises a central server, K clients, a memory in which a computer program is stored and a processor, and is characterized in that the central server stores a global control variable S and a global model parameter W obtained by each round of federated learning; the global control variable S is used for recording the global model updating direction of the round; the client stores a local control variable Si obtained by each participating client by participating in federated learning every time, the local control variable Si is used for recording the updating direction of a local model of the client participating in federated learning training this time, and the value of i is 1-K.Communication cost of federated learning based on heterogeneous data is reduced, and the convergence speed and the convergence stability of federated learning are improved.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a federated learning system based on heterogeneous data. Background technique [0002] Federated learning is a special distributed learning framework. It is mainly used to solve the problem of data islands. In federated learning, data exists on each local client, and all data cannot be gathered in one place for traditional centralized learning. The Federated Averaging Algorithm (FedAvg for short) in the existing federated learning has become the preferred optimization algorithm in the field of federated learning because of its simplicity and low communication overhead. The existing federated averaging algorithm is suitable for independent and identically distributed client data sets. However, on heterogeneous data sets, the federated averaging algorithm will have slow convergence speed and unstable convergence due to differences in client data distribution. A series of techn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
CPCG06N20/00
Inventor 蔡文渊魏森辉高明顾海林徐林昊孙嘉
Owner 上海嗨普智能信息科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products