Client selection method for edge-side federal learning under heterogeneous data

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A heterogeneous data and client-side technology, which is applied in the field of artificial intelligence, can solve the problems of far-flung models, unreal scenes, and increased energy consumption, and achieve the effects of accelerating model convergence, reducing energy consumption, and maintaining accuracy

Pending Publication Date: 2022-04-22

BEIJING INSTITUTE OF TECHNOLOGYGY

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

FL's machine learning environment uses multiple entities to collaborate to solve machine learning problems under the coordination of a central server or service provider. During the process, the original data of each client is still stored locally and does not participate in exchange or transmission, such as figure 1 The shown FL architecture can implement machine learning tasks without data concentration. Therefore, this model meets the client’s demand for personal data privacy, and is especially suitable for edge computing applications; FL is used on the edge side The application can effectively use the edge device data, but there is also the problem that the edge device data needs to be optimized for the edge environment because the edge environment is relatively complex

[0004] The central environment of the edge device data satisfies the characteristics of Independently Identical Distribution (IID), that is, the edge device data conforms to the same probability distribution and is independent of each other. Using IID data to train the model in the test set can show better results; while in In FL, since the data on the client is non-independent and identically distributed data (Non-IID), that is, heterogeneous data, and the data is usually not evenly distributed among the clients, this will lead to directly using the client data training The model may be very different from the overall model of the central environment. The local data directly extracted from the edge device client cannot meet the sample requirements for extracting data from the overall distribution. This has a huge impact on model training. At the same time, in FL, participants in the training Typical clients are mobile devices powered by independent batteries. Compared with the energy storage requirements for model training, energy efficiency is also a key challenge that cannot be ignored.

[0005] One of the effective means to deal with this problem is to adopt a basic algorithm called federated averaging (FedAvg) in FL. FedAvg randomly selects a subset of clients in each round of learning and runs the global model on its local data. The local copy of , when the local data is weighted by running stochastic gradient descent and sent back to the FL server, the FL server then updates the client's model weights to a weighted sum; this algorithm can train high-quality models with relatively few communication rounds, At the same time, it has shown a strong ability to overcome the unbalanced data distribution between devices that is common in FL; McMahan and others have confirmed that the FedAvg algorithm works in a communication-constrained heterogeneous environment and Non-IID data distribution. , but its proven FedAvg algorithm lacks theoretical convergence guarantees

[0006] The reason for the lack of theoretical guarantee for the convergence of the FedAvg algorithm is that the assumption used for experimental analysis and theoretical proof of the convergence of the FedAvg algorithm is that data is shared between devices or distributed in the form of IID, and all devices participate in each round of learning communication; The assumption simplifies the analysis, but the FL environment violates the real edge side, and the scene built by the environment is not realistic

Aiming at this unreal problem, the experiment of Smith et al. carried out the simulation of the real scene, and the FedAvg algorithm performed well when the local client data set was relatively large and the data Non-IID distribution was mild; however, when The performance of the FedAvg algorithm drops significantly when the non-IID data with severe offset and the local data set of the client is relatively small.

[0007] The main reason for the obvious decline in the performance of the FedAvg algorithm is that the traditional method of randomly selecting clients to participate in training is not suitable for the FedAvg algorithm. Due to the heterogeneity of data and devices, the number of clients participating in training usually exceeds the actual training requirements. The number of clients, which not only greatly increases energy consumption, but also causes poor performance of the model due to the traditional random selection method of clients or data, and the test results are biased

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0050] Next, in conjunction with the accompanying drawings of the description, the construction data set of the client selection method, the establishment of a convolutional neural network, the establishment of a federated learning framework and construction environment, the accuracy of the FedNorm algorithm, the test of FedNorm-E (FedNorm algorithm optimization) and The comparison between FedNorm-E and FedNorm algorithm is further introduced in detail:

[0051] 1. Build a dataset based on the FEMNIST dataset:

[0052] In the client selection method, the construction method of the IID data set is globally unbalanced in the original FEMNIST (FederatedExtended MNIST, federated extended MNIST) data set data sample, and the FEMNIST data set first contains 3550 user handwritten data The MNIST data set expands numbers and characters, and then divides the expanded data. The numbers and characters used for expansion include 10 numbers, 26 lowercase letters and 26 uppercase letters. Ac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the field of artificial intelligence, and provides a client selection method for edge-side federated learning under heterogeneous data, and the method comprises the steps: carrying out the training initialization, and constructing a data set; calculating changes of local weights of the candidate clients; the FL server selects a client set participating in training based on the weight change information; the FL server calculates an average weight; repeating the previous steps until the convergence performance of the training model is unchanged; according to the method, the FL server ensures that the selected data sample accords with scientificity and representativeness in a real heterogeneous data environment, meanwhile, the accuracy of the FL training model is further improved by increasing the additionally selected client number parameter S and the cycle parameter P, and the energy consumption is reduced.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to a client selection method for edge-side federated learning under heterogeneous data. Background technique [0002] As large amounts of data are increasingly generated from mobile devices such as smart homes, mobile phones, wearables, and edge devices, it becomes critical for many applications to train machine learning distributed across multiple nodes. Distributed training Machine learning is a collaborative training model through multiple working nodes. The most commonly used method is the training method of Stochastic Gradient Descent (SGD), that is, by tracking the direction of the target gradient and iteratively optimizing the objective function until convergence , in each iteration of this training method, the training data is used to calculate the descending gradient first, and then the model parameters are updated by changing the training steps ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F9/54G06N3/04G06N3/08

CPCG06F9/547G06N3/084G06F2209/544G06N3/045Y02D10/00

Inventor 赵健鑫刘驰冯雁浩常欣煜

Owner BEIJING INSTITUTE OF TECHNOLOGYGY

Client selection method for edge-side federal learning under heterogeneous data

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology