Distributed training framework construction method based on NFDT and distributed training method and system

A training method and distributed technology, applied in the Internet field, can solve the problems of high bandwidth storage cost, consumption of bandwidth, difficult to define, etc., and achieve the effect of ensuring data reliability, increasing download speed, and improving security.

Pending Publication Date: 2020-10-13
钛星投资(深圳)有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for many beginners, they still need to be familiar with new APIs or frameworks before they can be deployed into a code structure that can perform distributed training, and the effect is still not ideal.
From the perspective of the overall implementation process, implementing distributed training on the stand-alone framework requires a lot of framework rewriting and cluster definition. Compared with the stand-alone version, such a distributed framework is more complex and difficult to define
[0004] The second is that when a centralized distributed storage dataset is used, if a server is down, the data will be lost, which will affect the overall training efficiency and data security.
Therefore, if the data is placed in a centralized place, the user's data security will not be guaranteed
In addition, centralized storage leads to very high bandwidth transmission costs. For example, in the field of computer vision, image data sets are often hundreds of gigabytes or thousands of gigabytes. Whether uploading data or downloading data, bandwidth will be consumed. Bandwidth cannot be ignored and compared with storage costs It is expensive, and centralized storage services, in order to ensure that users in various places have a good experience, they will deploy data centers on the backbone network, and the backbone network is very expensive in any country

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed training framework construction method based on NFDT and distributed training method and system
  • Distributed training framework construction method based on NFDT and distributed training method and system
  • Distributed training framework construction method based on NFDT and distributed training method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0038] In describing the present invention, it should be understood that the terms "longitudinal", "transverse", "upper", "lower", "front", "rear", "left", "right", "vertical", The orientation or positional relationship indicated by "horizontal", "top", "bottom", "inner", "outer", etc. are based on the orientation or positional relationship shown in the drawings, and are only for the convenience of describing the present invention and simplifying the description, rather than Nothing indicating or implying that a referenced device or elem...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed training framework construction method based on NFDT and a distributed training method and a system. The distributed training method comprises the following stepsthat a single-machine user stores a data set and codes in a cluster network and generates a unique hash value; a distributed training module receives a hash value corresponding to a task demand and downloads a data set and a standalone version code from a cluster network; the distributed training module generates a distributed training framework by utilizing the standalone version code; and the distributed training framework calls the computing resources to perform distributed training. According to the method, a user only needs to upload a data set and a stand-alone training code of the user, the training task is compressed into the cluster in a decentralized storage mode, the corresponding training equipment in the cluster decompresses the training task according to the received hash value, and training safety and data reliability are guaranteed. The NFDT framework performs model interpretation and analysis on a model file generated in a standalone version training process, and a training function from a standalone version code structure to a distributed mode is achieved.

Description

technical field [0001] The present invention relates to the field of Internet technology, in particular to a distributed training framework construction method based on NFDT (New Framwork for Distributed Tensorflow, distributed Tensorflow new framework), a distributed training method and a system. Background technique [0002] When using the distributed Tensorflow (tensor flow) framework to build a distributed training platform system, it is found that there are two main problems: [0003] First, when performing distributed Tensorflow training, users need to modify the stand-alone code to distributed code, which increases the difficulty for users. Although it is the Horvovd framework launched by Uber, or the estimator API or Tensorflow2.0 launched by Google , are all underlying packages of Tensorflow, which greatly reduces the difference between the distributed framework and the stand-alone version. However, for many beginners, they still need to be familiar with new APIs o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/30G06F9/448G06K9/62G06N20/00
CPCG06F8/31G06F9/4482G06N20/00G06F18/214
Inventor 兰毅
Owner 钛星投资(深圳)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products