Model training method and device and cluster system

A cluster system and cluster technology, applied in transmission systems, neural learning methods, biological neural network models, and related fields, that addresses the low efficiency of training AI models with deep learning caused by limited hardware capability.

Inactive Publication Date: 2020-06-23
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] In the HPC described above, the computing resources of a single computing node are mainly CPUs, so hardware capability is limited. As a result, training AI models with deep learning on such an HPC is inefficient.



Embodiment Construction

[0065] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0066] Today, with the rapid development of artificial intelligence, heterogeneous computing platforms composed of CPUs and GPUs are playing an increasingly important role. When the training data set is small, deep learning performs poorly, which is one of the reasons why deep learning did not attract attention. Deep learnin...


Abstract

The embodiment of the invention discloses a model training method and device and a cluster system, and relates to the technical field of artificial intelligence. The specific implementation scheme is as follows. On the hardware side, a control node and at least one computing node are interconnected through a network, and a GPU is introduced into each computing node as a computing resource, so that the hardware capability of the cluster system is greatly improved and the efficiency of model training improves with it. On the software side, the Slurm framework is optimized, and a client, a super management platform, and other components are introduced, so that the cluster system is more convenient to use.
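The hardware arrangement in the abstract (one control node, one or more GPU-equipped computing nodes) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class names `ControlNode` and `ComputeNode`, the `gpus_free` field, and the first-fit placement rule are hypothetical and are not taken from the patent or from Slurm.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ComputeNode:
    """Hypothetical compute node; gpus_free counts its idle GPUs."""
    name: str
    gpus_free: int


@dataclass
class ControlNode:
    """Hypothetical control node that places training jobs on compute nodes."""
    nodes: List[ComputeNode]
    placements: Dict[str, str] = field(default_factory=dict)

    def submit(self, job_id: str, gpus_needed: int) -> Optional[str]:
        # First-fit placement (an assumed policy): take the first node
        # that still has enough idle GPUs for the job.
        for node in self.nodes:
            if node.gpus_free >= gpus_needed:
                node.gpus_free -= gpus_needed
                self.placements[job_id] = node.name
                return node.name
        return None  # no node can host the job right now


cluster = ControlNode([ComputeNode("node-a", 2), ComputeNode("node-b", 8)])
first = cluster.submit("train-1", 4)   # needs 4 GPUs
second = cluster.submit("train-2", 2)  # needs 2 GPUs
third = cluster.submit("train-3", 8)   # needs 8 GPUs
print(first, second, third)
```

In this sketch a job that no node can currently host is simply rejected; a real scheduler would more likely queue it until GPUs are released.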

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of artificial intelligence (AI), and in particular to a model training method, device and cluster system.

Background

[0002] With the continuous development of artificial intelligence, the demand for training AI models is also increasing. In AI model training, when the training data set is small, deep learning performs poorly and can even fall behind relatively simple machine learning methods. As the data set grows, however, AI models trained with deep learning begin to outperform those trained with other machine learning methods.

[0003] In a common deep learning process, a high-performance computing (HPC) cluster is used to train on a large-scale data set to obtain an AI model. The overall structure of HPC can be divided into the following main parts: external network, master node, compu...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L29/08, G06N3/08
CPC: G06N3/08, H04L67/10
Inventor: 骆宝童丁瑞全张恒华胡在斌黄凯文李志
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD