Method and device for deep learning system of hybrid architecture

A technology of deep learning and hybrid architecture, applied in the field of deep learning system of hybrid architecture, can solve problems such as insufficient computing power, large interaction delay, and insufficient utilization of resources, so as to reduce programming complexity and time The effect of delay and high energy efficiency ratio

Active Publication Date: 2017-07-14
山东英特力数据技术有限公司
View PDF7 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing technology has the following deficiencies: 1) The general method adopts the separation of training and reasoning, which needs to maintain two sets of platform environments, and the resources cannot be fully utilized; 2) FPGA / CPLD is completely used for deep learning calculations, and the computing power is not strong enough. Not suitable for large-scale training scenarios; 3) The communication between FPGA / CPLD and the server is generally through DMA, and the time delay between data and CPU server interaction is relatively large

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for deep learning system of hybrid architecture
  • Method and device for deep learning system of hybrid architecture
  • Method and device for deep learning system of hybrid architecture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] Such as image 3 As shown, as a preferred best implementation mode, such as realizing a picture classification problem of Alexnet: the deep learning device of the hybrid architecture is used to realize the parallel operation of deep learning training and reasoning, including POWER8 processor, DDR memory, A server module composed of a network, etc.; a GPU accelerated training module GTX1080 connected to the server through a bus; a CAPI reasoning module ADM-PCIE-KU3 accelerator card connected to the server through a bus. The GPU training module is used to accelerate the training process of the deep learning model; the inference module preloads the AlexNet network model for the reasoning process of deep learning; the server module is used for the control of deep learning, data processing, Network interaction, parameter storage, etc.; the bus interface between the server module and the training module is PCI-E or Nvlink bus; the hardware interface between the server module ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and device for a deep learning system of a hybrid architecture. The method is characterized by comprising the steps in which, when a training data set is updated, a training module re-trains a deep learning network model and stores weights and offset parameters; a server side monitors changes in a parameter file while monitoring the process, encapsulates the changes into a pre-set data structure and notifies a reasoning module; the reasoning module interrupts a reasoning service, reads the weights and offset file content from the server side and updates the network model; and the server side monitors the process, meanwhile processes the input file that needs reasoning and notifies the reasoning module. The system device comprises a server module, a training module, a reasoning module and a bus interface. The training and reasoning hybrid CPU+GPU+CAPI heterogeneous deep learning system of the invention can make full use of resources, gain higher energy efficiency, and realize CAPI direct access to the server memory and real-time online iterations to update the weights and other parameters of the reasoning model.

Description

technical field [0001] The present invention relates to the technical field of circuit design and machine learning, in particular to a method and device for a deep learning system with a hybrid architecture. Background technique [0002] The rapid development of the information technology industry in the 21st century has brought huge benefits and convenience to people. Deep learning applications are divided into two parts: training and reasoning. Taking ImageNet evaluation as an example, the AlexNet model training process requires 8 million pictures with a total of 1,000 categories. After the AlexNet model extracts features and calculates the loss, then through backpropagation such as SGD To update the weight parameters, so as to continuously converge the model, and finally get the ideal network model. The reasoning process is the process of inputting a forward operation through the network model to obtain the accuracy of the final classification (generally choose Top5). T...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/00G06N3/08G06N5/04
CPCG06N3/08G06N5/04G06V10/955G06F18/214G06F18/241
Inventor 程归鹏卢飞江涛
Owner 山东英特力数据技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products