Cloud deep neural network optimization method based on CPU and FPGA cooperative computing

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A deep neural network and optimization method technology, applied in the field of computer architecture design, can solve the problems of high data communication overhead, poor flexibility, and low cost performance, and achieve the effects of reducing power consumption, improving performance, and low price

Pending Publication Date: 2020-08-04

FUDAN UNIV

View PDF5 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The purpose of the present invention is to provide a cloud-based deep neural network optimization method based on CPU and FPGA collaborative computing to solve the problems of high energy consumption, low cost performance, poor flexibility, and data communication problems in the processing of deep learning algorithms in current large-scale server clusters. high cost etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] In order to clearly illustrate the technical features of this solution, the present invention will be described in detail below through specific implementation modes and in conjunction with the accompanying drawings. The following disclosure provides many different embodiments or examples for implementing different structures of the present invention. Descriptions of well-known components and processing techniques and processes are omitted herein to avoid unnecessarily limiting the present invention.

[0020] The present invention provides an optimized method for implementing a deep neural network on a server component comprising a host component having a CPU and a hardware acceleration component connected to the host component; the deep neural network comprising a plurality of layers. The method includes: dividing into two parts respectively suitable for the front and rear ends. The data received by the front end is in the form of a data stream, and the DDR shuttles b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of computer system structure design, and particularly relates to a cloud deep neural network optimization method based on CPU and FPGA cooperative computing. The method is divided into a front end part and a rear end part. The front end is a server taking a CPU as a core and is responsible for flow control, data receiving and partial processing; and therear end is an acceleration component taking the FPGA as a core, comprises a large-scale parallel processor array, a graphic processing unit, an application-specific integrated circuit and a PCI-E interface, and is responsible for parallel acceleration processing and the like of a key layer of the deep neural network. Firstly, the deep neural network is divided into two parts suitable for front-end processing and rear-end processing according to different levels; the front end shuttles the received data between the front end and the rear end by DDR in the form of a data stream to process eachlayer or a combined layer. The front-end flexible process control is matched with the rear-end efficient parallel structure, so that the energy efficiency ratio of neural network calculation can be greatly improved.

Description

technical field [0001] The invention belongs to the technical field of computer architecture design, and in particular relates to a cloud deep neural network optimization method based on CPU and FPGA collaborative computing. Background technique [0002] In the process of human-computer interaction where multiple interaction modes coexist, interactive modal data with different characteristics and corresponding deep learning models, such as convolutional neural networks (CNNs for short) models, etc., will be generated to construct deep learning Algorithms require long hours and large computing resources. The current mainstream computing architecture includes the following three types: GPU, FPGA, and application-specific custom chip (ASIC). [0003] GPUs were originally designed for generating computer graphics based on polygonal networks, and in fact these processors are also well suited for running neural networks and matrix multiplication calculations. But each GPU also c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F1/3287G06F9/50G06N3/08

CPCG06F1/3287G06F9/5027G06N3/08Y02D10/00

Inventor 卢暾常玉虎顾宁

Owner FUDAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cloud deep neural network optimization method based on CPU and FPGA cooperative computing

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology