Hardware architecture for accelerating Deep Q-Network algorithm and design space exploration method of hardware architecture

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A hardware architecture and algorithm technology, applied in the field of artificial intelligence, can solve problems such as optimization of FPGA hardware computing architecture, high computational complexity, and large consumption of storage resources.

Active Publication Date: 2020-09-11

HARBIN INST OF TECH

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, since the Deep Q-Network algorithm includes two types of calculations, the forward reasoning and the backpropagation of the neural network, when the neural network is large in scale, there are problems of large storage resource consumption and high computational complexity.

However, the current research and implementation of deep reinforcement learning usually uses a large-scale image processing unit GPU board server, which is difficult to apply in edge computing scenarios with limited hardware resources and power consumption, and its applicability is not high; The FPGA hardware computing architecture is not optimized, nor is the design space explored

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0036] A kind of hardware framework for accelerating Deep Q-Network algorithm, described hardware framework comprises general processor module and FPGA programmable logic module and external DDR memory, and described FPGA programmable logic module comprises AXI bus interface, Target Q module, Current Q module, Loss calculation module, mode control module, parameter storage unit and weight update unit;

[0037] The general processor module is responsible for interacting with the external environment and realizing the calculation of the reward function, and is also responsible for the maintenance of the DeepQ-Network algorithm experience pool;

[0038] The external DDR memory is mainly responsible for the storage of the Deep Q-Network algorithm experience pool.

[0039] The AXI bus interface is a general AXI bus interface structure, which is responsible for realizing the transmission and feedback of control signals and data signals between the general processor and the FPGA prog...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a hardware architecture for accelerating a Deep Q-Network algorithm and a design space exploration method of the hardware architecture. The hardware architecture comprises: a universal processor module, which is responsible for interacting with an external environment and realizing calculation of a reward function, and is also responsible for maintaining a Deep Q-Network algorithm experience pool; an external DDR memory, which is responsible for storage of an experience pool of the Deep Q-Network algorithm; an AXI bus interface, which is of a universal AXI bus interfacestructure and is responsible for achieving transmission and feedback of control signals and data signals between the universal processor and the FPGA programmable logic module; a Target Q module, which is responsible for realizing forward reasoning calculation of a Target Q network; and a Current Q module, which is responsible for realizing forward reasoning and back propagation of the Current Qnetwork. According to the invention, under the condition of highly optimizing the FPGA hardware architecture, the real-time calculation of the Deep Q-Network algorithm is realized.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence; in particular, it relates to a hardware framework for accelerating a Deep Q-Network algorithm and a design space exploration method thereof. Background technique [0002] Deep reinforcement learning is an emerging artificial intelligence technology, which evolved from the combination of traditional reinforcement learning algorithms and deep learning algorithms, and is mainly used in the fields of robot control, automatic driving, search and recommendation, etc. Representative algorithms in the field of study. However, since the Deep Q-Network algorithm includes two types of calculations, the forward reasoning and the backpropagation of the neural network, when the neural network is large in scale, there are problems of large storage resource consumption and high computational complexity. However, the current research and implementation of deep reinforcement learning usually use a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06N3/08G06N3/04G06N3/063G06N5/04

CPCG06N3/084G06N3/063G06N5/04G06N3/045Y02D10/00

Inventor 刘冰凤雷付平李喜鹏卢学翼吴瑞东王嘉晨童启凡周彦臻谢宇轩

Owner HARBIN INST OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Hardware architecture for accelerating Deep Q-Network algorithm and design space exploration method of hardware architecture

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology