Object Detection Hardware Accelerator and Acceleration Method

A hardware accelerator and target detection technology, applied in the field of data processing, can solve problems such as large power consumption and delay, limited bus bandwidth, and reduced accelerator work efficiency

Active Publication Date: 2021-04-20
JIHUA LAB
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the existing neural network accelerators use DRAM to read data in the process of data transfer, which needs to be transmitted through the bus. The bandwidth of the bus is limited, and the power consumption and delay of reading a large amount of data from DRAM are very large. Greatly reduces the working efficiency of the accelerator

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Object Detection Hardware Accelerator and Acceleration Method
  • Object Detection Hardware Accelerator and Acceleration Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0028] It should be noted that all directional indications (such as up, down, left, right, front, back...) in the embodiments of the present invention are only used to explain the relationship between the components in a certain posture (as shown in the figure). Relative positional relationship, movement conditions, etc., if the specific posture changes, the directional indication will also change accordingly.

[0029] It should also be noted that whe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to the field of data processing, and provides a target detection hardware accelerator and an acceleration method. The accelerator includes a convolution operator integrated with a multiplier and an adder, and the convolution operator receives volumes stored in block random access memory in advance. Product weight data and feature maps, the multiplier performs multiplication operations on the convolution weight data and feature maps to obtain multiplication result data and convolution offset data, and the adder performs multiplication result data and convolution offset data The data is shifted, added and summed to obtain the multiplication and accumulation result data; the pooling operation unit is used to receive the multiplication and accumulation result data and perform a pooling operation, and output the pooling result data; the RBR operation unit is used to perform the multiplication and accumulation result data; The pooling result data is subjected to batch normalization and quantization to obtain target feature data and stored in the block random access memory. The invention can reduce the time and power consumption required by the accelerator for data transfer, and improve the working efficiency of the accelerator.

Description

technical field [0001] The invention relates to the field of data processing, in particular to an object detection hardware accelerator and an acceleration method. Background technique [0002] With the support of big data analysis and large-scale high-speed computing platform, neural network technology has been developed sufficiently. On the one hand, neural network algorithms continue to improve. After CNN (Convolutional Neural Networks, convolutional neural network), new network models such as RNN (Recurrent Neural Network, cyclic neural network) and GAN (Generative Adversarial Networks, generative confrontation network) emerge in endlessly. ; On the other hand, due to the outstanding performance of neural network algorithms in the fields of image recognition, speech analysis and natural language processing, they are widely used in embedded systems. An embedded system is a dedicated system on a chip, which has strict requirements on system performance and power consumpti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F7/523G06F7/50G06N3/063
CPCG06F7/50G06F7/523G06N3/063
Inventor 陈迟晓张锦山焦博张立华
Owner JIHUA LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products