Supercharge Your Innovation With Domain-Expert AI Agents!

Flow program multi-granularity division and scheduling method based on GPU/CPU hybrid architecture

A hybrid architecture and scheduling method technology, applied in the direction of multi-program device, program startup/switching, resource allocation, etc., to achieve the effect of improving access efficiency, reducing communication traffic, and reducing redundancy

Active Publication Date: 2015-10-07
HUAZHONG UNIV OF SCI & TECH
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the above defects or improvement needs of the prior art, the present invention provides a stream program multi-granularity division and scheduling method based on the GPU / CPU hybrid architecture, aiming at the massive data parallelism, task parallelism and pipeline parallelism existing under the GPU / CPU hybrid architecture and other issues, using the synchronous data flow graph as an intermediate representation, a multi-granularity division and scheduling method for data flow programs based on GPU / CPU hybrid architecture is designed and implemented. This method effectively improves the performance of data flow programs under GPU / CPU hybrid architecture. execution performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Flow program multi-granularity division and scheduling method based on GPU/CPU hybrid architecture
  • Flow program multi-granularity division and scheduling method based on GPU/CPU hybrid architecture
  • Flow program multi-granularity division and scheduling method based on GPU/CPU hybrid architecture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0027] figure 1 Shown is the flow chart of the multi-granularity division and scheduling method of the stream program based on the GPU / CPU hybrid architecture of the present invention, using the COStream data stream program as input, and combining C++ and OpenCL that can be executed in parallel on the GPU / CPU hybrid architecture platform Object code as output. The COStream data flow program ge...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a flow program multi-granularity division and scheduling method based on a GPU / CPU hybrid architecture. The method comprises: according to computation features of each task of a data flow program and data traffic among tasks, distributing each task to a suitable computing platform; splitting GPU end tasks to each GPU in a balanced manner according to parallelism of the GPU end tasks, so as to avoid that great communication overheads among GPUs affect overall execution performance of an application; by selecting suitable CPU cores, distributing CPU end tasks among the CPU cores in a balanced manner, so as to ensure load balance and improve the utilization rate of each CPU core; using a plurality of data storage structures and a plurality of methods of access types to improve memory access efficiency; and generating a target template class and reducing the number of target nodes to reduce the redundancy of object codes. According to the invention, the flow program multi-granularity division and scheduling method based on a GPU / CPU hybrid architecture achieves load balance on a CPU platform and a GPU platform separately, reduces communication overheads, improves data access efficiency, reduces the redundancy of object codes, and improves execution performance of a data flow program.

Description

technical field [0001] The invention belongs to the technical field of a system platform based on a GPU / CPU hybrid architecture, and more specifically relates to a multi-granularity division and scheduling method for stream programs based on a GPU / CPU hybrid architecture. Background technique [0002] The emergence of CPU multi-core systems has greatly improved the execution efficiency of applications in related fields, especially in computing-intensive application fields such as digital media processing and scientific computing. The continuous upgrading of the graphics processing unit (Graphic Processing Unit, hereinafter referred to as GPU) system structure has promoted the development of GPU computing, so that it can not only efficiently perform computer graphics calculations, but also has great advantages in parallel computing of large-scale data. The advantages. Compared with CPU multi-core systems, GPU has the characteristics of multiple core processing units, fine th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/48G06F9/50
Inventor 于俊清陈文斌何云峰管涛唐九飞
Owner HUAZHONG UNIV OF SCI & TECH
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More