Method for sharing stream memory of heterogeneous multi-processor

A stream-memory sharing technology for heterogeneous multiprocessors. It addresses the poor mutual adaptability of CPUs and GPUs and achieves the effect of improving processing

Status: Inactive
Publication Date: 2009-10-07
LANGCHAO ELECTRONIC INFORMATION IND CO LTD

AI Technical Summary

Problems solved by technology

A combination of multiple CPUs or GPUs is abbreviated as CPUs and GPUs. Traditionally, GPUs and CPUs are co


Embodiment

[0014] Figure 1 is a configuration diagram of a computing device for parallel processing of application data. The computing device includes a central processing unit (CPU) and a graphics processing unit (GPU). The main processing system has a main processor, which can upload and download data and settle the results; the main processor connects to the heterogeneous processors (CPUs and GPUs) through data buses. The CPU can be a multi-core CPU, and the GPU is hardware that supports graphics processing and double-precision floating-point operations. The function library stores the source code and the executable programs. The compilation layer compiles the source code. The application loads an executable program into the operation layer through an API call. The operation layer manages the execution of processing tasks by allocating computing resources. The computing platform layer is responsible for the physical identity of the computing dev...
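The layering described in [0014] can be pictured with a small Python model. This is only an illustrative sketch: the class names (`FunctionLibrary`, `CompilationLayer`, `OperationLayer`), the device labels, and the round-robin allocation policy are assumptions made for the example and do not come from the patent text. The computing platform layer is left out because its description is truncated in the source.

```python
from dataclasses import dataclass, field


@dataclass
class FunctionLibrary:
    """Hypothetical store for source code and the executables built from it."""
    sources: dict = field(default_factory=dict)
    executables: dict = field(default_factory=dict)


class CompilationLayer:
    """Hypothetical compilation layer: compiles source held in the library."""

    def compile(self, library, name):
        namespace = {}
        # Python's built-in compile/exec stand in for a real device compiler.
        exec(compile(library.sources[name], name, "exec"), namespace)
        library.executables[name] = namespace[name]
        return library.executables[name]


class OperationLayer:
    """Hypothetical operation layer: runs tasks on the devices it manages."""

    def __init__(self, devices):
        self.devices = list(devices)  # device labels are illustrative only
        self.loaded = {}

    def load(self, name, executable):
        self.loaded[name] = executable

    def execute(self, name, tasks):
        # Round-robin allocation is an assumed policy, not taken from the source.
        return [
            (self.devices[i % len(self.devices)], self.loaded[name](task))
            for i, task in enumerate(tasks)
        ]


library = FunctionLibrary(sources={"scale": "def scale(x):\n    return 2 * x\n"})
executable = CompilationLayer().compile(library, "scale")
runtime = OperationLayer(["cpu0", "gpu0"])
runtime.load("scale", executable)  # stands in for the application's API call
schedule = runtime.execute("scale", [1, 2, 3])
print(schedule)
```

Each entry of `schedule` pairs a device label with the result computed for one task, which makes the resource-allocation role of the operation layer visible.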

Abstract

The invention provides a method for sharing the stream memory of a heterogeneous multiprocessor. An application program runs on a master processor and calls an API a first time to compile one or more executable programs, from source code containing local variables, for a plurality of processor units that share stream memory. The application calls the API a second time to load the one or more executable programs onto the plurality of processor units, where they execute in parallel as a plurality of threads. During loading, local storage units are allocated from the local storage of a processor, and a first stream storage unit is allocated from the stream memory. When a processing unit executes a plurality of threads simultaneously, the threads access the values of the variables through the storage units of the stream memory. When the source program contains stream variables, the method further comprises calling the API a third time, allocating a second stream storage unit in the stream memory for the stream variables, and accessing the values of the stream variables from the plurality of processor units through the second stream storage unit.
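The three API calls in the abstract can be walked through with a toy Python simulation. It is a sketch under stated assumptions, not the patented implementation: `build_program`, `load_program`, `allocate_stream_variable`, `ProcessorUnit`, and `StreamMemory` are hypothetical names, ordinary Python threads stand in for device threads, and a plain list stands in for a stream storage unit.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass
class ProcessorUnit:
    """Hypothetical processor unit (CPU or GPU) with its own local storage."""
    name: str
    local_storage: dict = field(default_factory=dict)


class StreamMemory:
    """Hypothetical stream memory visible to every processor unit."""

    def __init__(self):
        self.units = {}

    def allocate(self, variable, values):
        # One stream storage unit per variable; all units share the same list.
        self.units[variable] = list(values)
        return self.units[variable]


def build_program(source):
    """First call (assumed name): build an executable from source text."""
    namespace = {}
    exec(compile(source, "<kernel>", "exec"), namespace)
    return namespace["kernel"]


def load_program(program, units, stream, variable, values):
    """Second call (assumed name): load the executable onto every unit.

    Loading reserves local storage on each unit and allocates the first
    stream storage unit in the shared stream memory.
    """
    for unit in units:
        unit.local_storage["program"] = program
        unit.local_storage["locals"] = {}
    return stream.allocate(variable, values)


def allocate_stream_variable(stream, variable, size):
    """Third call (assumed name): second stream storage unit for a stream variable."""
    return stream.allocate(variable, [0] * size)


def run(units, inputs, outputs):
    """Run one thread per element; each thread reads and writes stream memory."""
    def work(i):
        unit = units[i % len(units)]
        outputs[i] = unit.local_storage["program"](inputs[i])

    with ThreadPoolExecutor(max_workers=4) as pool:
        list(pool.map(work, range(len(inputs))))
    return outputs


SOURCE = "def kernel(x):\n    return x * x\n"

units = [ProcessorUnit("cpu0"), ProcessorUnit("gpu0")]
stream = StreamMemory()
program = build_program(SOURCE)                                    # call 1
inputs = load_program(program, units, stream, "in", [1, 2, 3, 4])  # call 2
outputs = allocate_stream_variable(stream, "out", len(inputs))     # call 3
result = run(units, inputs, outputs)
print(result)  # [1, 4, 9, 16]
```

Because both processor units write into the same `outputs` list, the result is visible to the master program without any copy between devices, which is the sharing behaviour the abstract describes.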

Description

Technical field

[0001] The invention relates to data-parallel computing technology, and in particular to a method for sharing stream memory when performing data-parallel computing on heterogeneous multiprocessors (CPUs and GPUs).

Background

[0002] As GPUs are gradually incorporated into high-performance parallel computing devices, they are increasingly regarded as general-purpose computing devices, and more and more applications are developed to perform data-parallel computing. Today, these applications are designed with the professional interfaces and professional GPU devices provided by suppliers, so that even if the CPU and GPU are used together in the data processing system, the CPU will not be overloaded and the application can run on different vendors' GPUs.

[0003] However, as more and more CPUs adopt multiple cores to perform data-parallel computing, more and more data processing tasks can be completed with CPUs and GPUs. Processors combined with mu...

Application Information

IPC(8): G06F9/50, G06F9/38
Inventors: 魏健, 王守昊
Owner: LANGCHAO ELECTRONIC INFORMATION IND CO LTD