Heterogeneous format storage method based on cpu and gpu heterogeneous platforms

A technology of heterogeneous platforms and formats, applied in machine execution devices, concurrent instruction execution, etc., to achieve the effect of hiding extra overhead and high computing throughput

Inactive Publication Date: 2019-08-06
JILIN NORMAL UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the difference between the two storage formats of the same data, using both formats of data on a heterogeneous platform composed of CPU and GPU will bring additional overhead of format conversion, so research effectively hides the additional overhead of format conversion The method to ensure the multiplication of the transposed sparse matrix by the product of the sparse matrix and the dense vector is of great significance to obtain high computing throughput on the heterogeneous platform composed of CPU and GPU.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Heterogeneous format storage method based on cpu and gpu heterogeneous platforms
  • Heterogeneous format storage method based on cpu and gpu heterogeneous platforms

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the object, technical solution and advantages of the present invention more clearly, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific examples.

[0025] Such as figure 1 Shown: the present invention is a heterogeneous platform composed of CPU and GPU, in the case of using heterogeneous format storage for different types of processors, the implementation method of hiding the extra overhead of CPU execution format conversion, wherein the CPU execution format conversion adopts multi-thread parallelism Implementation, the specific method includes the following steps:

[0026] Step 1: Determine the scale of GPU calculation data and detect the communication bandwidth between CPU and GPU;

[0027] Wherein, the main function described in step 1 includes detecting the data scale calculated by the GPU and the communication bandwidth between the CPU and the GPU. In order to hide the additi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a heterogeneous format storage method based on a CPU (central processing unit) and GPU (graphics processing unit) heterogeneous platform. The method includes the steps: firstly, detecting scales of computational data of a GPU and the communication bandwidth between a CPU and the GPU; secondly, computing number of segments of segmenting transmission according to results detected by step 1; thirdly, executing data format conversion for data of a first segment; fourthly, uploading completely converted data to the GPU, executing computation by the GPU, and executing format conversion for data of next segment; fifthly, judging whether currently computed data are a final data segment or not, finishing algorithm execution if the currently computed data are the final data segment, and executing step 4 if the currently computed data are not the final data segment. Heterogeneous format storage methods are used on different types of processors, the method mainly researches a hidden method of overhead caused by format conversion needed when the CPU executes the GPU, so that higher computation throughput is obtained by an algorithm in an overall node, and the method has wide practical values and application prospects in computational intensive applications such as high-performance computation.

Description

technical field [0001] The invention relates to a heterogeneous platform composed of a CPU and a GPU. The architectures of the CPU and the GPU are very different. In order to give full play to the data processing capabilities of the two processors, the same data adopts different formats on the CPU and the GPU. Storage, specifically related to a heterogeneous platform composed of CPU and GPU, the same data is stored in a heterogeneous format on the CPU side and the GPU side, when the CPU performs format conversion on the GPU side data, the additional overhead caused by the conversion process is hidden The realization form belongs to the field of computer system structure. [0002] technical background [0003] At present, many high-performance computer computing nodes are composed of processors with two different architectures: multi-core CPU and GPU. Because the architectures of these two processors are quite different, the parallel algorithm of the same data structure data i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/38
Inventor 陶袁任可欣付军杜奕秋赵志文姜艳成
Owner JILIN NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products