Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Asynchronous threading recombination method and SIMT (single instruction multiple thread) processor based on method

A technology of asynchronous threads and thread groups, which is applied in the fields of electrical digital data processing, instruments, and inter-program communication. It can solve problems such as extra pauses in SIMD arrays, and achieve the effect of eliminating task differences, high performance, and avoiding idleness.

Active Publication Date: 2017-03-08
RES INST OF SOUTHEAST UNIV IN SUZHOU
View PDF6 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On SIMT processors, synchronous operations will bring additional pauses for SIMD arrays, and there is a non-negligible performance overhead

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Asynchronous threading recombination method and SIMT (single instruction multiple thread) processor based on method
  • Asynchronous threading recombination method and SIMT (single instruction multiple thread) processor based on method
  • Asynchronous threading recombination method and SIMT (single instruction multiple thread) processor based on method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these embodiments are only for illustrating the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various aspects of the present invention Modifications in equivalent forms all fall within the scope defined by the appended claims of this application.

[0030] figure 1 is a structural block diagram of a baseline SIMT processor assumed in the embodiments of the present invention. A SIMT processor includes at least: a thread group pool, a scheduler, and one or more SIMD arrays. The thread group currently executed on the SIMT processor is stored in the thread group pool. The scheduler is responsible for selecting a ready thread group from the thread group pool at each clock cycle, and assigning it to an idle SIMD array t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an asynchronous threading recombination method and a SIMT (single instruction multiple thread) processor based on the method; through asynchronously exchanging a thread between different threading groups, a task difference in the threading group is eliminated, and thereby avoiding the idle use of a processing unit in a SIMD array and improving performance of GPU (graphics processing unit). The SIMT processor is added with a recombination buffer zone and a threading group slot. The recombination scheme is asynchronously generated by means of a recombination buffer zone, and the recombination scheme is stored by using the threading group. Compared with the existed threading combination method, the asynchronous method cannot cause the pause of the SIMD assembly line generated from the synchronous operation of the threading group, thus the SIMT processor achieves higher performance.

Description

technical field [0001] The invention relates to the field of computer architecture, and provides an asynchronous thread reorganization method and a SIMT processor based on the method. Background technique [0002] SIMT (Single Instruction Multiple Thread) is a common execution model in parallel computing. The SIMT processor contains one or more SIMD (Single Instruction Multiple Data) arrays. Each SIMD array consists of a large number of identically structured processing units. All processing elements within a SIMD array process different data but must execute the same instructions during the same clock cycle. This is a limitation of how SIMD is implemented. A parallel program executed on a SIMT processor contains a large number of threads, which are divided into several thread groups and executed alternately. At each clock cycle, the scheduler in the SIMT processor selects a thread group and schedules it to execute on an idle SIMD array. Each processing unit in the SIMD...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/48G06F9/54
CPCG06F9/4881G06F9/546G06F2209/548
Inventor 齐志孟炜时龙兴
Owner RES INST OF SOUTHEAST UNIV IN SUZHOU
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products