Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A parallel optimization method for doacross loop

An optimization method and a technology of nested loops, which are applied to multi-program devices, program startup/switching, resource allocation, etc., can solve problems such as failure to achieve multi-core processors, and achieve the effect of eliminating resource waste

Active Publication Date: 2019-10-11
XI AN JIAOTONG UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In actual use, the above three strategies have their own advantages and disadvantages, but none of them can reach the optimal value of multi-core processor parallel acceleration effect.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A parallel optimization method for doacross loop
  • A parallel optimization method for doacross loop
  • A parallel optimization method for doacross loop

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to make the purpose, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the embodiments described here are only used to explain the basic idea of ​​the present invention, and are not used to limit the protection scope of the present invention.

[0023] The present invention provides an efficient DOACROSS cycle parallel optimization strategy (YZPA strategy for short). Before eliciting this strategy, the present invention proposes two constraint conditions about the regular advancement of the wave front:

[0024] Constraint 1: Uniformly adopt the rectangular loop block method.

[0025] Constraint 2: The inter-block dependency set IBS between loop blocks contains all unit coordinate vectors of the iterative space after block. The advancement of the wavefront needs to maintain the correct i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a parallel optimization method of DOACROSS cycle. On the basis of ensuring the correctness of data dependence, by adding a state matrix and a ready node queue, the synchronous operation is transformed into the enqueue / dequeue operation of the ready node queue. The invention realizes the dynamic task assignment of each thread to the computing area, that is, on the basis of guaranteeing the wave front advance, a certain thread will no longer be fixed to the execution of a certain calculation area, but always in time to implementation of those blocks which satisfy executable conditions, and multi-dimensional block partitioning is also realized. The theoretical execution time of the method of the present invention is shorter than the theoretical execution time of the existing three strategies and is close to the optimal execution time.

Description

technical field [0001] The invention relates to the field of parallel development of computer programs, in particular to an efficient parallel optimization strategy of DOACROSS loop. Background technique [0002] Developing parallel programs to take full advantage of multi-core or even many-core processors is still an open problem. Due to its multiple advantages, automatic parallelization technology can play an important role in enhancing the performance of multi-core processor systems. In many computationally intensive applications, typified by scientific and engineering computing applications, nested loops can consume a significant amount of runtime. According to the dependencies carried by the loop, loops can be divided into three categories: loops that do not carry cross-iteration dependencies (all iterations can be completely parallel to other iterations), such loops are called DOALL loops; and loops that carry cross-iteration dependencies Dependent loops are called D...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/48G06F9/50
Inventor 伍卫国崔元桢刘松柴晓菲聂世强邹年俊
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products