Barrier synchronization method and device

A fence synchronization and fence technology, applied in the field of communication, can solve problems such as chip processing performance degradation

Active Publication Date: 2015-09-23
HUAWEI TECH CO LTD +1
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a barrier synchronization method and device, which solves the problem of the decline in processing performance of chips with multi-core or many-core processors caused by access bottlenecks when the number of threads increases

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Barrier synchronization method and device
  • Barrier synchronization method and device
  • Barrier synchronization method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0095] Embodiment 1 of the present invention provides a barrier synchronization method, which is applied to a chip with a multi-core or many-core processor, and at least two barrier synchronization devices are provided on the chip, such as figure 1 As shown, the method may include:

[0096] 101. The first processor core determines that a currently processed thread program is executed to a predetermined fence synchronization point. Wherein, the first processor core is any one of all processor cores included in the chip.

[0097] 102. The first processor core determines a target barrier synchronization device according to a barrier identifier corresponding to a predetermined barrier synchronization point.

[0098] Wherein, the fence synchronization method of the embodiment of the present invention is applied to a chip with a multi-core or many-core processor, and at least two fence synchronization devices are arranged on the chip, when any processor core in all processor cores,...

Embodiment 2

[0104] Embodiment 2 of the present invention provides a barrier synchronization method, which is applied to a chip with a multi-core or many-core processor, and at least two barrier synchronization devices are arranged on the chip, such as figure 2 As shown, the method may include:

[0105] 201. The target barrier synchronization device receives a barrier synchronization message sent by the first processor core.

[0106] Wherein, the barrier synchronization message is sent by the first processor core when it is determined that the currently processed thread program executes to a predetermined barrier synchronization point, the first processor core is any one of all processor cores contained in the chip, and the barrier synchronization message contains the fence identifier corresponding to the fence synchronization point and the number of thread programs participating in the synchronization, and the target fence synchronization device is a fence synchronization device for proc...

Embodiment 3

[0113] Embodiment 3 of the present invention provides a barrier synchronization method, which is applied to a chip with a multi-core or many-core processor. At least two barrier synchronization devices are provided on the chip, and the at least two barrier synchronization devices are distributed in the network on chip. different locations, for example image 3 The shown embodiment of the present invention provides a schematic structural diagram of a chip provided with four barrier synchronization devices and has a multi-core or many-core processor. Such as Figure 4 As shown, the barrier synchronization method provided in the embodiment of the present invention may include:

[0114] 301. The first processor core determines that a currently processed thread program is executed to a predetermined fence synchronization point.

[0115] Wherein, the first processor core is any one of all processor cores included in the chip.

[0116] 302. The first processor core determines a ta...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a barrier synchronization method and device, and relates to the field of communications, and solves the problem that under the condition that the thread number is increased, the chip processing performance of a multi-core or many-core processor is reduced due to access bottlenecks. The concrete scheme of the barrier synchronization method is that a first processor core determines that a present processed thread program is executed to a predetermined barrier synchronization point; the first processor core is any one of the processor cores included by the chip; a target barrier synchronization device is determined according to the barrier identification corresponding to the predetermined barrier synchronization point; a barrier synchronization message is sent to the target barrier synchronization device; and the barrier synchronization message comprises the barrier identification and the number of the thread programs which participate in the synchronization process. The barrier synchronization method and device are used for the barrier synchronization process.

Description

technical field [0001] The present invention relates to the communication field, in particular to a fence synchronization method and equipment. Background technique [0002] Traditional single-core processors usually increase the main frequency of the processor by using superscalar and pipeline processing technology to achieve the purpose of improving processor performance, but the increase of main frequency will lead to increased power consumption of the processor, and will lead to The heat dissipation of the processor is not good. Moreover, with the continuous development of semiconductor technology, the number of transistors that can be integrated on the chip is gradually increasing. In order to improve the performance of the processor and reduce the power consumption of the processor at the same time, and to make the processor have good heat dissipation, the architecture designer proposes Multi-core or many-core processors with thread-level coarse-grained parallelism. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/52
Inventor 徐卫志
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products