Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and apparatus for a hierarchical synchronization barrier in a multi-node system

a synchronization barrier and hierarchical technology, applied in the field of computer architecture, can solve the problems of long synchronization time, energy-inefficient system, power consumption,

Inactive Publication Date: 2012-07-12
IBM CORP
View PDF5 Cites 87 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0012]An apparatus for a hierarchical barrier synchronization of cores and nodes on a multiprocessor system, in one aspect, may include a plurality of cores arranged in an integrated circuit. A register may be operable to store input bit signals received from each of said plurality of cores. A control logic circuit may be operable to electrically tie and perform a Boolean “AND” function on said stored input bit signals to determine whether said plurality of cores all achieved barrier. The control logic circuit may be further operable to determine whether only on-ch...

Problems solved by technology

Barrier synchronization for on-chip and off-chip synchronization by using a BSR register introduces overhead to at least one processor, which needs to poll the BSR register until all processors reached the barrier.
To poll a register, a number of instructions has to be to executed to determine that synchronization is achieved and communicate this status on-chip and / or off-chip, resulting in a power consuming, energy-inefficient system and causing long latency for synchronization.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for a hierarchical synchronization barrier in a multi-node system
  • Method and apparatus for a hierarchical synchronization barrier in a multi-node system
  • Method and apparatus for a hierarchical synchronization barrier in a multi-node system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020]The present disclosure in one aspect describes a low-latency, low-overhead synchronization method which uses a symmetric software implementation for synchronizing nodes in a multiprocessor system. In the present disclosure, a node or chip refers to entity that is plugged into a socket and contains among other things computing cores. On a node are cores, or processor cores. On some architecture a core has one hardware thread; on other architecture the core may be an SMT (Symmetric Multi-Threaded) core and contain 2, 4, or possibly more hardware threads. Software threads get executed on top of the hardware thread. For the purposes of this document, thread is used to refer to hardware thread. In the event software thread is meant, the entire expression is written. A multiprocessor chip is referred to as a node. In another embodiment, a part of a chip can be referred to as a node, and a single chip can contain multiple nodes (or nodelets), which act as points of communication in t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A hierarchical barrier synchronization of cores and nodes on a multiprocessor system, in one aspect, may include providing by each of a plurality of threads on a chip, input bit signal to a respective bit in a register, in response to reaching a barrier; determining whether all of the plurality of threads reached the barrier by electrically tying bits of the register together and “AND”ing the input bit signals; determining whether only on-chip synchronization is needed or whether inter-node synchronization is needed; in response to determining that all of the plurality of threads on the chip reached the barrier, notifying the plurality of threads on the chip, if it is determined that only on-chip synchronization is needed; and after all of the plurality of threads on the chip reached the barrier, communicating the synchronization signal to outside of the chip, if it is determined that inter-node synchronization is needed.

Description

FIELD[0001]The present application generally relates to computer architecture and more particularly to a synchronization barrier in a multi-node system.BACKGROUND[0002]In a multiprocessor system, synchronization between multi-nodes is frequently needed to synchronize all nodes. Synchronization is used to indicate that calculation on all nodes has reached a certain point. When synchronizing the nodes in a multi-node system, no node can continue processing until all nodes reach the synchronization point. This approach is used when, for example, partial results are calculated on all nodes in one phase of calculation, and then all partial results have to accumulate into a global result which is needed in the following phase of calculation. It is also used when successive phases of a calculation need to proceed in lock step across all nodes.[0003]Each node in a multi-node system can have one or more processor cores. One or more processor cores can be located on the same chip (i.e., integ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/30
CPCG06F9/30087G06F9/522G06F9/3851
Inventor SALAPURA, VALENTINAWISNIEWSKI, ROBERT W.
Owner IBM CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products