TY - JOUR
T1 - Efficient implementation of barrier synchronization in wormhole-routed hypercube multicomputers
AU - Xu, Hong
AU - McKinley, Philip K.
AU - Ni, Lionel M.
PY - 1992/10
Y1 - 1992/10
N2 - Efficient implementation of barrier synchronization is important to the performance of many parallel algorithms. This paper addresses barrier synchronization in wormhole-routed hypercube multicomputers. A broadcast barrier involves all nodes in a system, whereas the more general multicast barrier involves an arbitrary subset of nodes. Although performance of barrier synchronization can benefit from hardware-supported broadcast and multicast operations, many systems support only single-destination, or unicast, communication in hardware. For such systems, a novel software tree approach, the U-cube tree, is proposed as the basis of barrier synchronization. An important feature of the U-cube tree is that all messages injected into the network are guaranteed to be contention-free. Performance measurements of several barrier synchronization techniques implemented on a 64-node nCUBE-2 are given.
AB - Efficient implementation of barrier synchronization is important to the performance of many parallel algorithms. This paper addresses barrier synchronization in wormhole-routed hypercube multicomputers. A broadcast barrier involves all nodes in a system, whereas the more general multicast barrier involves an arbitrary subset of nodes. Although performance of barrier synchronization can benefit from hardware-supported broadcast and multicast operations, many systems support only single-destination, or unicast, communication in hardware. For such systems, a novel software tree approach, the U-cube tree, is proposed as the basis of barrier synchronization. An important feature of the U-cube tree is that all messages injected into the network are guaranteed to be contention-free. Performance measurements of several barrier synchronization techniques implemented on a 64-node nCUBE-2 are given.
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:A1992JW75700008
UR - https://openalex.org/W2122246186
UR - https://www.scopus.com/pages/publications/38249010795
U2 - 10.1016/0743-7315(92)90031-H
DO - 10.1016/0743-7315(92)90031-H
M3 - Journal Article
SN - 0743-7315
VL - 16
SP - 172
EP - 184
JO - Journal of Parallel and Distributed Computing
JF - Journal of Parallel and Distributed Computing
IS - 2
ER -