Abstract
Designing efficient parallel algorithms in a message-based parallel computer should consider both time-space tradeoffs and computation-communication tradeoffs. In order to balance these tradeoffs and achieve the optimal performance of an algorith, one has to consider various design parameters such as the number of processors required and the size of partitions. In this paper, we demonstrate that, for certain data parallel algorithms, it is possible to determine these design parameters analytically. To serve as a basis for the discussions that follow, a simple model for the NCUBE hypercube computer is introduced. Using this model, we use two examples, array summation and matrix multiplication, to illustrate how their performance can be modeled. By optimizing these expressions, one is able to determine optimal design parameters which arrive at efficient execution. Experiments on a 64-node NCUBE verified the accuracy of the analytic results and are used to further support the discussions.
| Original language | English |
|---|---|
| Pages (from-to) | 475-495 |
| Number of pages | 21 |
| Journal | International Journal of Parallel Programming |
| Volume | 17 |
| Issue number | 6 |
| DOIs | |
| Publication status | Published - Dec 1988 |
| Externally published | Yes |
Keywords
- Parallel algorithms
- algorithm design
- algorithm modeling
- hypercube multiprocessors
- performance analysis