Abstract
Determining an appropriate data distribution among different memories is critical to the performance of data-parallel programs on distributedmemory machines. By analyzing the computational load of data arrays and the communication complexity of various data movement operations in a program, this paper suggests a first-order cost model for determining a small set of appropriate data distribution patterns among many possible choices. A new data distribution specification, namely CYBLOCK, is proposed to enhance the expressiveness of data distribution specifications being proposed in High Performance Fortran. Cost analysis of two case studies: a linear system solver and a Purdue-set benchmark loop, are used to illustrate the proposed evaluation method. The model correctly predicts the relative performance of the case studies when implemented with various regular data distributions on an nCUBE-2 multicomputer.
| Original language | English |
|---|---|
| Article number | 4134204 |
| Pages (from-to) | 175-183 |
| Number of pages | 9 |
| Journal | Proceedings of the International Conference on Parallel Processing |
| Volume | 2 |
| DOIs | |
| Publication status | Published - 1993 |
| Externally published | Yes |
| Event | 1993 International Conference on Parallel Processing, ICPP 1993 - Syracuse, United States Duration: 16 Aug 1993 → 20 Aug 1993 |
Bibliographical note
Publisher Copyright:© 1993 IEEE.