Abstract
This paper discusses a tool that aids in the design, development, and understanding of parallel algorithms for high-performance computers. The tool provides a vehicle for studying memory access patterns, different cache strategies, and the effects of multiprocessors on matrix algorithms in a Fortran setting. Such a tool helps the user understand where performance problems may occur and makes it more likely that the program's performance can be improved before it is actually run on a high-performance computer. © 1990.
Original language | English |
---|---|
Pages (from-to) | 185-202 |
Number of pages | 17 |
Journal | Journal of Parallel and Distributed Computing |
Volume | 9 |
Issue number | 2 |
Publication status | Published - Jun 1990 |