Abstract
This paper presents a self-optimization methodology for parallel linear algebra routines on heterogeneous systems. For each routine, a series of decisions is taken automatically in order to obtain an execution time close to the optimum (without rewriting the routine's code). Some of these decisions are: the number of processes to generate, the heterogeneous distribution of these processes over the network of processors, the logical topology of the generated processes, ... To reduce the search space of such decisions, different heuristics have been used. The experiments have been performed with a parallel LU factorization routine similar to the ScaLAPACK one, and good results have been obtained on different heterogeneous platforms.
Original language | English |
---|---|
Title of host publication | Proceedings - IEEE International Conference on Cluster Computing, ICCC|Proc. IEEE Int. Conf. Cluster Comput. ICCC |
Publisher | IEEE |
ISBN (Print) | 0780394852, 9780780394858 |
DOIs | |
Publication status | Published - 2005 |
Event | 2005 IEEE International Conference on Cluster Computing, CLUSTER - Burlington, MA Duration: 1 Jul 2005 → … http://dblp.uni-trier.de/db/conf/cluster/cluster2005.html#CuencaGGD05http://dblp.uni-trier.de/rec/bibtex/conf/cluster/CuencaGGD05.xmlhttp://dblp.uni-trier.de/rec/bibtex/conf/cluster/CuencaGGD05 |
Conference
Conference | 2005 IEEE International Conference on Cluster Computing, CLUSTER |
---|---|
City | Burlington, MA |
Period | 1/07/05 → … |
Internet address |