Communication balancing in parallel sparse matrix-vector multiplication

R.H. Bisseling, W. Meesen

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Given a partitioning of a sparse matrix for parallel matrix–vector multiplication, which determines the total communication volume, we try to find a suitable vector partitioning that balances the communication load among the processors. We present a new lower bound for the maximum communication cost per processor, an optimal algorithm that attains this bound for the special case where each matrix column is owned by at most two processors, and a new heuristic algorithm for the general case that often attains the lower bound. This heuristic algorithm tries to avoid raising the current lower bound when assigning vector components to processors. Experimental results show that the new algorithm often improves upon the heuristic algorithm that is currently implemented in the sparse matrix partitioning package Mondriaan. Trying both heuristics combined with a greedy improvement procedure solves the problem optimally in most practical cases. The vector partitioning problem is proven to be NP-complete.
Original languageEnglish
Pages (from-to)47-65
Number of pages19
JournalElectronic Transactions on Numerical Analysis
Volume21
Publication statusPublished - 2005

Keywords

  • Wiskunde en Informatica (WIIN)
  • Mathematics
  • Wiskunde en computerwetenschappen
  • Landbouwwetenschappen
  • Wiskunde: algemeen

Fingerprint

Dive into the research topics of 'Communication balancing in parallel sparse matrix-vector multiplication'. Together they form a unique fingerprint.

Cite this