A simple and efficient parallel FFT algorithm using the BSP model

M. Alves de Inda, R.H. Bisseling

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

We present a new parallel radix-4 FFT algorithm based on the BSP model. Our parallel algorithm uses the group-cyclic distribution family, which makes it simple to understand and easy to implement. We show how to reduce the communication cost of the algorithm by a factor of 3, in the case that the input/output vector is in the cyclic distribution. We also show how to reduce computation time on computers with a cache-based architecture. We present performance results on a Cray T3E with up to 64 processors, obtaining reasonable efficiency levels for local problem sizes as small as 256 and very good efficiency levels for local sizes larger than 2048.
Original languageEnglish
Pages (from-to)1847-1878
Number of pages32
JournalParallel Computing
Volume27
Issue number14
DOIs
Publication statusPublished - 2001

Keywords

  • Wiskunde en Informatica (WIIN)
  • Mathematics
  • Wiskunde en computerwetenschappen
  • Landbouwwetenschappen
  • Wiskunde: algemeen

Fingerprint

Dive into the research topics of 'A simple and efficient parallel FFT algorithm using the BSP model'. Together they form a unique fingerprint.

Cite this