Performance tuning of the Helmholtz matrix-vector product kernel in the computational fluid dynamics solver Nek5000/RS for the A64FX processor

Miwako Tsuji, Misun Min, Stefan Kerkemeier, Paul Fischer, Elia Merzari, Mitsuhisa Sato

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Nek5000/RS is an open source computational fluid dynamics solver based on the spectral element method. One of the important kernel of the Nek5000/RS is called "axhelm", which computes the Helmholtz matrix-vector product. In this paper, we have evaluated the axhelm kernel on the A64FX processor for the simplest case of polynomial degree N = 7. We have optimized the kernel for the A64FX processor by using well known optimization techniques such as SIMDization, software pipelining, continuous access enhancing, and software prefetch. We also provide the performance analysis data to investigate the effects of the optimization techniques to help understanding the A64FX processor and the Fujitsu compiler.

Original languageEnglish (US)
Title of host publicationProceedings of International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPCAsia 2022
PublisherAssociation for Computing Machinery
Pages49-59
Number of pages11
ISBN (Electronic)9781450395649
DOIs
StatePublished - Jan 11 2022
Event2022 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPCAsia 2022 - Virtual, Online, Japan
Duration: Jan 11 2022Jan 14 2022

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2022 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPCAsia 2022
Country/TerritoryJapan
CityVirtual, Online
Period1/11/221/14/22

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Performance tuning of the Helmholtz matrix-vector product kernel in the computational fluid dynamics solver Nek5000/RS for the A64FX processor'. Together they form a unique fingerprint.

Cite this