Abstract
The Gyrokinetic Particle-in-cell (PIC) method has been successfully applied in studies of low-frequency microturbulence in magnetic fusion plasmas. While the excellent scaling of PIC codes on modern computing platforms is well established, significant challenges remain in achieving high on-chip concurrency for the new path to exascale systems. In addressing associated issues, it is necessary to deal with the basic gather-scatter operation and the relatively low computational intensity in the PIC method. Significant advancements have been achieved in optimizing gather-scatter operations in the gyrokinetic PIC method for next-generation multi-core CPU and GPU architectures. In particular, we will report on new techniques that improve locality, reduce memory conflict, and efficiently utilize shared memory on GPU's. Performance benchmarks on two high-end computing platforms - the IBM BlueGene/Q (Mira) system at the Argonne Leadership Computing Facility (ALCF) and the Cray XK6 (Titan Dev) with the latest GPU at Oak Ridge Leadership Computing Facility (OLCF) - will be presented.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2012 SC Companion |
Subtitle of host publication | High Performance Computing, Networking Storage and Analysis, SCC 2012 |
Pages | 1439-1441 |
Number of pages | 3 |
DOIs | |
State | Published - Dec 1 2012 |
Event | 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012 - Salt Lake City, UT, United States Duration: Nov 10 2012 → Nov 16 2012 |
Other
Other | 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012 |
---|---|
Country/Territory | United States |
City | Salt Lake City, UT |
Period | 11/10/12 → 11/16/12 |
All Science Journal Classification (ASJC) codes
- Computational Theory and Mathematics
- Computer Science Applications
- Software