TY - GEN
T1 - Scalable parallelization strategies to accelerate NuFFT data translation on multicores
AU - Zhang, Yuanrui
AU - Liu, Jun
AU - Kultursay, Emre
AU - Kandemir, Mahmut
AU - Pitsianis, Nikos
AU - Sun, Xiaobai
N1 - Funding Information:
This research is supported in part by NSF grants CNS 0720645, CCF 0811687, OCI 821527, CCF 0702519, CNS 0720749, and a grant from Microsoft.
PY - 2010
Y1 - 2010
N2 - The non-uniform FFT (NuFFT) has been widely used in many applications. In this paper, we propose two new scalable parallelization strategies to accelerate the data translation step of the NuFFT on multicore machines. Both schemes employ geometric tiling and binning to exploit data locality, and use recursive partitioning and scheduling with dynamic task allocation to achieve load balancing. The experimental results collected from a commercial multicore machine show that, with the help of our parallelization strategies, the data translation step is no longer the bottleneck in the NuFFT computation, even for large data set sizes, with any input sample distribution.
AB - The non-uniform FFT (NuFFT) has been widely used in many applications. In this paper, we propose two new scalable parallelization strategies to accelerate the data translation step of the NuFFT on multicore machines. Both schemes employ geometric tiling and binning to exploit data locality, and use recursive partitioning and scheduling with dynamic task allocation to achieve load balancing. The experimental results collected from a commercial multicore machine show that, with the help of our parallelization strategies, the data translation step is no longer the bottleneck in the NuFFT computation, even for large data set sizes, with any input sample distribution.
UR - https://www.scopus.com/pages/publications/78249265973
UR - https://www.scopus.com/pages/publications/78249265973#tab=citedBy
U2 - 10.1007/978-3-642-15291-7_13
DO - 10.1007/978-3-642-15291-7_13
M3 - Conference contribution
AN - SCOPUS:78249265973
SN - 3642152902
SN - 9783642152900
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 125
EP - 136
BT - Euro-Par 2010 Parallel Processing - 16th International Euro-Par Conference, Proceedings
T2 - 16th International Euro-Par Conference on Parallel Processing, Euro-Par 2010
Y2 - 31 August 2010 through 3 September 2010
ER -