TY - GEN
T1 - Architecture-Aware Currying
AU - Kandemir, Mahmut Taylan
AU - Akbulut, Gulsum Gudukbay
AU - Choi, Wonil
AU - Karakoy, Mustafa
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In near-data computing (NDC), computation is brought into data, as opposed to bringing data to computation. While there is prior work focusing on different NDC opportunities, there is no study, to our knowledge, that investigates the importance of 'neighborhood' in NDC. This paper explores the neighborhood concept in multithreaded programs that run on on-chip network-based manycore systems. We define the concept of 'neighborhood', in terms of on-chip network links, and use it to formulate the NDC problem. We propose a 'generic' compiler algorithm, called 'architecture-aware currying', that uses the neighborhood concept to implement NDC. So, a core can perform some portions of computation with the nearby data and postpone the remainder of the computation until the remaining data become nearby. It can also perform computations - with nearby data - on behalf of other cores. Our experimental evaluation shows that the proposed compiler algorithm outperforms state-of-the-art data locality optimization strategies.
AB - In near-data computing (NDC), computation is brought into data, as opposed to bringing data to computation. While there is prior work focusing on different NDC opportunities, there is no study, to our knowledge, that investigates the importance of 'neighborhood' in NDC. This paper explores the neighborhood concept in multithreaded programs that run on on-chip network-based manycore systems. We define the concept of 'neighborhood', in terms of on-chip network links, and use it to formulate the NDC problem. We propose a 'generic' compiler algorithm, called 'architecture-aware currying', that uses the neighborhood concept to implement NDC. So, a core can perform some portions of computation with the nearby data and postpone the remainder of the computation until the remaining data become nearby. It can also perform computations - with nearby data - on behalf of other cores. Our experimental evaluation shows that the proposed compiler algorithm outperforms state-of-the-art data locality optimization strategies.
UR - http://www.scopus.com/inward/record.url?scp=85182593301&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85182593301&partnerID=8YFLogxK
U2 - 10.1109/PACT58117.2023.00029
DO - 10.1109/PACT58117.2023.00029
M3 - Conference contribution
AN - SCOPUS:85182593301
T3 - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
SP - 250
EP - 264
BT - Proceedings - 2023 32nd International Conference on Parallel Architecture and Compilation Techniques, PACT 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 32nd International Conference on Parallel Architecture and Compilation Techniques, PACT 2023
Y2 - 21 October 2023 through 25 October 2023
ER -