TY - JOUR
T1 - AliECS
T2 - 26th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2023
AU - Mrnjavac, Teo
AU - Alexopoulos, Konstantinos
AU - Barroso, Vasco Chibante
AU - Guyot, Claire
AU - Konopka, Piotr
AU - Raduta, George
N1 - Publisher Copyright:
© The Authors, published by EDP Sciences.
PY - 2024/5/6
Y1 - 2024/5/6
N2 - The ALICE Experiment at CERN's Large Hadron Collider (LHC) underwent a major upgrade during LHC Long Shutdown 2 (2019-2021), which included a new computing system called O2 (Online-Offline). To ensure the efficient operation of the upgraded experiment and of its newly designed computing system, a reliable, high-performance, full-featured experiment control system has also been developed and deployed at LHC Point 2. The ALICE Experiment Control System (AliECS) is a microservices-oriented system based on state-of-the-art cluster management technologies that have recently emerged in the distributed and high-performance computing ecosystem. It is designed, developed and maintained as a comprehensive solution and single entry point for control of experiment data acquisition (up to 3.5 TB/s) and processing. This communication describes the AliECS architecture by providing an in-depth overview of the system's components, interfaces, features, and design elements, as well as its performance. It also reports on the experience with AliECS during the first year of ALICE Run 3 data taking with LHC beam, including integration and operational challenges, and lessons learned from real-world use.
UR - http://www.scopus.com/inward/record.url?scp=85212221670&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85212221670&partnerID=8YFLogxK
U2 - 10.1051/epjconf/202429502027
DO - 10.1051/epjconf/202429502027
M3 - Conference article
AN - SCOPUS:85212221670
SN - 2101-6275
VL - 295
JO - EPJ Web of Conferences
JF - EPJ Web of Conferences
M1 - 02027
Y2 - 8 May 2023 through 12 May 2023
ER -