TY - JOUR
T1 - Distributed Dual Coordinate Ascent in General Tree Networks and Communication Network Effect on Synchronous Machine Learning
AU - Cho, Myung
AU - Lai, Lifeng
AU - Xu, Weiyu
N1 - Funding Information:
Manuscript received July 17, 2020; revised November 30, 2020; accepted January 31, 2021. Date of publication May 10, 2021; date of current version June 17, 2021. The work of Lifeng Lai was supported in part by the National Science Foundation (NSF) under Grant CCF-1717943 and Grant ECCS-2000415. The work of Weiyu Xu was supported in part by the NSF under Grant ECCS-2000425. (Corresponding author: Myung Cho.) Myung Cho is with the Department of Electrical and Computer Engineering, Penn State Behrend, Erie, PA 16563 USA (e-mail: mxc6077@psu.edu).
Publisher Copyright:
© 1983-2012 IEEE.
PY - 2021/7
Y1 - 2021/7
N2 - Due to the big size of data and limited data storage volume of a single computer or a single server, data are often stored in a distributed manner. Thus, performing large-scale machine learning operations with the distributed datasets through communication networks is often required. In this paper, we study the convergence rate of the distributed dual coordinate ascent for distributed machine learning problems in a general tree-structured network. Since a tree network model can be understood as the generalization of a star network, our algorithm can be thought of as the generalization of the distributed dual coordinate ascent in a star network. We provide the convergence rate of the distributed dual coordinate ascent over a general tree network in a recursive manner and analyze the network effect on the convergence rate. Secondly, by considering network communication delays, we optimize the distributed dual coordinate ascent algorithm to maximize its convergence speed. From our analytical result, we can choose the optimal number of local iterations depending on the communication delay severity to achieve the fastest convergence speed. In numerical experiments, we consider machine learning scenarios over communication networks, where local workers cannot directly reach to a central node due to constraints in communication, and demonstrate that the usability of our distributed dual coordinate ascent algorithm in tree networks.
AB - Due to the big size of data and limited data storage volume of a single computer or a single server, data are often stored in a distributed manner. Thus, performing large-scale machine learning operations with the distributed datasets through communication networks is often required. In this paper, we study the convergence rate of the distributed dual coordinate ascent for distributed machine learning problems in a general tree-structured network. Since a tree network model can be understood as the generalization of a star network, our algorithm can be thought of as the generalization of the distributed dual coordinate ascent in a star network. We provide the convergence rate of the distributed dual coordinate ascent over a general tree network in a recursive manner and analyze the network effect on the convergence rate. Secondly, by considering network communication delays, we optimize the distributed dual coordinate ascent algorithm to maximize its convergence speed. From our analytical result, we can choose the optimal number of local iterations depending on the communication delay severity to achieve the fastest convergence speed. In numerical experiments, we consider machine learning scenarios over communication networks, where local workers cannot directly reach to a central node due to constraints in communication, and demonstrate that the usability of our distributed dual coordinate ascent algorithm in tree networks.
UR - http://www.scopus.com/inward/record.url?scp=85105890581&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85105890581&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2021.3078495
DO - 10.1109/JSAC.2021.3078495
M3 - Article
AN - SCOPUS:85105890581
SN - 0733-8716
VL - 39
SP - 2105
EP - 2119
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
IS - 7
M1 - 9427236
ER -