TY - GEN
T1 - A Variable Sample-Size Stochastic Quasi-Newton Method for Smooth and Nonsmooth Stochastic Convex Optimization
AU - Jalilzadeh, Afrooz
AU - Nedic, Angelia
AU - Shanbhag, Uday V.
AU - Yousefian, Farzad
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - In the last several years, stochastic quasi-Newton (SQN) methods have assumed increasing relevance in solving a breadth of machine learning and stochastic optimization problems. Inspired by recently presented SQN schemes [1]-[3], we consider merely convex and possibly nonsmooth stochastic programs and utilize increasing sample-sizes to allow for variance reduction. To this end, we make the following contributions. (i) A regularized and smoothed variable sample-size BFGS update (rsL-BFGS) is developed that can accommodate nonsmooth convex objectives by utilizing iterative regularization and smoothing; (ii) A regularized variable sample-size SQN (rVS-SQN) is developed that admits a rate and oracle complexity bound of {O}(1/k^{1-\varepsilon}) and {O}(\epsilon^{-(3+\varepsilon)/(1-\varepsilon)}), respectively (where \varepsilon, \varepsilon > 0 are arbitrary scalars), improving on past rate statements; (iii) By leveraging (rsL-BFGS), we develop rate statements for the function of the ergodic average through a regularized and smoothed VS-SQN scheme that can accommodate nonsmooth (but smoothable) functions with the convergence rate {O}(1/k^{1/3-2\varepsilon}).
AB - In the last several years, stochastic quasi-Newton (SQN) methods have assumed increasing relevance in solving a breadth of machine learning and stochastic optimization problems. Inspired by recently presented SQN schemes [1]-[3], we consider merely convex and possibly nonsmooth stochastic programs and utilize increasing sample-sizes to allow for variance reduction. To this end, we make the following contributions. (i) A regularized and smoothed variable sample-size BFGS update (rsL-BFGS) is developed that can accommodate nonsmooth convex objectives by utilizing iterative regularization and smoothing; (ii) A regularized variable sample-size SQN (rVS-SQN) is developed that admits a rate and oracle complexity bound of {O}(1/k^{1-\varepsilon}) and {O}(\epsilon^{-(3+\varepsilon)/(1-\varepsilon)}), respectively (where \varepsilon, \varepsilon > 0 are arbitrary scalars), improving on past rate statements; (iii) By leveraging (rsL-BFGS), we develop rate statements for the function of the ergodic average through a regularized and smoothed VS-SQN scheme that can accommodate nonsmooth (but smoothable) functions with the convergence rate {O}(1/k^{1/3-2\varepsilon}).
UR - http://www.scopus.com/inward/record.url?scp=85062176904&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062176904&partnerID=8YFLogxK
U2 - 10.1109/CDC.2018.8619209
DO - 10.1109/CDC.2018.8619209
M3 - Conference contribution
AN - SCOPUS:85062176904
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 4097
EP - 4102
BT - 2018 IEEE Conference on Decision and Control, CDC 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 57th IEEE Conference on Decision and Control, CDC 2018
Y2 - 17 December 2018 through 19 December 2018
ER -