TY - JOUR
T1 - Distributed Computation of Equilibria in Misspecified Convex Stochastic Nash Games
AU - Jiang, Hao
AU - Shanbhag, Uday V.
AU - Meyn, Sean P.
N1 - Funding Information:
Manuscript received October 5, 2015; revised June 2, 2016, June 5, 2016, December 6, 2016, and May 16, 2017; accepted May 25, 2017. Date of publication August 21, 2017; date of current version January 26, 2018. The work of H. Jiang and U. V. Shanbhag was supported in part by NSF CAREER CMMI 1246887 and NSF CMMI 1400217 awarded to U. V. Shanbhag. This paper was presented in part at the IEEE Conference on Decision and Control, Orlando, FL, USA, December 2011. Recommended by Associate Editor A. Garcia. (Corresponding author: Uday V. Shanbhag.) H. Jiang is with the Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA (e-mail: jh_double12@hotmail.com).
Publisher Copyright:
© 1963-2012 IEEE.
PY - 2018/2
Y1 - 2018/2
N2 - The distributed computation of Nash equilibria is assuming growing relevance in engineering where such problems emerge in the context of distributed control. Accordingly, we present schemes for computing equilibria of two classes of static stochastic convex games complicated by a parametric misspecification, a natural concern in the control of large-scale networked engineered system. In both schemes, players learn the equilibrium strategy while resolving the misspecification: 1) Monotone stochastic Nash games: We present a set of coupled stochastic approximation schemes distributed across agents in which the first scheme updates each agent's strategy via a projected (stochastic) gradient step, whereas the second scheme updates every agent's belief regarding its misspecified parameter using an independently specified learning problem. We proceed to show that the produced sequences converge in an almost sure sense to the true equilibrium strategy and the true parameter, respectively. Surprisingly, convergence in the equilibrium strategy achieves the optimal rate of convergence in a mean-squared sense with a quantifiable degradation in the rate constant; 2) Stochastic Nash-Cournot games with unobservable aggregate output: We refine 1) to a Cournot setting where we assume that the tuple of strategies is unobservable while payoff functions and strategy sets are public knowledge through a common knowledge assumption. By utilizing observations of noise-corrupted prices, iterative fixed-point schemes are developed, allowing for simultaneously learning the equilibrium strategies and the misspecified parameter in an almost sure sense.
AB - The distributed computation of Nash equilibria is assuming growing relevance in engineering where such problems emerge in the context of distributed control. Accordingly, we present schemes for computing equilibria of two classes of static stochastic convex games complicated by a parametric misspecification, a natural concern in the control of large-scale networked engineered system. In both schemes, players learn the equilibrium strategy while resolving the misspecification: 1) Monotone stochastic Nash games: We present a set of coupled stochastic approximation schemes distributed across agents in which the first scheme updates each agent's strategy via a projected (stochastic) gradient step, whereas the second scheme updates every agent's belief regarding its misspecified parameter using an independently specified learning problem. We proceed to show that the produced sequences converge in an almost sure sense to the true equilibrium strategy and the true parameter, respectively. Surprisingly, convergence in the equilibrium strategy achieves the optimal rate of convergence in a mean-squared sense with a quantifiable degradation in the rate constant; 2) Stochastic Nash-Cournot games with unobservable aggregate output: We refine 1) to a Cournot setting where we assume that the tuple of strategies is unobservable while payoff functions and strategy sets are public knowledge through a common knowledge assumption. By utilizing observations of noise-corrupted prices, iterative fixed-point schemes are developed, allowing for simultaneously learning the equilibrium strategies and the misspecified parameter in an almost sure sense.
UR - http://www.scopus.com/inward/record.url?scp=85028517323&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85028517323&partnerID=8YFLogxK
U2 - 10.1109/TAC.2017.2742061
DO - 10.1109/TAC.2017.2742061
M3 - Article
AN - SCOPUS:85028517323
SN - 0018-9286
VL - 63
SP - 360
EP - 371
JO - IEEE Transactions on Automatic Control
JF - IEEE Transactions on Automatic Control
IS - 2
M1 - 8013743
ER -