TY - JOUR
T1 - Empirical centroid fictitious play
T2 - An approach for distributed learning in multi-agent games
AU - Swenson, Brian
AU - Kar, Soummya
AU - Xavier, Joao
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2015/8/1
Y1 - 2015/8/1
N2 - The paper is concerned with distributed learning in large-scale games. The well-known fictitious play (FP) algorithm is addressed, which, despite theoretical convergence results, might be impractical to implement in large-scale settings due to intense computation and communication requirements. An adaptation of the FP algorithm, designated as the empirical centroid fictitious play (ECFP), is presented. In ECFP players respond to the centroid of all players' actions rather than track and respond to the individual actions of every player. Convergence of the ECFP algorithm in terms of average empirical frequency (a notion made precise in the paper) to a subset of the Nash equilibria is proven under the assumption that the game is a potential game with permutation invariant potential function. A more general formulation of ECFP is then given (which subsumes FP as a special case) and convergence results are given for the class of potential games. Furthermore, a distributed formulation of the ECFP algorithm is presented, in which, players endowed with a (possibly sparse) preassigned communication graph, engage in local, non-strategic information exchange to eventually agree on a common equilibrium. Convergence results are proven for the distributed ECFP algorithm.
AB - The paper is concerned with distributed learning in large-scale games. The well-known fictitious play (FP) algorithm is addressed, which, despite theoretical convergence results, might be impractical to implement in large-scale settings due to intense computation and communication requirements. An adaptation of the FP algorithm, designated as the empirical centroid fictitious play (ECFP), is presented. In ECFP players respond to the centroid of all players' actions rather than track and respond to the individual actions of every player. Convergence of the ECFP algorithm in terms of average empirical frequency (a notion made precise in the paper) to a subset of the Nash equilibria is proven under the assumption that the game is a potential game with permutation invariant potential function. A more general formulation of ECFP is then given (which subsumes FP as a special case) and convergence results are given for the class of potential games. Furthermore, a distributed formulation of the ECFP algorithm is presented, in which, players endowed with a (possibly sparse) preassigned communication graph, engage in local, non-strategic information exchange to eventually agree on a common equilibrium. Convergence results are proven for the distributed ECFP algorithm.
UR - http://www.scopus.com/inward/record.url?scp=84934326126&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84934326126&partnerID=8YFLogxK
U2 - 10.1109/TSP.2015.2434327
DO - 10.1109/TSP.2015.2434327
M3 - Article
AN - SCOPUS:84934326126
SN - 1053-587X
VL - 63
SP - 3888
EP - 3901
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
IS - 15
M1 - 7109924
ER -