TY - GEN
T1 - Control of parametric games
AU - Fiscko, Carmel
AU - Swenson, Brian
AU - Kar, Soummya
AU - Sinopoli, Bruno
N1 - Publisher Copyright:
© 2019 EUCA.
PY - 2019/6
Y1 - 2019/6
N2 - This work studies a class of multi-player games in which the players' decisions can be influenced by a superplayer. We define a game with n players and parameterized utilities u (., a) where the superplayer controls the value of a. The regular players follow Markovian repeated play dynamics that encompass a wide class of learning dynamics including strict best response. The objective of the superplayer is to control a dynamically to achieve a desired outcome in the game-play, which in this work we define as the realization of target joint strategies. We introduce the class of parametric games and reformulate the superplayer control problem as a Markov decision process (MDP). Reachability criteria are developed, allowing the superplayer to determine which game-play may occur with positive probability. With a reachable goal joint strategy, a cost-optimal policy can be computed using standard tools in dynamic programming. A sample MDP reward function is presented such that a reachable target joint strategy is guaranteed to be played almost surely. Finally, an application in a cyber-security context is provided to illustrate the use of the proposed methodology and its effectiveness.
AB - This work studies a class of multi-player games in which the players' decisions can be influenced by a superplayer. We define a game with n players and parameterized utilities u (., a) where the superplayer controls the value of a. The regular players follow Markovian repeated play dynamics that encompass a wide class of learning dynamics including strict best response. The objective of the superplayer is to control a dynamically to achieve a desired outcome in the game-play, which in this work we define as the realization of target joint strategies. We introduce the class of parametric games and reformulate the superplayer control problem as a Markov decision process (MDP). Reachability criteria are developed, allowing the superplayer to determine which game-play may occur with positive probability. With a reachable goal joint strategy, a cost-optimal policy can be computed using standard tools in dynamic programming. A sample MDP reward function is presented such that a reachable target joint strategy is guaranteed to be played almost surely. Finally, an application in a cyber-security context is provided to illustrate the use of the proposed methodology and its effectiveness.
UR - http://www.scopus.com/inward/record.url?scp=85071524954&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85071524954&partnerID=8YFLogxK
U2 - 10.23919/ECC.2019.8796274
DO - 10.23919/ECC.2019.8796274
M3 - Conference contribution
AN - SCOPUS:85071524954
T3 - 2019 18th European Control Conference, ECC 2019
SP - 1036
EP - 1042
BT - 2019 18th European Control Conference, ECC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th European Control Conference, ECC 2019
Y2 - 25 June 2019 through 28 June 2019
ER -