Abstract
Increasingly available city data and advanced learning techniques have empowered people to improve the efficiency of our city functions. Among them, improving urban transportation efficiency is one of the most prominent topics. Recent studies have proposed to use reinforcement learning (RL) for traffic signal control. Different from traditional transportation approaches which rely heavily on prior knowledge, RL can learn directly from the feedback. However, without a careful model design, existing RL methods typically take a long time to converge and the learned models may fail to adapt to new scenarios. For example, a model trained well for morning traffic may not work for the afternoon traffic because the traffic flow could be reversed, resulting in very different state representation. In this paper, we propose a novel design called FRAP, which is based on the intuitive principle of phase competition in traffic signal control: when two traffic signals conflict, priority should be given to one with larger traffic movement (i.e., higher demand). Through the phase competition modeling, our model achieves invariance to symmetrical cases such as flipping and rotation in traffic flow. By conducting comprehensive experiments, we demonstrate that our model finds better solutions than existing RL methods in the complicated all-phase selection problem, converges much faster during training, and achieves superior generalizability for different road structures and traffic conditions.
| Original language | English (US) |
|---|---|
| Title of host publication | CIKM 2019 - Proceedings of the 28th ACM International Conference on Information and Knowledge Management |
| Publisher | Association for Computing Machinery |
| Pages | 1963-1972 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450369763 |
| DOIs | |
| State | Published - Nov 3 2019 |
| Event | 28th ACM International Conference on Information and Knowledge Management, CIKM 2019 - Beijing, China Duration: Nov 3 2019 → Nov 7 2019 |
Publication series
| Name | International Conference on Information and Knowledge Management, Proceedings |
|---|
Conference
| Conference | 28th ACM International Conference on Information and Knowledge Management, CIKM 2019 |
|---|---|
| Country/Territory | China |
| City | Beijing |
| Period | 11/3/19 → 11/7/19 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
All Science Journal Classification (ASJC) codes
- General Decision Sciences
- General Business, Management and Accounting
Fingerprint
Dive into the research topics of 'Learning phase competition for traffic signal control'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver