Abstract
The use of artificial intelligence for planning safe and efficient trajectories for autonomous vehicles in dynamic and complex urban environments has grown rapidly. In this paper, we present a novel methodology for autonomous vehicle trajectory planning using Reinforcement Learning with a heuristic based reward function. The decision boundary formulated by a Support Vector Machine (SVM) classifier is used as a heuristic within the reward function to guide the agent along a smooth and collision-free trajectory toward a predefined goal position in a roundabout scenario. This heuristic based reward function is initially coupled with a Time to Collision (TTC) warning system, which is later replaced by dual SVM classifiers for trajectory and collision prediction of moving vehicles. The Soft Actor Critic (SAC) and Deep Deterministic Policy Gradient (DDPG) algorithms are used to train the agent to navigate safely through the dynamic roundabout scenario in minimum time to a goal position. The effectiveness of the proposed methodology is evaluated through simulations and compared against a Spatio-temporal lattice trajectory planner that uses an SVM based classifier as heuristic in its A∗search algorithm.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 635-640 |
| Number of pages | 6 |
| Journal | IFAC-PapersOnLine |
| Volume | 59 |
| Issue number | 30 |
| DOIs | |
| State | Published - Oct 1 2025 |
| Event | 5th Conference on Modeling, Estimation and Control, MECC 2025 - Pittsburgh, United States Duration: Oct 5 2025 → Oct 8 2025 |
All Science Journal Classification (ASJC) codes
- Control and Systems Engineering
Fingerprint
Dive into the research topics of 'Trajectory Planning in an Urban Scenario with Heuristic Guided Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver