I. Introduction
The development of self-driving cars has increased steadily, as they have the potential to radically change the future of mobility. Deriving safe driving policies remains a key challenge in achieving deployable autonomous driving systems. The formulation of driving strategies has been studied by three schools of work: rule-based methods, imitation-based learning, and reinforcement learning.