I. Introduction
Traffic data prediction is a fundamental task in the field of spatiotemporal data mining and has many practical applications, such as urban management and route planning [1], [2]. Accurate prediction of traffic data can significantly improve the service quality of these applications. Traffic data are often presented as spatiotemporal graphs, and models need to capture temporal relationship (TR) and spatial relationship (SR) to achieve accurate predictions [3]. However, there is still a lack of effective methods to model spatiotemporal correlations and heterogeneity.