I. Introduction
Object re-identification (Re-ID), such as person Re-ID [1], [2] and vehicle Re-ID [3], [4], focuses on searching for pedestrians or vehicles of interest captured by different cameras installed at various locations within a city, which is a significant potential for intelligent transportation systems to construct smart cities. Generally, high-accuracy object Re-ID methods [3], [4] require complex networks, which naturally require massive inference computations. Traditional knowledge distillation (KD) technologies [5], [6] are promising for efficient object Re-ID, as done in [7], [8], and [9]. However, traditional KD technologies face a huge architectural difference (e.g., different network depths) between the heavy teacher network and the lightweight student network, which is hostile to the accuracy performance of the student network.