A similarity-aware MOE-based method for optimizing tensor programs across diverse GPUs | IEEE Conference Publication | IEEE Xplore