I. Introduction
Incorporating Diffusion Models (DMs) began a new era of image Super-Resolution (SR) innovation. While regressionbased methods like standard CNNs may work at low magnification ratios, they often fail to produce the high-frequency details needed for high magnification ratios. Generative models and, more recently, DMs have proven to be effective tools for tackling this issue [1]–[3]. Moreover, DMs produce reconstructions with subjectively perceived better quality compared to regression-based methods [4]. Nevertheless, closing the gap between quantitative image quality and human preferences requires finer high-frequency detail prediction to enhance the overall realism [5]. Another pressing demand is accessibility due to computationally intensive requirements of DMs [6]. For example, creating 50,000 small images (32×32) using a DM can take ca. 20 hours due to the iterative process, but a GAN can do this in under a minute on a Nvidia 2080 Ti GPU.