I. Introduction
The advent of Federated Learning (FL) has facilitated the training of models on a large scale, leveraging distributed data sources [1]. FL has demonstrated effectiveness in various application domains, including healthcare [2], [3], IoT [4], [5], autonomous driving [6], [7] among others. Given that FL is deployed in resource-constrained devices, often each with different constraints and priorities, it becomes imperative to address the specific training and inference requirements of the client devices [8]. For example, a drone will have energy constraints, whereas a smartwatch will be constrained in terms of area.