Introduction
MNOs are continuously managing their MWNs, from initially planning the deployment of new Base Stations (BSs), to monitoring the existing network infrastructure and optimizing its performance. The network planning phase impacts not only the MNOs' Capital Expenditure (CapEx) but also their Operating Expense (OpEx), as the network optimization stage depends on the reliability of the network planning phase. Considering Radio Access Networks (RANs), the planning phase aims to guarantee coverage, capacity, and Quality of Service (QoS) requirements with the least amount of investment (e.g., minimizing the number of BSs). Hence, the ability to estimate coverage accurately is of paramount importance in the development of successful RAN planning [1].
During the RAN optimization phases, Drive Test (DT) data [2], geopositioned network traces [3], or crowdsourced data [4] provide accurate information to evaluate and optimize the radio coverage and the network QoS. However, during the initial RAN planning phases, PL models are the primary option to estimate and evaluate coverage.
PL models introduce a higher coverage estimation error than the other “signal level” data sources (e.g., DT measurements), but they are the only option available for coverage prediction in the RAN planning phase. Different PL models provide different levels of prediction accuracy; however, the most accurate PL models tend to be highly computationally expensive and require extensive and detailed environment data [5], which limits their practical applicability. Furthermore, the continuous advancements in ML and DNNs are providing the foundations for the development of new data-driven PL models [6], [7], where satellite-based data is also being considered as an additional input. The goal is to achieve higher prediction accuracy than conventional (empirical) PL models without introducing excessive computational complexity or requiring extensive environment data. Nonetheless, the generalization capability of ML or deep learning-based models, i.e., the ability to learn from a limited volume of data and perform similarly on out-of-distribution data, is still being investigated [8]. Moreover, PL models calibrated with DT measurements generally require a specific calibration for each propagation environment [9].
This paper studies the geographical generalization capabilities of empirical PL models, including ML/DNN-based ones, towards developing ubiquitous PL models that can be applied to multiple radio propagation environments without re-calibration or re-training. Therefore, a novel DNN-based model for PL prediction, the USARP model, is proposed; it uses satellite images within a self-supervised methodology to increase the PL prediction accuracy, enhancing geographical generalization and overcoming the single-environment usage constraints of empirical PL models. DT data from real Long Term Evolution (LTE) networks was extensively used for the development and assessment of the USARP model.
The main contributions of this paper are summarized as follows:
Geographical generalization analysis of empirical and ML/DNN-based PL models using data distributions distinct from the initial training data.
Proposal of a two-stage development process for DNN-based PL models using satellite images, namely: 1) use of self-supervised learning to learn radio environment representations from satellite images; 2) employment of the radio environment representations together with DT measurements, for PL prediction.
Proposal of a new data-driven PL prediction model, the USARP model, based on the previous two-stage procedure, along with its architecture optimization and evaluation in multiple radio environments, towards a single (multi-environment) PL model solution.
This paper is organized as follows. After the introduction, Section II overviews classical PL models and the recent work on the development of PL models using satellite images. Section III gives a brief description of the data considered in this work (satellite and DTs). Section IV explains how the useful information from satellite images is extracted for PL predictions. First, a brief background on self-supervised learning is provided. Then, the self-supervised algorithm used in the scope of this work is presented, along with the results obtained by applying it to satellite data. In Section V, the error metrics to evaluate the PL predictions are firstly defined. Then, the process leading to the development of the USARP model is presented. Section VI evaluates the PL prediction results of the USARP model, and provides a comparison to benchmark PL models. Section VII analyses the geographical generalization capability of the USARP model towards its use in multiple radio propagation environments. Finally, Section VIII presents the main conclusions and final remarks. The main notation adopted in this paper is summarized in Table 1.
Related Work
This section overviews related work on radio propagation, notably classical PL models and PL models using satellite images. First, a classification of classical radio PL models is presented, highlighting the base structure of empirical models, which are used as a reference throughout this work. Then, the most relevant work in PL models using satellite images, and typically resorting to deep learning algorithms, is presented.
A. Classical Path Loss Models
PL models for MWNs are broadly categorized into two classes: large-scale and small-scale (or fading) models. Large-scale PL models predict the mean strength of the received signal, and small-scale models characterize the rapid fluctuations occurring in a distance of a few wavelengths or on very short time intervals [10].
A UE, even with slight motion, may experience severe signal strength oscillations, as the instantaneously received signal strength results from the contribution of several Multipath Components (MPCs) with distinct directions and random phases. This behavior is known as small-scale fading and can cause signal level fluctuations of up to 30 dB over distances comparable to the signal wavelength. Small-scale PL models attempt to predict the received signal strength under these circumstances. As the UE moves away from the BS, the local average received signal strength decreases, which is what large-scale PL models predict [10].
Depending on the modeling approach, PL models can also be classified as either deterministic or empirical; while deterministic PL models are derived from electromagnetic theory (e.g., Maxwell equations), empirical models are obtained by curve fitting over extensive DT signal strength measurements. Deterministic models can be applied to various scenarios, as they take the reflection and diffraction laws into account in the PL prediction; therefore, they tend to achieve higher PL prediction accuracy than other modeling approaches. However, they have high computational complexity (e.g., require Ray Tracing (RT) or Ray Launching (RL) techniques) and usually demand precise 3-Dimensional (3D) environment information. On the contrary, empirical PL models are mathematically tractable and do not require 3D environment data, despite tending to exhibit lower PL prediction accuracy than their deterministic counterparts [11]. Moreover, as empirical models implicitly capture all environmental effects underlying the signal measurements of the respective area, they have higher accuracy in environments similar to the original measurement area [12].
For radio coverage estimation in large areas, empirical models are preferred due to their computational efficiency. The empirical PL models are mostly based on the Alpha-Beta-Gamma (ABG) or the Close-In (CI) equations. The ABG PL equation, also known as Floating Intercept (FI), is dependent on the frequency and on the distance, according to [13]:\begin{equation*} \text {PL}^{\text {ABG}}(f_{c},d_{3D}) = 10\alpha \log _{10}(d_{3D}) + \beta + 10\gamma \log _{10}(f_{c}) + \chi _\sigma ^{\text {ABG}}\tag{1}\end{equation*}
The CI PL equation is given by [13]:\begin{equation*} \text {PL}^{\text {CI}}(f_{c},d_{3D}) = \text {FSPL}(f_{c}, 1~\text {m}) + 10n \log _{10} (d_{3D}) + \chi _\sigma ^{\text {CI}}\tag{2}\end{equation*}
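For illustration, the following minimal Python sketch evaluates the deterministic part of both equations; the shadow-fading terms $\chi _\sigma ^{\text {ABG}}$ and $\chi _\sigma ^{\text {CI}}$ are omitted, and the assumed units (GHz, meters) and the 1 m free-space constant follow the usual formulation of these models rather than values taken from this paper.

```python
# Minimal sketch of the deterministic part of the ABG (Eq. 1) and CI (Eq. 2) equations.
import math

def pl_abg(fc_ghz, d3d_m, alpha, beta, gamma):
    """Alpha-Beta-Gamma (floating intercept) path loss in dB."""
    return 10 * alpha * math.log10(d3d_m) + beta + 10 * gamma * math.log10(fc_ghz)

def pl_ci(fc_ghz, d3d_m, n):
    """Close-In path loss in dB, referenced to the free-space path loss at 1 m."""
    fspl_1m = 32.4 + 20 * math.log10(fc_ghz)  # FSPL(fc, 1 m) with fc in GHz
    return fspl_1m + 10 * n * math.log10(d3d_m)

# Example: 2.6 GHz carrier, 500 m 3D distance, path loss exponent n = 3.
print(f"CI path loss: {pl_ci(2.6, 500.0, 3.0):.1f} dB")
```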
The 3GPP TR 38.901 model [14] is an example of an ABG-based PL model. Its latest version is valid for a wide range of carrier frequencies (from 0.5 GHz to 100 GHz).
Many other empirical PL models are available in the literature, from the more classical models to newer 5G-compliant models. Examples of the more classical PL models include the Okumura-Hata model [16] or the Lee model [12], while the Millimetre-Wave Based Mobile Radio Access Network for Fifth Generation Integrated Communications (mmMAGIC) [17] or the NYUSIM [18] models were developed for 5G.
B. Satellite-Based Path Loss Models
The incorporation of satellite images in radio propagation modeling has been gradually proposed in recent years, fueled by advances in the computer vision field. In [19], the authors proposed the use of satellite images to predict radio channel parameters (PLE and SF) for a given area. The data used to train the model was obtained with a deterministic PL model for an Unmanned Aerial Vehicle (UAV) scenario (with a transmitter antenna height of 300 m and a carrier frequency of 900 MHz). Accuracies of 88% and 75% in predicting the PLE and the SF, respectively, were reported. The authors proposed the use of pre-trained Convolutional Neural Networks (CNNs), despite these being pre-trained on an image dataset very distinct from satellite images, composed of objects, animals, vehicles, among others [20].
In [21], the authors proposed a CNN-based deep learning model for PL estimation using images with building footprints. The PL measurements to train the model were obtained using a deterministic PL model, considering a 900 MHz carrier frequency and an antenna height of 35 m. After training the model, a Mean Square Error (MSE) of 19.52 dB was reported between the ground truth (obtained with the deterministic PL model) and the predicted PL. The authors reported that the proposed model could adapt to modified environments; however, no results were presented to support that claim.
In [22], a deep learning model that also considers satellite images as input was proposed to estimate LTE signal metrics, namely, Reference Signal Received Power (RSRP), Reference Signal Received Quality (RSRQ), and Signal-to-Interference plus Noise Ratio (SINR). The model was developed with real signal measurements, limited to three BSs, and was composed of a CNN to process the image data and a Neural Network (NN) to process the radio propagation variables (e.g., the distance between the UE and the BS). For each training signal measurement, a satellite image (centered on the UE location) is required. The authors reported an MSE of 7.7 dB between the measured RSRP and the proposed model predictions. The authors evaluated the generalization performance of the proposed model by considering separate training and testing data, but both datasets were drawn from the same signal measurement distribution and the same locations.
In [23], the authors proposed the use of satellite images to estimate the PL with a deep learning model. The model was composed of the 3GPP Urban Macro (UMa) PL model and a correction term generated by a DNN. The DNN contains a CNN to extract features from satellite images, a NN to process radio propagation variables, and a final NN that estimates the PL given the features learned by the previous two modules. The proposed model was trained and evaluated using real LTE PL measurements from three BSs and two distinct carrier frequencies. The authors demonstrated that the use of satellite images provided a reduction of 0.8 dB in the Root Mean Square Error (RMSE) between the ground truth PL and the model predictions. Nonetheless, the limited amount of data prevented drawing conclusions about the generalization capacity of the model. In [24], the same authors expanded the results of [23] (with some model adjustments) by considering a dataset containing 125000 PL measurements from five distinct environments, allowing a further evaluation of the generalization capabilities of the model. The authors reported a prediction RMSE of around 6 dB for unseen locations. However, in the latter work, the proposed DNN model was used to estimate the RSRP and not the PL.
Overall, the proposed PL models use CNNs to extract features from images. Transfer learning has been applied in [19], but most PL models were trained end-to-end (where a model learns all the parameters of its different modules simultaneously) [21]–[24]. PL models considering only images as input have been proposed in [19], [21], [25], along with mixed approaches considering both image-based features and already known radio propagation variables [22]–[24]. Moreover, in several contributions [22]–[24], real signal measurements were used to develop the propagation models. Nevertheless, the measurements used tend to be limited in number and in type of environment, restricting the generalization analysis of the proposed models.
This work proposes a new DNN-based PL model using both satellite images and radio propagation variables as input, where the satellite images are used as a complementary data source to increase the PL prediction accuracy. The proposed model uses pretraining to enhance its geographical generalization capabilities but, instead of transfer learning, it uses a self-supervised paradigm, which has demonstrated promising results in several applications [26]. To the best of the authors' knowledge, this work constitutes the first application of self-supervised learning to data-driven PL models. Moreover, this work is supported by data from a live network, with extensive PL measurements obtained from multiple BSs in distinct radio propagation environments. Related work has generally been supported by simulated data or by limited measurements restricted to the same geographical area. Furthermore, this work analyzes the geographical generalization capability of data-driven PL models, a topic that has received limited attention in the related literature, culminating in the proposal of a PL model with enhanced geographical generalization capability.
Satellite and Drive Test Data
In this section, the data that supported the development of this work is presented, comprising a description of the satellite data used and of the procedures to obtain the PL from the DT measurements.
A. Satellite Data
In this work, the satellite images used cover an area of 194 km², encompassing a mix of urban/suburban environments, along with some areas dominated by vegetation and trees (see Fig. 1). The images were stored in Geospatial Tagged Image File Format (GeoTIFF) files, already georeferenced with the same coordinate system used in the DT data, and have a pixel resolution of 5 m
B. Drive Test Data
For this work, DT measurements from a live LTE network were used. The DT data, including coordinates, RSRP, and Physical Cell Identity (PCI), was obtained from 23 BSs operating with a carrier frequency of 2.6 GHz. Moreover, a binning approach [27] considering square areas of 10 m
Afterwards, the PL of each measurement, MPL, was computed, in dB, as:\begin{equation*} \text {MPL}|_{\text {[dB]}} = P_{\text {RS}}|_{\text {[dBm]}} + G_{\text {BS}}|_{\text {[dBi]}} + G_{\text {UE}}|_{\text {[dBi]}} - \text {RSRP}|_{\text {[dBm]}}\tag{3}\end{equation*}
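As an illustration of Eq. (3), the following minimal sketch computes the measured PL from the DT quantities; the numeric values in the example are illustrative assumptions, not the actual network parameters used in this work.

```python
def measured_path_loss(rs_power_dbm, bs_gain_dbi, ue_gain_dbi, rsrp_dbm):
    """Measured path loss (MPL) in dB, following Eq. (3)."""
    return rs_power_dbm + bs_gain_dbi + ue_gain_dbi - rsrp_dbm

# Example (illustrative values): reference-signal power of 18 dBm, 17 dBi BS
# antenna gain, 0 dBi UE antenna gain, and a reported RSRP of -95 dBm.
print(measured_path_loss(18.0, 17.0, 0.0, -95.0))  # 130.0 dB
```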
One of this paper's objectives is to evaluate the geographical generalization capabilities of PL models when used in locations distinct from those where they were initially calibrated or developed. Therefore, the considered DT measurements were split into three datasets: train, validation, and generalization. From a central location of the reference area (cf. Fig. 1), DT measurements from 14 BSs were retrieved and randomly divided between the training and the validation sets, using an 80%/20% ratio. The training set (with 11293 measurements) was used to calibrate the PL models and to develop data-driven approaches using ML algorithms, namely, Linear Regression (LR), Support Vector Regression (SVR) [28], Random Forest Regression (RFR) [29], and Light Gradient Boosting Machine (LightGBM) [30] regression. While the LR provides a linear model similar in structure to the widely used ABG PL model, the remaining algorithms allow exploring non-linear and more complex regression models. The validation set (2824 measurements) was used to evaluate the PL models' accuracy under conditions similar to the training data. Finally, the generalization set contains 9819 DT measurements from nine BSs at other locations. The generalization dataset is used to evaluate the PL models' accuracy when applied to locations distinct from the training ones, providing insights into the location dependence of a PL model. The three datasets are represented geographically in Fig. 2.
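A minimal sketch of how such data-driven baselines could be fitted on the training split is given below; the feature file, column layout, and default model settings are illustrative assumptions, and the actual pipeline used in this work may differ.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR
from sklearn.ensemble import RandomForestRegressor
from lightgbm import LGBMRegressor

# Hypothetical file with one row per binned DT measurement:
# columns are log10(d_3D), h_eff, and the measured PL (dB).
data = np.loadtxt("dt_features.csv", delimiter=",")
X, y = data[:, :2], data[:, 2]
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

models = {
    "LR": LinearRegression(),
    "SVR": SVR(),
    "RFR": RandomForestRegressor(),
    "LightGBM": LGBMRegressor(),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    rmse = np.sqrt(np.mean((y_val - model.predict(X_val)) ** 2))
    print(f"{name}: validation RMSE = {rmse:.2f} dB")
```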
Geographical disposition of the train, validation, and generalization DT datasets (based on [31]).
The PL measurements for the train, validation, and generalization sets are depicted in Fig. 3 as a function of the 3D distance between the BS and the UE. In addition, this figure presents the normalized histogram of the 3D distance (on the upper part of the figure) and of the PL variable (on the right side of the figure). Individually, the histograms of the PL values and of the 3D distances are similar for the train, validation, and generalization sets. Moreover, the PL dispersion becomes more evident when evaluating the PL as a function of the 3D distance, i.e., for a fixed distance the PL can vary substantially, which is a consequence of the distinct environments and the different radio link conditions (e.g., LoS or NLoS). Note that the 3D distance information was not present in the measurements dataset but was calculated using the BS and measurement point coordinates, including the terrain elevation at the respective positions.
Train, validation, and generalization PL measurements as a function of the 3D distance between the BS and the UE.
Self-Supervised Learning With Satellite Data
The development of realistic PL models is influenced by the quality and quantity of PL measurements, as a low number of measurements may fail to properly capture all the radio propagation mechanisms or to adequately characterize the radio environment. However, extensive PL measurements are not always available, limiting the accuracy of the developed model. Considering deep learning PL models that use satellite images as input, if the PL measurements are limited to a few and homogeneous geographical locations, the corresponding satellite images tend to be similar, which is undesirable towards developing models with geographical generalization capabilities. With limited data, a deep learning model can easily overfit to the particularities of the used satellite images and to the specific environment corresponding to the PL measurements area. Therefore, this work proposes to split the problem of PL prediction into two parts: first, learn effective representations of the radio environment from satellite images (without supervision), regardless of whether they represent areas with or without PL measurements; second, use these representations, together with the PL measurements, to train the PL model. With this, the generalization capability of a PL model is expected to be enhanced.
This section starts by providing a background on representation learning and CNNs. Representation learning is used in this work to develop DNNs to extract features from satellite images, and CNNs are the prime NN architecture to handle images as data inputs. Then, the self-supervised methodology (a particular representation learning approach) used in this work is presented. Finally, the self-supervised methodology is applied to the satellite data described in section III-A.
A. Background
Nowadays, the volumes of produced data are ever increasing, making the manual task of extracting valuable information a huge challenge. An alternative is to automatically extract features from the raw data, for which representation learning has been used successfully throughout the years, particularly in computer vision tasks. The goal of representation learning is to extract a set of general representation features that can be used to increase the performance of downstream tasks, such as data regression [32].
1) Self-Supervised Representation Learning
The use of pre-trained models is common in the computer vision field; these are trained for specific tasks in large datasets (e.g., ImageNet [20]) and fine-tuned to new tasks. Firstly, the NN parameters learned from large datasets provide a good initialization of the NN, allowing a faster convergence. Secondly, the hierarchical NN features learned from models using large datasets can prevent overfitting, particularly if the final task has a small dataset.
However, building large-scale datasets is expensive and time-consuming when labeling is required, and many problems do not have large enough datasets. This problem is mitigated with self-supervised methods that learn visual features from unlabeled images. Generally, a transformation (e.g., an image rotation) is applied to the unlabeled images and a NN is trained to predict the properties of the transformation. These transformations are known as pretext tasks [33]. Thus, the NN is trained by optimizing the objective function of the pretext tasks, and new feature representations are discovered in this process. The learned NN parameters associated with the feature representations are carried over to other tasks, typically supervised ones, where the available data might be more limited [34].
In self-supervised learning, several pretext tasks have been proposed, designed so that features of the training images have to be captured by a CNN to solve the pretext tasks. At the same time, the pretext task generates a label for each image, according to the applied transformation, making this a supervised problem. According to the taxonomy proposed in [34], pretext tasks can be classified as generation-based, context-based, free semantic label-based, and cross-modal-based. The pretext tasks belonging to the generation-based class involve image or video generation, forcing the learned features to be relevant for this purpose. Context-based tasks require the learned features to describe context similarity between images, the spatial structure within an image, or the temporal structure for video data, among others. The free semantic label-based tasks require the automatic generation of semantic labels to train the NN. Finally, cross-modal-based tasks intend to train the NN by verifying if two different input data channels correspond to each other (e.g., video and audio correspondence) [34].
2) Convolutional Neural Networks
CNNs have been used in most computer vision tasks, such as semantic segmentation, object detection, or image classification, achieving state-of-the-art results. This success is tightly associated with the CNN architecture, which has several advantages over other deep learning architectures, such as the use of local connections [35].
In this work, a particular CNN architecture was used: the ResNet [36]. Its architecture addresses some of the problems associated with deeper NNs (e.g., vanishing gradients) through the introduction of residual connections, as depicted in Fig. 4; these connections, represented by the identity shortcut connection in Fig. 4, propagate the input of a given layer,
The ResNet architecture has been widely used in several computer vision tasks, and multiple ResNet-based architectures have been proposed (e.g., [37]). In addition, a recent work [38] demonstrated that the original ResNet matches recent state-of-the-art models when advanced training and scaling methodologies are used. Therefore, in this work, the ResNet50 [36] architecture was selected to develop the CNN for processing the satellite images.
The ResNet50 is composed of a total of 48 convolutional layers, one max-pooling layer, and one average pooling layer. This particular architecture exploits the benefits of deeper architectures without being too computationally complex. The ResNet50 implementation provided in [39] was used.
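A minimal sketch of instantiating such an encoder is shown below; the use of torchvision is an assumption for illustration, as the paper relies on the implementation of [39], and the 400-pixel input size mirrors the satellite tiles described in Section IV-C.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet50

encoder = resnet50(weights=None)  # trained from scratch with BYOL, not with ImageNet labels
encoder.fc = nn.Identity()        # drop the classification head, keep the 2048-d feature vector

with torch.no_grad():
    dummy_tile = torch.randn(1, 3, 400, 400)  # one RGB satellite tile (assumed size)
    features = encoder(dummy_tile)
print(features.shape)  # torch.Size([1, 2048])
```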
B. Self-Supervised Model
Several self-supervised learning models have been proposed in the recent literature, most requiring a pretext task to be solved. Within the scope of this paper, the image representations of the radio environment, learned from satellite images, should be relevant to discern the several factors that influence radio PL. Such factors are the existence (or not) of obstructions (e.g., buildings), areas with vegetation, the width of the streets, among others.
According to the pretext task taxonomy provided in [34], context-based tasks are the most appropriate group of pretext tasks for this work. These can be set for the CNN to predict the relative positions of two patches from the same image, as in [40]. Another pretext task is to predict the rotation angle applied to an image or to recognize the correct order of a sequence of shuffled patches from the same image, also known as puzzle tasks [41], [42]. To accomplish these pretext tasks, CNNs need to learn spatial context information, such as the shape of objects and the relative positions of different parts of an object [34]. However, a recently proposed methodology for self-supervised learning, called Bootstrap Your Own Latent (BYOL) [43], has achieved remarkable results, outperforming previous models. The main goal of BYOL is to learn image representations that can then be used for downstream tasks, which in the scope of this work is the development of a model for predicting radio path loss. The BYOL architecture includes two NNs, the online and the target networks, as depicted in Fig. 5.
BYOL uses image augmentation to produce two additional views from the original image; by applying random transformations to the input images (e.g., color transformations), it enriches the training data, reduces overfitting, and improves the model generalization [44]. BYOL considers the following transformations for augmenting images: random cropping, left-right flip, color jittering, color dropping, Gaussian blurring, and solarization [43]; then, each additional view is used as input to the online and target networks. While the online network consists of an encoder, a projector, and a predictor, the target network contains an encoder and a projector (see Fig. 5). In the original work, the encoder is implemented using a ResNet network (other architectures can be used), and the projectors and the predictor are implemented using Multi-Layer Perceptrons (MLPs). The overall network is trained to minimize the MSE loss between the normalized predictions of the online network (
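A minimal sketch of the BYOL objective follows: the online prediction is matched, after L2 normalization, to the target projection of a second augmented view, and the target weights track the online weights through an exponential moving average. This is a simplified illustration of [43], not the exact implementation used in this work.

```python
import torch
import torch.nn.functional as F

def byol_loss(online_prediction, target_projection):
    """MSE between L2-normalized vectors, i.e., 2 - 2 * cosine similarity."""
    p = F.normalize(online_prediction, dim=-1)
    z = F.normalize(target_projection.detach(), dim=-1)  # no gradient through the target
    return (2 - 2 * (p * z).sum(dim=-1)).mean()

@torch.no_grad()
def ema_update(target_net, online_net, tau=0.996):
    """Slowly move the target network parameters towards the online network."""
    for t_param, o_param in zip(target_net.parameters(), online_net.parameters()):
        t_param.mul_(tau).add_((1 - tau) * o_param)
```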
C. Satellite Self-Supervised Learning
In the following, the ResNet50 architecture was used as the encoder for the BYOL self-supervised learning approach, considering the reference area described in section III-A. Therefore, 500 images, with a resolution of 400
Example of a satellite image used for training, corresponding to an area of 2 km
Fig. 7 depicts the MSE between the normalized predictions of the online network and the target network projections, as a function of the epoch number. From this figure, it can be stated that, despite some variability, the network training loss converged.
The general success of deep learning, and particularly of CNNs, is achieved at the cost of low interpretability, which is still an active and open research question. However, a simple approach to gain intuition about the trained CNNs is to represent the feature maps of the convolutional layers for a given input image. A feature map is the output of a single filter of a convolutional layer.
Fig. 8 depicts four feature maps (from the first convolutional layer) when the image in Fig. 6 is used as input to the ResNet50 model. Comparing the satellite image with the four feature maps, it can be concluded that each feature map represents different information from the original image; it is also noted that information representing roads, buildings, and open areas is preserved, which is valuable information for the development of a PL model.
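Feature maps such as those in Fig. 8 can be obtained by registering a forward hook on the first convolutional layer of the trained encoder; the sketch below uses torchvision and placeholder inputs, so the encoder weights and the image loading are assumptions for illustration.

```python
import torch
from torchvision.models import resnet50
import matplotlib.pyplot as plt

encoder = resnet50(weights=None)     # placeholder: load the BYOL-trained weights here
image = torch.randn(1, 3, 400, 400)  # placeholder for a normalized satellite image

feature_maps = {}
def save_output(module, inputs, output):
    feature_maps["conv1"] = output.detach()

hook = encoder.conv1.register_forward_hook(save_output)
with torch.no_grad():
    encoder(image)
hook.remove()

fig, axes = plt.subplots(1, 4, figsize=(12, 3))
for i, ax in enumerate(axes):
    ax.imshow(feature_maps["conv1"][0, i].numpy(), cmap="gray")  # i-th filter output
    ax.set_axis_off()
plt.show()
```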
In the next section, the trained ResNet50 model is used to extract features from the satellite images and support the PL predictions.
Ubiquitous Satellite Aided Radio Propagation
This section describes the proposed USARP model for PL estimation. Firstly, the data inputs of the USARP model are presented alongside the model base architecture. Then, the model base architecture is optimized to maximize the geographical generalization capabilities of the model. The section ends by presenting the final architecture of the proposed USARP model.
A. USARP Inputs
The inputs of the USARP model are satellite images, the BS and UE locations, and variables describing the BS to UE radio link, namely the 3D distance in logarithmic scale, $\log _{10}(d_{3D})$, and the effective antenna height, $h_{\text {eff}}$, given by:\begin{equation*} h_{\text {eff}} = (h_{\text {TBS}} + h_{\text {BS}}) - (h_{\text {TUE}} + h_{\text {UE}})\tag{4}\end{equation*}
The inclusion of the satellite images as input of the USARP model is performed as follows:
The satellite images are centered in the BS.
A ROI mask is produced to identify the BS and UE locations and the direct radio link region.
The overlap between the satellite image and the ROI mask keeps only the areas of the satellite images that are relevant for the PL prediction: the BS and UE locations, and the direct radio link region.
The ROI mask works as an attention mechanism over the satellite image, retaining the key locations to estimate the PL. Fig. 9 presents an example of the ROI mask. In the ROI mask, the radius of the circle identifying the UE location was defined according to the Delay Spread (DS) of a radio signal in a UMa environment and the frequency of the DT measurements (2.6 GHz). According to [14], the DS mean (in a logarithmic scale) for LoS radio links in a UMa environment is given by:\begin{equation*} \text {DS}_{\text {LoS}} [\text {dB}] = -6.955-0.0963\log _{10} (f_{c})\tag{5}\end{equation*}
For NLoS radio links, the DS mean is given by:\begin{equation*} \text {DS}_{\text {NLoS}} [\text {dB}] = -6.28 -0.204\log _{10} (f_{c})\tag{6}\end{equation*}
Example of an ROI mask with the BS location on the smallest white circle, the UE location on the center of the largest white circle, and the area corresponding to the direct link between both ends.
The geo-referencing between the DT measurements and the satellite images, using an ROI mask, is enabled by the GeoTIFF image format, which shares the coordinate system of the DT measurements [45]. Furthermore, any error associated with the location of the UE or with the pixel association is mitigated, as the ROI mask also considers the neighboring pixels of the UE, as previously described.
Overall, the training of the USARP model requires DT measurements (see Section III-B) and the respective satellite image data. Firstly, satellite images centered on a BS location were generated for each BS reported in the DT data (see Fig. 6 as an example of such images); then, the ROI masks were created for each pair of BS and UE locations (see Fig. 9).
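A minimal sketch of building such a ROI mask is shown below: a small disc at the BS pixel, a larger disc around the UE pixel, and a band along the direct BS-UE link. The radii and band width are illustrative assumptions; in the paper, the UE disc radius is derived from the delay-spread statistics in Eqs. (5) and (6).

```python
import numpy as np

def roi_mask(shape, bs_px, ue_px, r_bs=3, r_ue=12, link_half_width=4):
    """Return a binary mask (H, W) marking the BS, the UE, and the direct link."""
    h, w = shape
    yy, xx = np.mgrid[0:h, 0:w]
    mask = np.zeros(shape, dtype=np.uint8)
    # Discs at the BS and UE pixel locations.
    mask[(yy - bs_px[0]) ** 2 + (xx - bs_px[1]) ** 2 <= r_bs ** 2] = 1
    mask[(yy - ue_px[0]) ** 2 + (xx - ue_px[1]) ** 2 <= r_ue ** 2] = 1
    # Band along the BS-UE segment (distance from each pixel to the segment).
    d = np.array(ue_px) - np.array(bs_px)
    t = ((yy - bs_px[0]) * d[0] + (xx - bs_px[1]) * d[1]) / max((d ** 2).sum(), 1)
    t = np.clip(t, 0, 1)
    dist2 = (yy - (bs_px[0] + t * d[0])) ** 2 + (xx - (bs_px[1] + t * d[1])) ** 2
    mask[dist2 <= link_half_width ** 2] = 1
    return mask

mask = roi_mask((400, 400), bs_px=(200, 200), ue_px=(80, 320))
print(mask.sum(), "pixels in the ROI")
```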
B. USARP Base Architecture
This section presents the base architecture of the USARP model and explains how its inputs (satellite image, ROI mask, and radio link variables) are considered.
Fig. 10 depicts the base architecture of the USARP model, which was inspired by [46], where the author presented a filter approach for focusing the attention of CNNs on an ROI. The ROI Filter implementation corresponds to an element-wise multiplication between the ROI mask and the feature maps resulting from the convolution applied to the satellite image. This process acts as a hard attention mechanism by discarding the image features that do not belong to the respective ROI. Moreover, after the element-wise multiplication of the satellite image by the ROI mask, the resulting image is rotated so that the UE is at zero degrees relative to the north direction. Thus, the USARP model is invariant to the direction between the BS and the UE. Furthermore, the image is cropped to enclose the ROI mask. Fig. 11 presents an example of the output of the ROI Filter layer.
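A minimal sketch of the ROI Filter step is given below: the input is multiplied element-wise by the ROI mask, rotated so that the BS-UE direction is aligned with north, and cropped to the ROI bounding box. The tensor shapes, the use of torchvision transforms, and the rotation convention are assumptions for illustration.

```python
import torch
import torchvision.transforms.functional as TF

def roi_filter(image, mask, ue_bearing_deg):
    """image: (C, H, W) float tensor; mask: (H, W) binary tensor;
    ue_bearing_deg: BS->UE bearing, measured clockwise from north (assumed convention)."""
    masked = image * mask.unsqueeze(0)                 # hard attention: keep the ROI only
    rotated = TF.rotate(masked, angle=ue_bearing_deg)  # align the BS->UE direction with north
    rotated_mask = TF.rotate(mask.unsqueeze(0), angle=ue_bearing_deg)
    ys, xs = torch.nonzero(rotated_mask[0] > 0, as_tuple=True)  # ROI bounding box
    return rotated[:, ys.min():ys.max() + 1, xs.min():xs.max() + 1]

out = roi_filter(torch.rand(3, 400, 400), torch.ones(400, 400), ue_bearing_deg=30.0)
print(out.shape)
```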
Afterwards, the CNN (ResNet50), trained using the self-supervised learning approach (as described in section IV-C), is used to extract features from the ROI Filter output. The vector of features resulting from the ResNet50, and the radio link variables (
C. USARP Architecture Optimization
In this section, the base architecture of the USARP model (cf. Fig. 10) is optimized, with the goal of developing a PL model able to generalize to data distributions distinct from the training data distribution. Accordingly, the contributions of the radio link variables and of the satellite-based features were analyzed, leading to the following candidate modifications:
The addition of a linear layer that outputs the PL based on the radio link variables and on the features extracted from the satellite images.
The disabling of the ResNet50 parameters update during training.
The removal of the convolutional layer having the satellite images as input.
The proposed modifications were evaluated using the training dataset for training purposes, and the validation and generalization datasets to measure the ability of each modification to generalize to new data distributions. For the assessment of the PL predictions, three error metrics were used, namely the RMSE, the Mean Absolute Error (MAE), and the Explained Variation Score (EVS). The PL prediction error vector is defined as:\begin{equation*} e = MPL - \widehat {MPL}\tag{7}\end{equation*}
\begin{align*} \text {MAE}=&\frac {1}{N}\sum _{i=0}^{N-1}{ |e_{i}| } \tag{8}\\ \text {RMSE}=&\sqrt { \frac {1}{N}\sum _{i=0}^{N-1}{ e_{i}^{2}} }\tag{9}\end{align*}
The EVS, which measures the proportion of variation accounted for in a given set of predictions, is computed according to:\begin{equation*} \text {EVS} = 1 - \frac {\text {Var}(e)}{\text {Var}(MPL)}\tag{10}\end{equation*}
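The three metrics in Eqs. (8)-(10) can be computed as in the following minimal NumPy sketch.

```python
import numpy as np

def pl_error_metrics(mpl, mpl_hat):
    """mpl: measured PL (dB); mpl_hat: predicted PL (dB). Returns (MAE, RMSE, EVS)."""
    e = mpl - mpl_hat                    # prediction error vector, Eq. (7)
    mae = np.mean(np.abs(e))             # Eq. (8)
    rmse = np.sqrt(np.mean(e ** 2))      # Eq. (9)
    evs = 1.0 - np.var(e) / np.var(mpl)  # Eq. (10)
    return mae, rmse, evs

print(pl_error_metrics(np.array([120.0, 130.0, 140.0]), np.array([118.0, 133.0, 139.0])))
```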
All experiments were conducted using fixed training parameters: 50 epochs, a learning rate of 0.5, a batch size of 20, and the MSE as loss function. Also, the model parameters corresponding to the epoch with the lowest validation error were retained for comparison.
The addition of a final linear layer, which acts as a simple linear regression to predict the PL based on the radio link variables and the satellite-based features, enforces two constraints: the independence between the contributions of the radio link variables and of the satellite-based features to the PL prediction; and the linear dependence of the predicted PL on the radio link variables. These constraints, known to be valid according to the FSPL theory, were not guaranteed in the base architecture. Nonetheless, in related works [23], [24], radio link variables and image-based variables are commonly concatenated before using a NN to estimate the PL. The final linear layer modification corresponds to the introduction of SLP 4 having, as input, the output of SLP 3 and the radio link variables.
The initial and modified architectures were compared using the RMSE between the PL ground truth and the respective PL predictions for each dataset (cf. Fig. 12), showing that, even without optimizing the training parameters, the modified architecture (using SLP 4) achieves a lower error than the base one (without SLP 4). Thus, the modified architecture, with the addition of a final linear layer, is more effective in solving the PL prediction problem.
PL RMSE of the base architecture and the modified architecture (with the addition of the SLP 4) in the training, validation, and generalization datasets.
The second modification to the base architecture was to disable the update of the ResNet50 parameters during training. Particularly in the computer vision domain, DNN models are commonly trained on one initial task before being fine-tuned for a second task, which is known as transfer learning [47]. In this work, the ResNet50 model was initially trained with extensive satellite images using self-supervised learning (cf. Section IV-C) before being integrated into the USARP model architecture. However, in the PL prediction problem, the images used in the self-supervised stage and in the PL prediction stage are from the same source, contrary to typical transfer learning scenarios. Therefore, updating the ResNet50 parameters during the USARP model training may limit the range of radio environment representations already learned. Accordingly, the update of the ResNet50 parameters was disabled during training to assess its impact on the generalization performance.
Finally, the impact of removing the convolutional layer that directly processes the satellite images was also evaluated, as it could lead to overfitting by over-representing environment properties already included in the training data.
The error metrics of the validation and generalization datasets, for the studied architecture elements, are presented in Table 2; the first and the second rows of this table correspond, respectively, to the base architecture and to the addition of a final linear layer (SLP 4). All error metrics show that the modified architecture achieves a better generalization than the base one; therefore, it is more representative of the PL prediction problem.
The third row of Table 2 shows the errors obtained by disabling the update of the ResNet50 parameters during training (but keeping the newly added SLP 4). Although the performance on the validation set decreases, it increases on the generalization set. So, using the ResNet50 with the parameters learned with the BYOL algorithm, as opposed to allowing them to be updated, contributes to mitigating overfitting to the satellite images, resulting in lower generalization errors. The use of a self-supervised algorithm, such as BYOL, enables the incorporation of a wider variety of satellite images, as the existence of DT measurements for the respective areas of the satellite images is not required. Therefore, more representations of radio environments are learned, and the generalization capabilities of the USARP model are increased.
Finally, the last row of Table 2 corresponds to an architecture without the convolutional layer that precedes the ROI Filter layer (cf. Fig. 10), but retaining the previous architectural modifications. This architecture achieves the highest performance on the generalization dataset in all metrics. As before, the higher generalization performance comes at the cost of a degraded validation performance.
Overall, having the radio link variables as input to the last SLP, disabling the update of the ResNet50 parameters during training, and removing the convolutional layer preceding the ROI Filter layer leads to the highest generalization performance.
D. USARP Model
This section starts by defining the final architecture of the USARP model based on the previous analysis. Then, the hyperparameters of the USARP model are optimized, and regularization methods to further improve the generalization capacity of the proposed model are introduced.
1) Final Architecture
The architecture analysis presented in Section V-C is reflected in the final USARP model architecture depicted in Fig. 13. Comparing the final architecture with the base architecture (see Fig. 10), the initial convolutional layer was removed, as discussed. The ResNet50 produces an output vector with a dimension of 2048, and the three SLPs progressively reduce its dimension. Each SLP includes a batch normalization layer and a PReLU activation. The SLP 4 was added to the base architecture to enforce the linear association between the radio link variables and the predicted PL. Formally, the PL predictions of the USARP model, obtained by the SLP 4, can be decomposed as:\begin{equation*} \text {PL}^{\text {USARP}}(x,r,s) = f_{1}(x,r) + f_{2}(s)\tag{11}\end{equation*}
According to the proposed architecture in Fig. 13, \begin{equation*} f_{2}(s) = \omega _{0} + \sum _{i=1}^{2} {\omega _{i} s_{i}}\tag{12}\end{equation*}
\begin{equation*} f_{1}(x,r) = \sum _{i=3}^{P+2}{ \omega _{i} f_{4}(f_{3}(T(x \odot s)))_{i}}\tag{13}\end{equation*}
\begin{equation*} \text {SLP}_{j} = g(\text {BN}(W^{j}u_{j} + b_{j})), \quad j \in \{1,2,3\}\tag{14}\end{equation*}
\begin{equation*} \text {BN}(u') = \gamma \odot \frac {u' - \mu _{\mathfrak {B}}}{\sigma _{\mathfrak {B}}} + \beta\tag{15}\end{equation*}
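A highly simplified sketch of the final architecture described above is given below: a frozen, BYOL-pretrained ResNet50 encodes the ROI-filtered satellite image, three SLP blocks (linear layer, batch normalization, and PReLU) compress the 2048-dimensional feature vector, and a final linear layer (SLP 4) combines the compressed image features with the two radio link variables. The layer sizes are illustrative assumptions and do not correspond to the optimized values in Table 4.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet50

class SLP(nn.Module):
    """Single linear layer followed by batch normalization and PReLU."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.block = nn.Sequential(nn.Linear(in_dim, out_dim),
                                   nn.BatchNorm1d(out_dim), nn.PReLU())
    def forward(self, u):
        return self.block(u)

class USARP(nn.Module):
    def __init__(self, image_feature_dim=16):
        super().__init__()
        self.encoder = resnet50(weights=None)  # load the BYOL-pretrained weights in practice
        self.encoder.fc = nn.Identity()
        for p in self.encoder.parameters():    # ResNet50 parameters are not updated
            p.requires_grad = False
        self.slp1, self.slp2, self.slp3 = SLP(2048, 512), SLP(512, 128), SLP(128, image_feature_dim)
        self.slp4 = nn.Linear(image_feature_dim + 2, 1)  # image features + [log10(d3D), h_eff]

    def forward(self, roi_filtered_image, radio_link_vars):
        img_feat = self.slp3(self.slp2(self.slp1(self.encoder(roi_filtered_image))))
        return self.slp4(torch.cat([radio_link_vars, img_feat], dim=1))

model = USARP()
pl = model(torch.rand(4, 3, 224, 224), torch.rand(4, 2))
print(pl.shape)  # torch.Size([4, 1])
```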
2) Regularization and Hyperparameter Tuning
DNNs can approximate very complex functions due to their large number of parameters and expressiveness. However, they can easily overfit and provide poor generalization. Therefore, regularization techniques have been proposed; one such technique is the use of dropout during the DNN training [49]. Dropout consists of randomly ignoring nodes during training, which prevents single neurons from becoming too specialized and neighboring neurons from becoming too dependent on each other. In the USARP architecture, dropout was applied to SLPs 1, 2, and 3.
Another regularization technique, widely used even before DNNs, is L2 regularization [50]; it adds a penalty to the loss function that penalizes the magnitude of the learned model parameters. The loss function, $L(w)$, is defined as:\begin{equation*} L(w) = \frac {1}{N}\left ({MPL - u_{4}w}\right)^{2}\tag{16}\end{equation*}
The regularization term, $L_{\text {reg}}(w)$, is given by:\begin{equation*} L_{\text {reg}}(w) = \lambda \sqrt {\sum _{i =3}^{P+2} {w_{i}^{2}}}\tag{17}\end{equation*}
The regularized loss function is then:\begin{equation*} \widetilde {L}(w) = L(w) + L_{\text {reg}}(w).\tag{18}\end{equation*}
Then, a set of hyperparameters of the USARP model was optimized, namely the number of output nodes of the SLP 3, the dropout probability, the regularization rate, the learning rate, and the number of epochs. The number of output nodes of the SLP 3 represents the final number of image-based features used to estimate the PL (first row of Table 3). The dropout probability establishes the probability of ignoring network nodes during training, while the regularization rate,
For the optimization of the hyperparameters, an open-source optimization framework, called Optuna [51], was used. This optimization framework first requires defining the search space for each hyperparameter, which is presented in Table 3. The Optuna framework allows the use of various sampling methods over the defined search space. In this work, the Tree-structured Parzen Estimator (TPE) sampling method was used [52], which efficiently explores the hyperparameter search space towards the optimal configuration; 200 trials (iterations searching for the optimal configuration) were conducted. The resulting best hyperparameter configuration is presented in Table 4.
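A minimal sketch of such a search with Optuna and the TPE sampler is given below; the search-space bounds and the train_and_validate helper are illustrative assumptions (the actual ranges are those of Table 3).

```python
import optuna

def train_and_validate(params):
    """Hypothetical stand-in: train the USARP model with `params` and return the
    validation RMSE in dB. Replaced by a dummy value here so the sketch runs."""
    return 12.0 + params["dropout"]

def objective(trial):
    params = {
        "slp3_nodes": trial.suggest_int("slp3_nodes", 2, 64),
        "dropout": trial.suggest_float("dropout", 0.0, 0.5),
        "reg_rate": trial.suggest_float("reg_rate", 1e-5, 1e-1, log=True),
        "learning_rate": trial.suggest_float("learning_rate", 1e-3, 1.0, log=True),
        "epochs": trial.suggest_int("epochs", 10, 100),
    }
    return train_and_validate(params)

study = optuna.create_study(direction="minimize",
                            sampler=optuna.samplers.TPESampler(seed=0))
study.optimize(objective, n_trials=200)
print(study.best_params)
```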
Results
This section starts with the performance assessment of several empirical PL models (considered as benchmarks) on the validation and generalization datasets, establishing the baseline performance for the remainder of the section. Then, the performance of the USARP model is presented and compared with the baseline approaches. Finally, the results of ablation studies performed on the USARP model are analysed.
A. Benchmark Models
The DT train dataset, presented in section III-B, was used to train the data-driven PL models, while the DT validation dataset was used to measure the respective PL prediction performance. Additionally, the DT generalization dataset (see Fig. 2) was used to estimate the PL models' performance in distinct environments (but still similar to the training environments).
Firstly, to gauge the performance of non-calibrated empirical PL models, the 3GPP TR 38.901 model [14] was used to estimate the PL at the locations of the DT validation data. This model has distinct equations for LoS and NLoS conditions, requiring the classification of each of the considered DT measurements accordingly. The LoS classification was performed deterministically, using terrain and 3D building information [53]. Afterwards, the 3GPP model was applied, and an RMSE of 20.96 dB was obtained, which is within the values reported in [8]. The 3D building data was limited to the train and validation areas, preventing an evaluation of the 3GPP PL model at the locations of the generalization DT data. Nonetheless, the generalization RMSE of the 3GPP PL model is expected to be of the same order of magnitude as the RMSE obtained for the validation dataset, taking into account the similarities of the radio environments.
Secondly, four ML regression-based algorithms were considered to develop data-driven PL models based on the DT training dataset: LR, SVR [28], RFR [29], and LightGBM [30] regression. These algorithms have, as input, the 3D distance between the BS and the UE locations in logarithmic scale, $\log _{10} (d_{3D})$, and the effective antenna height, $h_{\text {eff}}$, such that:\begin{equation*} \widehat {MPL} = f(\log _{10} (d_{3D}), h_{\text {eff}})\tag{19}\end{equation*}
The overall PL prediction performance of the 3GPP and the data-driven PL models is presented in Table 5, using the error metrics from section V-C on the validation (Val) and the generalization (Gen) datasets. This table shows that, for the validation dataset, all the data-driven PL models outperform the 3GPP PL model in all error metrics. Considering the validation dataset, the LightGBM-based model achieves the lowest RMSE and the highest EVS, while the SVR-based model attains the lowest MAE. Notably, although the LR model obtained the highest prediction errors on the validation dataset, it showed the best performance on the generalization dataset. The LR, which is mathematically similar to the ABG equation that supports most empirical PL models, demonstrates identical performance on the validation and on the generalization datasets. Additionally, the non-linear regression algorithms (SVR, RFR, and LightGBM) exhibit a significant performance degradation between the validation and generalization datasets. This comparison is further depicted in Fig. 14 for the RMSE metric. This figure shows that the performance of the data-driven models on the validation dataset is higher than the performance obtained when new data distributions are used. Therefore, unless the trained PL model is intended for making PL predictions in the same area as the training data, the LR is preferable to the non-linear ML regression algorithms.
PL RMSE of the linear regression and data-driven PL models in the validation and the generalization datasets.
B. USARP Model
The USARP model was trained using the hyperparameters from Table 4. The resulting performance is presented in Table 6, showing that this model surpasses the performance of all baseline models, in all metrics and datasets. Notably, it improves the generalization performance in all metrics relative to the best baseline model (the LR).
Fig. 15 allows a direct comparison between the considered models in terms of the resulting RMSE of the PL predictions. It can be stated that the LR still presents the lowest performance degradation between the validation and the generalization datasets. However, the superior expressiveness of the USARP model means that, even with a higher performance degradation from the validation to the generalization dataset, it still outperforms the LR by almost 1 dB in terms of RMSE.
PL RMSE of the linear regression, data-driven, and USARP PL models in the validation and the generalization datasets.
In Fig. 16, the PL predictions of the USARP model are compared with the DT PL measurements for the generalization dataset; the diagonal red line represents the predictions of an ideal model. From that reference, it can be stated that the USARP model follows the trend of the measured PL. Also, the PL predictions between 110 dB and 140 dB exhibit a higher standard deviation, possibly due to the higher volume of PL measurements in that range. Nonetheless, the USARP PL predictions are balanced between overestimating and underestimating the observed PL; the average error between the predicted and the measured PL is −0.01 dB, and the median error is 0.19 dB.
PL measurements as a function of the USARP model PL predictions in the generalization dataset.
Although the PL measurements used in this work and in related works are naturally distinct and come from different experimental areas, it is still valuable to compare the order of magnitude of the attained PL prediction accuracy, while describing the respective measurement setups; more importantly, the adopted methodologies should be compared. For instance, in [21], the authors obtained an RMSE of 4.42 dB between ground truth PL values and the predictions of the proposed model, a CNN-based model using images with building footprints. However, the prediction error was estimated against PL values obtained from a deterministic PL model, which could lead to a different performance when using real PL measurements. Furthermore, the generalization of that model was not evaluated. In addition, the proposed model requires one image per PL prediction, while the USARP model requires only one satellite image per BS to make the PL predictions.
In [23], a satellite-based DNN PL model achieved an RMSE of around 4 dB using a dataset of real PL measurements in the 2600 MHz band; these measurements were obtained from a single propagation environment (a university campus) using three BSs. The authors also reported an RMSE of around 8.5 dB between the 3GPP TR 38.901 model predictions and the PL measurements. This error analysis resulted from PL measurements geographically adjacent to the training data. For comparison, considering the validation set used in the USARP model development, the 3GPP TR 38.901 and the USARP models obtained an RMSE of 20.96 dB and 10.71 dB, respectively. Both the model proposed in [23] and the USARP model reduce the 3GPP model RMSE to approximately half, even though the model proposed in [23] was trained end-to-end specifically in a single (and very particular) environment. On the contrary, the USARP model was developed in urban and suburban environments and prioritized the generalization accuracy over the validation accuracy.
C. USARP Ablation Studies
This section presents ablation studies to evaluate the contribution of each input type, within the USARP architecture, to the PL prediction. First, the satellite images were replaced by matrices with all elements set to zero and with the same dimensions as the satellite images; second, the ROI mask images were replaced by matrices of the same size filled with ones; and, finally, the radio link variables were set to zero. The corresponding PL performance in the validation and generalization datasets, for each ablation, is presented in Table 7.
When the ablation of the satellite image is applied, it also blocks the information flow from the ROI mask. Thus, in practice, the ablation of the satellite image corresponds to using only the radio link variables as input. Compared with the regular USARP model, the satellite-based inputs improve the RMSE of the PL predictions by more than 2 dB on the validation dataset and by around 1 dB on the generalization dataset. The remaining error metrics report a similar behavior, where the satellite-based inputs contribute with higher performance gains on the validation dataset than on the generalization dataset. Specifically, an RMSE gain of 1.03 dB and 3.28 dB is obtained for the generalization and validation datasets, respectively. In [23], the authors reported a gain of just 0.8 dB in RMSE from using satellite images.
In the second ablation, which targeted the ROI mask, all error metrics are severely affected in both datasets. Thus, the extraction of relevant information characterizing both the UE and the BS locations is enhanced by using the ROI mask.
In the final ablation, the radio link variables were set to zero. As presented in Table 7, the radio link variables have the highest contribution to the performance of the USARP model. This results from the chosen architecture (cf. SLP 4 in Fig. 13), which incorporates known fundamentals of radio propagation.
Extending the USARP Model to Multiple Radio Environments
This section shows the potential of using the USARP model for PL prediction in multiple radio propagation environments. The supporting data (satellite and DT) are first introduced; the main results of the USARP model performance evaluation over multiple radio propagation environments are then presented and analysed.
A. Data
The ResNet50 CNN used in the USARP model was trained using the BYOL (as in section IV-C) with 1000 new satellite images, each one corresponding to a 2 km
Fig. 17 depicts an example of a rural environment satellite image used in the self-supervised training of the ResNet50, while Fig. 18 and Fig. 19 depict examples of suburban and urban areas, respectively. Afterwards, the ResNet50 was trained for 500 epochs, with a learning rate of
Example of rural satellite image used for the ResNet50 training using the BYOL [31].
Example of suburban satellite image used for the ResNet50 training using the BYOL [31].
Example of urban satellite image used for the ResNet50 training using the BYOL [31].
The DT data used to extract the PL measurements (as detailed in section III-B) was obtained from 85 distinct BSs in different radio propagation environments. Moreover, only measurements in the 800 MHz band were considered, as this band is widely used regardless of the environment, from rural to urban locations, due to its lower PL. Fig. 20 shows the PL as a function of the 3D distance for all DT measurements, together with the radio environment (rural, suburban, or urban) associated with each PL measurement. This classification was obtained by considering a conversion of population density to radio environment provided by [54] and a population density map [55]. Overall, from a total of 6066 PL measurements, 3275 correspond to rural, 1300 to suburban, and 1491 to urban locations.
PL measurements as a function of the 3D distance between the BS and the UE with radio environment classification.
B. Results
The potential of the USARP model for predicting the PL in multiple radio propagation environments was assessed using the DT data presented in the previous section. The DT data was randomly divided into training and validation datasets while maintaining the proportion of rural, suburban, and urban measurements in both datasets. Moreover, the training dataset accounted for 80% of the PL measurements, and the validation dataset for the remaining 20%.
The USARP model was trained with the training dataset using the hyperparameters shown in Table 4, except for the number of epochs (set to 100). The number of epochs was increased to obtain broader conclusions about the potential of the USARP model by evaluating the model performance along the training iterations.
Fig. 21 depicts the RMSE loss for the training and validation datasets obtained by the USARP model as a function of the number of epochs. The training loss of the USARP model is represented by the blue line, while the orange line corresponds to the USARP validation error. The training error gradually decreases as the number of epochs increases, and the validation loss follows the training loss trend, despite exhibiting higher variability. Furthermore, the training dataset was used as fitting data for a linear regression algorithm (as in section VI-A), and the error metrics were calculated on the validation dataset. In Fig. 21, the horizontal red line corresponds to the validation RMSE obtained by the linear regression. It can be concluded that only in the worst cases (particular epochs) does the USARP not provide a lower error than the linear regression. Overall, the USARP model reaches higher accuracy in PL predictions and, in the best case, the difference to the linear regression is substantial, exceeding 4 dB in RMSE. Note that the linear regression was selected as a reference for comparison as it is the baseline model with the highest generalization capacity and, therefore, the most trustworthy baseline PL model (cf. section VI-A).
PL RMSE of the USARP model in the training and validation datasets, as a function of the epoch number, and the PL RMSE of the linear regression PL model in the validation dataset.
The remaining error metrics were also considered to evaluate the USARP model. Fig. 22 depicts a box-plot graph representing the MAE and EVS distributions of the USARP model on the validation dataset. The horizontal red lines denote the MAE and EVS values of the linear regression on the same dataset. The USARP model error distributions consistently outperform the values obtained by the linear regression PL model. For completeness, the statistics defining the previous box-plot representations (minimum, 25th, 50th, and 75th percentiles, and maximum), for each error metric, are displayed in Table 8, along with the corresponding linear regression error metrics.
PL MAE and EVS distributions of the USARP model in the validation dataset and the respective errors of the linear regression in the same dataset (red lines).
Considering the median of the USARP error distributions, this model improves the RMSE, MAE, and EVS of the linear regression by 2.70 dB, 2.50 dB, and 0.37, respectively. Altogether, under the same setup regarding training and validation data representing multiple propagation environments, the USARP model clearly surpasses the linear regression. Additionally, section VI-B demonstrated that the USARP model has a higher generalization capacity than the linear regression, which in turn surpasses all the ML-based algorithms.
The potential of the USARP model for widespread geographical use, considering multiple radio propagation environments, was further evaluated. Firstly, the USARP model parameters with the lowest RMSE on the validation dataset were selected. Secondly, a linear regression was fitted specifically for each propagation environment, using only the training data of the respective environment. Therefore, the USARP model was trained with data from all radio environments, while three environment-specific linear regression models were obtained. Table 9 exhibits, for each radio environment and each model, the error metrics obtained on the respective validation datasets.
The potential of the USARP model is emphasized by the lower error metrics when compared to the environment-specific linear regression models. Therefore, the USARP model has a high potential to be used in multiple propagation environments, given its generalization capacity and ability to surpass environment-specific PL models.
Conclusion
This paper proposes the USARP model for PL prediction, improving the geographical generalization capabilities of empirical PL models, including ML/DNN-based ones, towards a ubiquitous PL model.
Firstly, it was shown that the performance of regression-based ML algorithms decreases significantly for locations not considered in the training data, even when they belong to similar propagation environments. In this context, the linear regression (the basis of empirical PL models) is the most robust approach in terms of geographic generalization performance. Therefore, the use of satellite images and DNN algorithms provides an opportunity to enhance the geographic generalization performance of data-driven PL models. Consequently, this paper proposes to split the problem of PL estimation using satellite images into two steps: 1) use of self-supervised learning to learn radio environment representations from satellite images; 2) employment of the radio environment representations, together with DT measurements, for PL prediction. This approach allows the development of robust satellite image representations, notably from locations without DT data, contributing to the geographical generalization of the model.
Then, the USARP model, based on a DNN architecture, was proposed with a focus on generalization performance. The USARP model not only exceeds the baseline methods in validation performance, but also surpasses their generalization performance. On the generalization dataset, the USARP model attained an RMSE of 12.34 dB, 1 dB lower than the RMSE of the linear regression-based model, and 3 dB and 2 dB lower than the RMSE of the SVR and RFR based models, respectively. Furthermore, the ablation studies performed on the USARP architecture revealed that the satellite-based inputs improve the RMSE of the PL predictions by more than 3 dB on the validation dataset and by around 1 dB on the generalization dataset, improving on previously reported values in the literature [23].
Finally, the potential of the USARP model for multiple radio propagation environments was shown. In fact, the USARP model can achieve a higher prediction accuracy than linear regression models specialized for each environment.
Overall, the USARP model enhances the geographical generalization capabilities of empirical PL models, supported by an appropriate architecture with regularization methods, and by successfully exploiting data from satellite images in a self-supervised approach.
Future work is underway to extend the USARP model to multiple radio frequencies and to develop new approaches to learn even more insightful representations of the radio environment from satellite images.
ACKNOWLEDGMENT
The authors would like to thank Instituto de Telecomunicações (IT) and Celfinet for their support and contributions to this work.