I. Introduction
The Shannon capacity of a channel manifests the spectral efficiency (SE) that it supports. Massive MIMO (multiple-input multiple-output) improves the sum SE of cellular networks by spatial multiplexing of a large number of user equipments (UEs) per cell [1]. It is therefore considered a key time-division duplex (TDD) technology for the next generation of cellular networks [2]–[4]. The main difference between Massive MIMO and classical multiuser MIMO is the large number of antennas, , at each base station (BS) whose signals are processed by individual radio-frequency chains. By exploiting channel estimates for coherent receive combining, the uplink signal power of a desired UE is reinforced by a factor , while the power of the noise and independent interference does not increase. The same principle holds for the transmit precoding in the downlink. Since the channel estimates are obtained by uplink pilot signaling and the pilot resources are limited by the channel coherence time, the same pilots must be reused in multiple cells. This leads to pilot contamination which has two main consequences: the channel estimation quality is reduced due to pilot interference and the channel estimate of a desired UE is correlated with the channels to the interfering UEs that use the same pilot. Marzetta showed in his seminal paper [1] that the interference from these UEs during data transmission is also reinforced by a factor , under the assumptions of maximum ratio (MR) combining/precoding and independent and identically distributed (i.i.d.) Rayleigh fading channels. This means that pilot contamination creates a finite SE limit as .