Skip to Main Content
A parallelization of the low-frequency multilevel fast multipole algorithm (MLFMA) for graphics processing units (GPUs) is presented. The implementation exhibits speedups between 10 and 30 compared to a serial CPU implementation of the algorithm. The error of the MLFMA on the GPU is controllable down to machine precision. Under the typical method-of-moments (MoM) error requirement of three correct digits, modern GPUs are shown to handle problems with up to 7.5 million degrees of freedom in dense matrix approximation.