1. INTRODUCTION
Realtime speech denoising is an audio processing task in high demand, perhaps more so than ever, as our world is still shadowed by the COVID pandemic and online meetings are becoming a "new normal" of daily social life. The most recent state-of-the-art realtime denoising techniques are all based on neural networks [1, 2, 3, 4, 5, 6, 7, 8]. They seek novel network structures that achieve plausible denoising quality while keeping the network simple enough to reduce processing time.