I. Introduction
The exponential growth in the number of IoT multimedia devices equipped with screens, cameras, and microphones brings significant opportunities for multimedia content distribution. In the past few years, tens of millions of video and game consumers have become media creators with the support of social apps such as TikTok, WeChat, Roblox, Twitch, and YouTube. Various IoT devices are used throughout the content creation and sharing process. IoT device makers are now working aggressively with service providers to add hardware and software support to their systems, enabling more advanced AI technologies, such as computer vision, face recognition, and natural language processing (NLP), to meet the demand for media generation.