Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing | IEEE Conference Publication | IEEE Xplore