Efficient Deployment of Large Language Model across Cloud-Device Systems | IEEE Conference Publication | IEEE Xplore