CPP: Compensated Post-Training Pruning Approach for On-Device Large Language Model Services | IEEE Conference Publication