Shrink or Substitute: Handling Process Failures in HPC Systems Using In-Situ Recovery | IEEE Conference Publication | IEEE Xplore