Availability in parallel systems: Automatic process restart | IBM Journals & Magazine | IEEE Xplore