Design and evaluation of fault tolerance techniques for highly parallel architectures | IEEE Conference Publication | IEEE Xplore