Efficient and fault-tolerant distributed host monitoring using system-level diagnosis | IEEE Conference Publication | IEEE Xplore