This paper presents a novel use of visualization applied to debugging the Cplant TM cluster hardware at Sandia National Laboratories. As commodity cluster systems grow in popularity and grow in size, tracking component failures within the hardware will become more and more difficult. We have developed a tool that facilitates visual debugging of errors within the switches and cables connecting the processors. Combining an abstract system model with color-coding for both error and job information enables failing components to be identified.
Published in:
Visualization, 2001. VIS '01. Proceedings
Date of Conference: 21-26 Oct. 2001