Skip to Main Content
Cluster is now the main architecture of high-performance computers. We discuss distributed and hierarchical autonomic management mechanisms based on partition and election. Three autonomic management strategies are proposed, including a new definition of autonomic element, a new formula to calculate the number of nodes managed in a logical partition and a new trigger-bully election algorithm. Based on the three strategies, we design and realize a high-performance cluster management system ACView in this paper, and this ACView system is now successfully applied in a real high performance cluster.