By Topic

The need for new monitoring and management technologies in large scale computing systems

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Buchholz, J. ; HLRS High Performance Comput. Center Stuttgart, Stuttgart, Germany ; Volk, E.

Currently administrators of high performance computing (HPC) resources are facing new challenges caused by several changes in the usage of the resources like rapidly growing user community and their needs on application level, interdisciplinary usage and therefore new functional requirements like storage at specific storage providers. The increasing complexity of the whole system administration needs to be supported technically. In this paper we explain the reasons why the administration of HPC resources is slightly different from other resources and show the consequences when these differences are not regarded by administration tools. After exposing their limitations and deficiencies we describe the upcoming needs from the HPC providers' perspective in comparison to the currently available features. In order to solve addressed problems in a very generic way, we present a possible solution of a hierarchically structured monitoring and management framework.

Published in:

eChallenges, 2010

Date of Conference:

27-29 Oct. 2010