Performance models for Cluster-enabled OpenMP implementations
Cai, J.
Rendell, A.P.
Strazdins, P.E.
H'sien Jin Wong
Dept. of Comput. Sci., Australian Nat. Univ., Acton, ACT
This paper appears in: Computer Systems Architecture Conference, 2008. ACSAC 2008. 13th Asia-Pacific
Publication Date: 4-6 Aug. 2008
On page(s): 1-8
Location: Hsinchu,
ISBN: 978-1-4244-2682-9
INSPEC Accession Number: 10220655
Digital Object Identifier: 10.1109/APCSAC.2008.4625433
Current Version Published: 2008-09-16
Abstract
A key issue for cluster-enabled OpenMP implementations based on software distributed shared memory (sDSM) systems is maintaining the consistency of the shared memory space. This forms the major source of overhead for these systems and is driven by the detection and servicing of page faults. This paper investigates how application performance can be modelled based on the number of page faults. Two simple models are proposed: one based on the number of page faults along the critical path of the computation, and one based on the aggregate number of page faults. Two different sDSM systems are considered. The models are evaluated using the OpenMP NAS Parallel Benchmarks on an 8-node AMD-based Gigabit Ethernet cluster. Both models gave estimates accurate to within 10% in most cases, with the critical path model showing slightly better accuracy; accuracy is lost if the underlying page faults cannot be overlapped, or if the application makes extensive use of the OpenMP flush directive.
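The abstract does not state the formulas behind the two models, so the following is only an illustrative sketch of the distinction between them, not the paper's method. It assumes a linear cost model (a fixed cost per page fault added to a fault-free compute time), assumes fault counts are recorded per thread for each barrier-delimited region, and assumes the aggregate model spreads the total fault count evenly over the threads. All function names, parameters, and numbers are hypothetical.

```python
from typing import List

# faults[r][t] = page faults taken by thread t in region r, where a "region"
# is assumed to be the code between two consecutive barriers (an assumption
# made for this sketch, not a definition taken from the paper).
FaultCounts = List[List[int]]


def critical_path_estimate(compute_us: int, faults: FaultCounts, fault_cost_us: int) -> int:
    """Estimate run time from the faults on the critical path.

    In each region the slowest thread is assumed to be the one with the most
    page faults, so the per-region maxima are summed.
    """
    critical_faults = sum(max(region) for region in faults)
    return compute_us + fault_cost_us * critical_faults


def aggregate_estimate(compute_us: int, faults: FaultCounts, fault_cost_us: int) -> float:
    """Estimate run time from the total fault count shared evenly by threads."""
    total_faults = sum(sum(region) for region in faults)
    num_threads = len(faults[0])
    return compute_us + fault_cost_us * total_faults / num_threads


# Hypothetical measurements: two regions, two threads.
example_faults = [[10, 40], [30, 20]]
critical = critical_path_estimate(1_000_000, example_faults, 1_000)
aggregate = aggregate_estimate(1_000_000, example_faults, 1_000)
print(critical, aggregate)
```

In this sketch the two estimates coincide when every thread takes the same number of faults in every region and diverge as the fault load becomes imbalanced, which is one plausible reading of why a critical path model could be somewhat more accurate.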