2016 Second International Workshop on Extreme Scale Programming Models and Middleware (ESPM2)

18 Nov. 2016

  • [Title page]

    Publication Year: 2016, Page(s): i
  • [Copyright notice]

    Publication Year: 2016, Page(s): ii
  • Table of contents

    Publication Year: 2016, Page(s): iii - iv
  • Message from the Program Chairs

    Publication Year: 2016, Page(s): v
  • Keynote

    Publication Year: 2016, Page(s): 1

    Provides an abstract of the keynote presentation and a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings.

  • In-Staging Data Placement for Asynchronous Coupling of Task-Based Scientific Workflows

    Publication Year: 2016, Page(s): 2 - 9

    Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the asynchronous interaction and coupling behaviors between tasks of the component applications. In this paper, we present an adaptive data placement approach that addresses these challenges by dynamically adjusting to the asynchronous coupling patter...

  • PGAS Communication Runtime for Extreme Large Data Computation

    Publication Year: 2016, Page(s): 10 - 16

    For partitioned global address space (PGAS) runtimes, supporting out-of-core data computation is an important issue. Some researchers have shown that flash SSDs are useful for out-of-core data computation. In this paper, we introduce ComEx-PM, a PGAS communication runtime. ComEx-PM supports out-of-core data computation using a flash SSD. ComEx-PM launches multiple processes in each node. Memory region ...

  • A Scalable Task Parallelism Approach for LU Decomposition with Multicore CPUs

    Publication Year: 2016, Page(s): 17 - 23

    Many scientific applications have linear systems A · x = b which need to be solved for different vectors b. LU decomposition, which is a variant of Gaussian Elimination, is an efficient technique to solve a linear system. The main idea of the LU decomposition is to factorize A into an upper (U) triangular and a lower (L) triangular matrix such that A = LU. This paper presents an OpenMP task...

  • Metaprogramming-Enabled Parallel Execution of Apparently Sequential C++ Code

    Publication Year: 2016, Page(s): 24 - 31

    Task-based execution models have received considerable attention in recent years to meet the performance challenges facing high-performance computing (HPC). In this paper we introduce MetaPASS - Metaprogramming-enabled Parallelism from Apparently Sequential Semantics - a proof-of-concept, non-intrusive header library that enables implicit task-based parallelism in a sequential C++ code. MetaPASS ...

  • SWE-X10: Simulating Shallow Water Waves with Lazy Activation of Patches Using ActorX10

    Publication Year: 2016, Page(s): 32 - 39

    We present an efficient Finite Volume solver for the shallow water equations using an actor extension of the X10 programming language, ActorX10, as programming model. Each actor is assigned to a Cartesian patch of the computational grid. Using the actor's finite state machine to control patch updates, we realize lazy activation of patches, only when a propagating wave enters the respective patch. ...

  • Runtime Coordinated Heterogeneous Tasks in Charm++

    Publication Year: 2016, Page(s): 40 - 43

    Effective utilization of the increasingly heterogeneous hardware in modern supercomputers is a significant challenge. Many applications have seen performance gains by using GPUs, but many implementations leave CPUs sitting idle. In this paper, we describe a runtime-managed system for coordinating heterogeneous execution. This system manages data transfers to and from GPU devices and schedules work ...

  • An Overview of Performance Portability in the Uintah Runtime System through the Use of Kokkos

    Publication Year: 2016, Page(s): 44 - 47

    The current diversity in nodal parallel computer architectures is seen in machines based upon multicore CPUs, GPUs and the Intel Xeon Phi's. A class of approaches for enabling scalability of complex applications on such architectures is based upon Asynchronous Many Task software architectures such as that in the Uintah framework used for the parallel solution of solid and fluid mechanics problems....

  • Author index

    Publication Year: 2016, Page(s): 48