Parallel and Distributed Processing, 1999. PDP '99. Proceedings of the Seventh Euromicro Workshop on

3-5 Feb. 1999

Filter Results

Displaying Results 1 - 25 of 50
  • Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99

    Publication Year: 1999
    Request permission for commercial reuse | |PDF file iconPDF (257 KB)
    Freely Available from IEEE
  • Index of authors

    Publication Year: 1999, Page(s):381 - 382
    Request permission for commercial reuse | |PDF file iconPDF (193 KB)
    Freely Available from IEEE
  • A method to solve the scalability problem in managing massively parallel processing on the Internet

    Publication Year: 1999, Page(s):256 - 262
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (92 KB)

    A set of computers connected by the Internet can be seen as a MIMD virtual parallel computer with distributed memory. We call such a computer a Massively Parallel Virtual Computer (MPVC). The implementation of an MPVC will be feasible in the next few years, as new technologies brine to Internet communication speeds undreamed of nowadays. Problems such as Monte Carlo simulations and image processin... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Matching architecture and software technology for HPC systems

    Publication Year: 1999, Page(s):2 - 9
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (32 KB)

    PQE2000 is an Italian project on High Performance Computing (HPC), whose goal is the realization of innovative general purpose systems and programming tools, as well as the development of new strategic HPC applications for industry, commerce and public services. The research activities of PQE2000 include MPP architecture, software tools and programming environments, and applications in technical, ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A parallel processor for neural networks

    Publication Year: 1999, Page(s):89 - 96
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (284 KB)

    We present two different algorithms implemented through neural networks on a multiprocessor device. The parallel single-chip TI TMS32C80 Multimedia Video Processor (MVP). The goal of this experimentation is to test, on real problems, the performance of this powerful unit made up by one Master Risc Processor and by four Slave Digital Signal Processors (DSPs) and to evaluate its suitability to neura... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Dynamic coupling of grid-based multidisciplinary applications

    Publication Year: 1999, Page(s):249 - 255
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (96 KB)

    The GRISSLi coupling interface is a runtime library designed for the coupled computation of grid-based multidisciplinary applications. Established simulation programs, which are devoted to a single discipline, can be interconnected to a complex simulation system via the GRISSLi coupling interface without recoding the participating programs. We introduce a data model to establish the links between ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Performance optimisations of the NPB FT kernel by special-purpose unroller

    Publication Year: 1999, Page(s):84 - 88
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (112 KB)

    The fast Fourier transform (FFT) is the cornerstone of many supercomputer applications and therefore needs careful performance tuning. Most often, however the real performance of the FFT implementations is far below the acceptable figures. In this paper we explore several strategies for performance optimisations of the FFT computation, such as enhancing instruction-level parallelism, loop merging,... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A resource management tool for heterogeneous networks

    Publication Year: 1999, Page(s):367 - 373
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (92 KB)

    We describe the design principles and implementation of a tool to be used as resource manager on arbitrary networks of workstations. It evaluates both statically (offline) and dynamically (on-line) the computational power and workload of each node in the network, in order to select the most performant computers after each application request for task spawning to the network. The tool is a componen... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimal-election algorithms for hypercubes

    Publication Year: 1999, Page(s):215 - 220
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (116 KB)

    Leader election is a fundamental problem in distributed computing and regards a wide number of applications. In order to solve this problem, it is possible and convenient to exploit the topological properties of the specific distributed systems, so to reduce time and message complexity. In this paper we study the problem of leader election in a hypercube network on the assumption that the system p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Scheduling tasks with communication delays on a two-levels virtual distributed system

    Publication Year: 1999, Page(s):344 - 348
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (180 KB)

    A set of tasks has to be scheduled on the parallel identical processors of the clusters of a two-levels distributed memory multiprocessor, subject to precedence constraints and small intra-cluster communication delays. The architecture model includes network of shared memory multiprocessors. In this paper, we present a new critical-path like algorithm that finds an optimal solution to this new pro... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Effective Monte Carlo simulation on System-V massively parallel associative string processing architecture

    Publication Year: 1999, Page(s):281 - 288
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (944 KB)

    We show that the latest version of massively parallel processing associative string processing architecture (System-V) is applicable for fast Monte Carlo simulation if an effective on-processor random number generator is implemented. Our lagged Fibonacci generator can produce 10 8 random numbers on a processor string of 12 K PE-s. The time dependent Monte Carlo algorithm of the one-dime... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A special purpose hybrid SIMD processor for logic event simulation

    Publication Year: 1999, Page(s):74 - 83
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (212 KB)

    This paper introduces a hybrid associative memory/SIMD parallel processor, APPLES, which has been specifically designed for logic simulation. It reviews the computational structure which permits parallel execution of logic gate evaluations in memory. This facilitates fine grain execution on a massive scale of the basic tasks inherent in VLSI logic simulation. Furthermore, unlike other SIMD approac... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Modeling advanced modular avionics architectures for early real-time performance analysis

    Publication Year: 1999, Page(s):181 - 188
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (144 KB)

    Behavioral simulation including the impact of architectural choices is required to help the designer in reducing design ambiguities and errors of embedded complex systems which are distributed (for performance and reliability requirements) and present hard real-time features (time critical avionics functions), as early in the product life cycle as possible. This paper presents a modeling and simul... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A replicated resource architecture for high performance network service

    Publication Year: 1999, Page(s):105 - 111
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (100 KB)

    Distributed Learning Environments represent the hope that communications and information technology can improve and widen access to education while maintaining and improving its quality. Such environments consist of network applications and services. Good interactive response time is crucial to their success. Slow responses can quickly dissuade teachers and learners alike from investing their time... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Sharing a common time reference in a heterogeneous distributed system

    Publication Year: 1999, Page(s):359 - 366
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (276 KB)

    This article introduces SYD, a tool for the generation of time-stamps synchronized with the Universal Time Coordinate: the implementation includes an NTP client and is written in Java. Possible applications include (but are not limited to) the measurement of the performance of distributed applications, their debugging, and the implementation of real-time requirements in distributed systems. The fe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An atmosphere-ocean forecast system on a hybrid architecture

    Publication Year: 1999, Page(s):309 - 314
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (168 KB)

    An operational forecast system is being developed at ENEA, which consists of a high resolution meteorological limited area model coupled with ocean models for the prediction of the state of the Mediterranean Sea and of high water events in the Venice Lagoon. In order to satisfy the operational requirement (3-days forecasts in a few hours) a very effective two level parallelization strategy was ado... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Associating composition of Petri net specifications with application designs in GRADE

    Publication Year: 1999, Page(s):204 - 211
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (340 KB)

    To provide high-level graphical support for developing message passing programs, an integrated programming environment (GRADE) is being developed. GRADE currently provides tools to construct, execute, debug, monitor and visualise message-passing based parallel programs. The paper describes the extension of GRADE with formal method support based on Petri nets composition. We outline specification c... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Performance analysis on CRAY T3E

    Publication Year: 1999, Page(s):241 - 248
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (144 KB)

    One of the reasons why parallel programming is considered to be a difficult task is that users frequently cannot predict the performance impact of implementation decisions prior to program execution. This results in a cycle of incremental performance improvements based on run-time performance data. While gathering and analyzing performance data is supported by a large number of tools, typically in... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Utilising parallel resources by speculation

    Publication Year: 1999, Page(s):339 - 343
    Cited by:  Patents (23)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (908 KB)

    This paper introduces Simultaneous Speculation Scheduling, a new compiler technique that enables speculative execution of alternative program paths. In our approach concurrently executed threads are generated that represent alternative program paths. Each thread is the result of a speculation on the outcome of one or more branches. All threads are simultaneously executed although only one of them ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A RAID reconfiguration scheme for gracefully degraded operations

    Publication Year: 1999, Page(s):66 - 73
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (100 KB)

    One distinct advantage of Redundant Array of Independent Disks (RAID) is fault tolerance. But the performance of a disk array in degraded mode is so poor that no one uses the RAID after failure. Continuous operation of RAID in degraded mode is very important in many real time applications, which can not be interrupted in providing continuous services. In this paper, we propose an efficient archite... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On the performance of nearest-neighbors load balancing algorithms in parallel systems

    Publication Year: 1999, Page(s):170 - 177
    Cited by:  Papers (4)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (324 KB)

    DASUD (Diffusion Algorithm Searching Unbalanced Domains) is a totally distributed load-balancing algorithm which belongs to the nearest-neighbors class. DASUD detects unbalanced domains (a processor and its immediate neighbors) and corrects this situation by allowing load movements between non-connected processors. DASUD has been evaluated by comparison with two well-known nearest-neighbors load b... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Adaptive microphone array beamforming for teleconferencing using VHDL and parallel architectures

    Publication Year: 1999, Page(s):13 - 18
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (80 KB)

    This paper describes how the VHDL hardware description language can be used to aid the analysis, design and implementation of adaptive array beamformers for use in teleconferencing environments using parallel architectures. Beamformer models are often implemented in traditional programming languages such as C, Pascal and FORTRAN. This paper argues that VHDL provides a number of benefits to designe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Integrating selected fault masking and self-diagnosis mechanisms

    Publication Year: 1999, Page(s):97 - 104
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (196 KB)

    Fault Detection and Isolation (FDI) using self-diagnosing elements is used in distributed systems to facilitate system monitoring and maintenance. Voting algorithms are used to increase reliability by masking faults. This research addresses the integration of self-diagnosing elements and voting algorithms in a Triple Modular Redundant (TMR) configuration. A self-diagnosing element model is present... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An approach to mapping parallel programs on hypercube multiprocessors

    Publication Year: 1999, Page(s):221 - 225
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (28 KB)

    In this work, we propose a heuristic algorithm based on genetic algorithm for the task-to-processor mapping problem in the context of local-memory multiprocessors with a hypercube interconnection topology. Hyper-cube multiprocessors have offered a cost effective and feasible approach to supercomputing through parallelism at the processor level by directly connecting a large number of low-cost proc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Dynamic load adaption in LIPS

    Publication Year: 1999, Page(s):374 - 379
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (64 KB)

    LIPS is a system for distributed computing using idle-cycles in heterogeneous networks of workstations. Especially data- and compute-intensive applications in the field of cryptography and computer algebra have used the system. The system provides its user with the tuple space based generative communication paradigm of parallel computing as known from the coordination language LINDA. In LIPS, fail... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.