Supercomputing '88: Proceedings of the 1988 ACM/IEEE Conference on Supercomputing, Vol. I

14-18 Nov. 1988

  • Proceedings. Supercomputing '88 (IEEE Cat. No.88CH2617-9)

    Publication Year: 1988
  • Faust: an environment for programming parallel scientific applications

    Publication Year: 1988, Page(s):3 - 10
    Cited by:  Papers (3)

    The goals and architecture of the Faust environment are presented. The components of the Faust user-interface library are described. The Faust program abstraction supports three levels of detail: the process graph, the subroutine interconnection graph, and the program source code. Each level is described in detail.
  • iWarp: an integrated solution to high-speed parallel computing

    Publication Year: 1988, Page(s):330 - 339
    Cited by:  Papers (151)  |  Patents (14)

    A description is given of the iWarp architecture and how it supports various communication models and system configurations. The heart of an iWarp system is the iWarp component: a single-chip processor that requires only the addition of memory chips to form a complete system building block, called the iWarp cell. Each iWarp component contains both a powerful computation engine that runs at 20 MFLO...
  • Growing discord: programming philosophy and hardware design

    Publication Year: 1988, Page(s):18 - 26

    Current compiler technology is examined with respect to choices or trends in supercomputer design. The basic techniques are extensions of strategies for vector machines, but they have limited effectiveness in a parallel environment. Technical reasons for this are presented. The nature of automatic and nonautomatic code optimization is explored.
  • The Convex C240 architecture

    Publication Year: 1988, Page(s):321 - 329
    Cited by:  Papers (4)  |  Patents (16)

    A description is given of the C240, a tightly coupled, shared memory, parallel multiprocessor that supports up to 40-ns ECL/CMOS Cray-like processors. It is managed by a fully semaphored Unix operating system and can support up to 4 Gb of directly addressable physical memory. Convex proprietary compiler technology provides automatic vectorization and parallelization for Fortran, C and Ada. The all...
  • The Joint Research Council's supercomputing unit

    Publication Year: 1988, Page(s):450 - 455

    A brief description is given of the Joint Research Councils' Supercomputer Unit, a Cray X-MP/48 installation for the use of academic researchers in the UK. Both hardware and system software are examined. Operational experience and scientific use of the machine are discussed.
  • Vector and parallel processing of the nuclear reactor transient analysis code RELAP5

    Publication Year: 1988, Page(s):230 - 236

    An experiment on vector processing and multitasking of the nuclear reactor transient analysis code RELAP5 has been carried out. Vector processing and multitasking were achieved by using the independence of the spatial meshes. The vectorization ratio is 83%. The performance ratio in the vector mode to that in the scalar mode is about three on the FACOM VP-100. For multitasking the spatial meshes ar...
  • Interactive scientific visualization and parallel display techniques

    Publication Year: 1988, Page(s):132 - 139

    A graphics environment for essentially real-time interactive scientific visualization of computational fluid mechanics is described. A researcher can interactively examine fluid data on a frame buffer with animated flow visualization diagnostics that mimic those in the experimental laboratory. This provides an effective and interactive way to analyze the underlying physical mechanisms and to compa...
  • Some patterns of technological change in high-performance computers

    Publication Year: 1988, Page(s):312 - 320
    Cited by:  Papers (4)

    The author describes the patterns of the unusually rapid change that high-performance computer technology is undergoing. He identifies the stages in which technological changes occur and then defines 'change' as the first derivative of an information function that describes the state of a technology. He explores the driving forces that cause three generic patterns of technological change: incremen...
  • FACOM 6443 magneto-optic disk sub-system

    Publication Year: 1988, Page(s):266 - 271

    A rewritable optical disk drive is described. The drive has a large storage capacity of 8.9 Gb/spindle with a few seconds of access time. It can store data economically in a small floor space, and it can be used as a supplement or partial alternative to magnetic disk drives or tape library units. The authors discuss the drive technologies, support software, and application of this subsystem.
  • Supercomputer integration program at Boeing

    Publication Year: 1988, Page(s):442 - 449

    The authors define supercomputers as specialized machines, describe their impact on software and applications, and introduce two major supercomputing issues, namely, visualization and communication. They then discuss supercomputer integration programs and their current experience related to the Alabama Supercomputer Network.
  • Profiles in mass storage: a tale of two systems

    Publication Year: 1988, Page(s):222 - 228
    Cited by:  Patents (1)

    The Los Alamos common file system (CFS) and the NCAR mass storage system (MSS) are file storage and file management systems that serve heterogeneous computing networks of supercomputers, general-purpose computers, scientific workstations, and personal computers. The authors discuss philosophical, implementation, and performance aspects of the two systems. Areas covered include the computing enviro...
  • The design of a lockup-free cache for high-performance multiprocessors

    Publication Year: 1988, Page(s):352 - 359
    Cited by:  Papers (5)  |  Patents (15)

    The performance of cache-based, shared-memory multiprocessors can suffer greatly from moderate cache miss rates because of the usually high ratio between memory-access and cache-access times. The authors propose a lockup-free cache design in which the handling of one or several cache misses is overlapped with processor activity. In multiprocessors, lockup-free caches aggravate the memory coherence...
  • A scientific visualization workbench

    Publication Year: 1988, Page(s):148 - 155
    Cited by:  Papers (2)  |  Patents (3)

    A system for visualization of data from supercomputer simulations has been developed. The scientific visualization workbench, as it is called, is based on an industry standard workstation (a Sun 3/160C), the NeWS window system, and a video/graphics add-in card, which is supported by NeWS. Also involved is a frame buffer attached to a 48-Mb/s Cray channel and a video link from the frame buffer to t...
  • An efficient pipelined dataflow processor architecture

    Publication Year: 1988, Page(s):368 - 373
    Cited by:  Papers (22)  |  Patents (7)

    It is demonstrated that the principles of pipelined instruction execution can be effectively applied in data-flow computers, yielding an architecture that avoids the main sources of pipeline gaps during program execution in many conventional designs. The processing element uses an architecture called argument-fetch data-flow architecture. It has two parts: a data-flow instruction scheduling unit (...
  • Compiling issues for supercomputers

    Publication Year: 1988, Page(s):164 - 173
    Cited by:  Papers (1)

    Two issues for supercomputer compilers are considered. Exact and inexact tests for computing data dependences are discussed first, with a focus on the linearization scheme. A program transformation called loop spreading is then presented that can be used to execute adjacent loops with interloop dependences in parallel.
  • V-Pascal: an automatic vectorizing compiler for Pascal with no language extensions

    Publication Year: 1988, Page(s):182 - 189

    A detailed anatomy of the automatic vectorizing compiler V-Pascal (Version 1, now operational) is given. With no language extensions, V-Pascal efficiently vectorizes the whole of arbitrarily given multiply nested for loops using the mechanism of vector indirect addressing. Higher functions of V-Pascal, such as vectorization of WHILE-DO and recursive calls, vectorization of relational operations on...
  • Polycyclic vector scheduling vs. chaining on 1-port vector supercomputers

    Publication Year: 1988, Page(s):122 - 129
    Cited by:  Papers (7)

    The impact of chaining and several instruction scheduling schemes on one-memory-port vector supercomputers, illustrated by the Cray-1 and Cray-2, is studied. The lack of instruction chaining in the Cray-2 vector processor requires a different instruction scheduling scheme from that of the Cray-1. Situations are characterized in which simple vector scheduling can generate optimal code that fully ut...
  • Benchmarking the Connection Machine 2

    Publication Year: 1988, Page(s):304 - 309
    Cited by:  Papers (3)  |  Patents (2)

    The Connection Machine 2 (CM-2) is a massively parallel, single-instruction-multiple-data machine with potential for computation rates in the range of several billions of floating point operations per second. A two-dimensional shallow-water equations model was run on a CM-2 to obtain an estimate of its performance on atmospheric and ocean sciences problems. C* and *Lisp versions were developed. Th...
  • Vectorization of conjugate-gradient methods for large-scale minimization

    Publication Year: 1988, Page(s):410 - 418
    Cited by:  Papers (1)

    Vectorization techniques are applied to the nonlinear conjugate-gradient method for large-scale unconstrained minimization. Computational results are presented for a robust limited-memory quasi-Newton-like conjugate-gradient algorithm applied to meteorological problems. The vectorization results in speedups of up to a factor of 21 compared to the performance of the scalar code, when nonlinear functio...
  • An evaluation of vector Fortran 200 generated by Cyber 205 and ETA-10 pre-compilation tools

    Publication Year: 1988, Page(s):106 - 113
    Cited by:  Papers (3)

    Vectorizing precompilers such as KAP/205 and VAST-2 complement the efficient use of Fortran on the CDC Cyber 205. With the advent of the ETA-10 and its EOS/VSOS environment, the performance of these Fortran 200 preprocessors has come under closer scrutiny. The extent or quantity of vectorization that can be achieved has been examined elsewhere with reference to a test set of Fortran DO loops. The ...
  • Optimal scheduling policies for mixed scalar-vector multiprocessor supercomputers

    Publication Year: 1988, Page(s):258 - 264

    A queuing network model is developed for multiprocessing supercomputer systems implemented on two heterogeneous processors, a scalar unit and a vector unit, which work concurrently on a single process. The model is analyzed theoretically for a representative case of model job mixes. It is found that the utilization of one of the two central service stations in the model can only be maximized at th...
  • Performance comparison of the Cray-2 and Cray X-MP/416

    Publication Year: 1988, Page(s):288 - 295
    Cited by:  Papers (3)  |  Patents (47)

    The serial and parallel performance of the Cray-2 is analyzed using the standard Los Alamos benchmark set plus codes adapted for parallel processing. For comparison, architectural and performance data are given for the Cray X-MP/416. Factors affecting performance, such as memory bandwidth, size and access speed of memory, and software exploitation of hardware, are examined. The parallel-processing...
  • Highly vectorized algorithm for transient simulation of space reactor systems

    Publication Year: 1988, Page(s):393 - 399

    A project to develop a code specifically designed to model and analyze space reactor systems is discussed. The code, named CENTAR (Code for Extended Nonlinear Transient Analysis of Extraterrestrial Reactors), is highly vectorizable and machine-independent. Timing tests of the code have been run on VAX, Cray, NEC, Convex and SUN systems. It has been found that CENTAR executes approximately seven ti...
  • A transformational approach to the derivation of hardware algorithms from recurrence equations

    Publication Year: 1988, Page(s):433 - 440

    An approach to the derivation of hardware algorithms from recurrence equations is proposed that is based on program transformation. A representation for hardware algorithms, called relational representation, is introduced, in which one relational program corresponding to a given recurrence equation (namely a specification) is transformed to another relational program corresponding to a hardware al...