By Topic

Proceedings. The 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines Cat. No.97TB100186)

16-18 April 1997

Filter Results

Displaying Results 1 - 25 of 34
  • Proceedings. The 5th Annual IEEE Symposium on Field-Programmable Custom Computing Machines Cat. No.97TB100186)

    Publication Year: 1997
    Request permission for commercial reuse | PDF file iconPDF (200 KB)
    Freely Available from IEEE
  • Index of authors

    Publication Year: 1997, Page(s):249 - 250
    Request permission for commercial reuse | PDF file iconPDF (72 KB)
    Freely Available from IEEE
  • A parallel hardware evolvable computer POLYP

    Publication Year: 1997, Page(s):238 - 239
    Cited by:  Papers (1)  |  Patents (24)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (160 KB)

    Previous work (J.S. McCaskill et al., 1996; 1997) has shown the power of massively parallel configurable hardware (NGEN) in conjunction with dataflow architectures for the simulation of evolving populations. NGEN is a flexible computer hardware for rapid custom circuit simulation of fine grained physical processes via a massively parallel architecture, e.g. 144 hardware configurable field programm... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Incremental reconfiguration for pipelined applications

    Publication Year: 1997, Page(s):47 - 55
    Cited by:  Papers (21)  |  Patents (195)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (716 KB)

    This paper examines the implementation of pipelined applications using run-time reconfiguration. Throughput and latency of pipelined applications can be significantly improved when reconfiguration is performed at the level of individual pipeline stages, as opposed to configuration of the entire FPGA. If reconfiguration and execution can be performed simultaneously, the performance of a pipelined a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Datapath-oriented FPGA mapping and placement for configurable computing

    Publication Year: 1997, Page(s):234 - 235
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (196 KB)

    Widespread acceptance of FPGA-based reconfigurable coprocessors will be expedited if compilation time for FPGA configurations can be reduced to be comparable to software compilation. This research achieves this goal, generating complete datapath layouts in fractions of a second rather than hours. Our algorithm, adapted from instruction selection in compilers, packs multiple operations into single ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • High level compilation for fine grained FPGAs

    Publication Year: 1997, Page(s):165 - 173
    Cited by:  Papers (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (604 KB)

    The authors present an integrated tool set to generate highly optimized hardware computation blocks from a C language subset. By starting with a C language description of the algorithm, they address the problem of making FPGA processors accessible to programmers as opposed to hardware designers. Their work is specifically targeted to fine grained FPGAs such as the National Semiconductor CLAyT... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Increased FPGA capacity enables scalable, flexible CCMs: an example from image processing

    Publication Year: 1997, Page(s):211 - 217
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (524 KB)

    The need to partition computation across multiple programmable devices in array architecture CCMs leads to performance bottlenecks in data flow through the computer and wiring delays between adjacent devices. However, significant improvements in FPGA capacities have brought one to a threshold where direct inter-chip connections are not required because an entire algorithm can be implemented on a s... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A wireless LAN demodulator in a Pamette: design and experience

    Publication Year: 1997, Page(s):40 - 45
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (588 KB)

    We have implemented the digital section of a wireless local area network (WLAN) demodulator in a reconfigurable interface card called the PCI Pamette. The entire baseband section of the demodulator has been implemented using the Pamette and a simple analog to digital mezzanine board. This is the second version of the demodulator, the first being a card-based design using a mixture of discrete and ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Speech recognition HMM training on reconfigurable parallel processor

    Publication Year: 1997, Page(s):242 - 243
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (188 KB)

    Armstrong III is a 20 node multi-computer that is currently operational. In addition to a RISC processor, each node contains reconfigurable resources implemented with FPGAs. The in-circuit reprogramability of static RAM based FPGAs allows the computational capabilities of a node to be dynamically matched to the computational requirements of an application. Most reconfigurable computers in existenc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Implementation of single precision floating point square root on FPGAs

    Publication Year: 1997, Page(s):226 - 232
    Cited by:  Papers (21)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (532 KB)

    The square root operation is hard to implement on FPGAs because of the complexity of the algorithms. In this paper, we present a non-restoring square root algorithm and two very simple single precision floating point square root implementations based on the algorithm on FPGAs. One is low-cost iterative implementation that uses a traditional adder/subtracter. The operation latency is 25 clock cycle... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On acceleration of the check tautology logic synthesis algorithm using an FPGA-based reconfigurable coprocessor

    Publication Year: 1997, Page(s):246 - 247
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (204 KB)

    We summarize our study on implementing tautology checking, a fundamental logic synthesis algorithm, using an FPGA based reconfigurable application specific coprocessor. The use of the tautology checking algorithm is first discussed followed by the specifics of hardware accelerator implementation and interface to application software. We compare our hardware accelerator for the tautology check algo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The swappable logic unit: a paradigm for virtual hardware

    Publication Year: 1997, Page(s):77 - 86
    Cited by:  Papers (36)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (872 KB)

    Swappable Logic Units (SLUs) were introduced by the author previously (1996) to play a role in virtual hardware subsystems that is analogous to the role of pages or segments in virtual memory subsystems. The intention is that a conventional operating system can be extended to manage SLU circuitry implemented using FPGA real estate. In order to minimise operating system overheads, two particular SL... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • FPGA synthesis on the XC6200 using IRIS and Trianus/Hades (or from heaven to hell and back again)

    Publication Year: 1997, Page(s):155 - 164
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (724 KB)

    The implementation of a number of FIR filter structures in the Xilinx XC6200 technology is presented. The designs have been implemented using a combination of IRIS, an architectural synthesis tool and Trianus/Hades a set of integrated tools for implementing algorithms on Custom Computing Machines. The main attraction of this approach is that it allows algorithms to be compiled quickly allowing per... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Real-time stereo vision on the PARTS reconfigurable computer

    Publication Year: 1997, Page(s):201 - 210
    Cited by:  Papers (63)  |  Patents (17)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (816 KB)

    The paper describes a powerful, scalable, reconfigurable computer called the PARTS engine. The PARTS engine consists of 16 Xilinx 4025 FPGAs, and 16 one-megabyte SRAMs. The FPGAs are connected in a partial torus-each associated with two adjacent SRAMs. The SRAMs are tightly coupled to the FPGAs so that all the SRAMs can be accessed concurrently. The PARTS engine fits on a standard PCI card in a pe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An FPGA-based coprocessor for ATM firewalls

    Publication Year: 1997, Page(s):30 - 39
    Cited by:  Papers (10)  |  Patents (45)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1188 KB)

    This implementation of the firewall enables a high degree of traffic selectability yet avoids the usual performance penalty associated with IP level firewalls. This approach is applicable to high-speed broadband networks, and asynchronous transfer mode (ATM) networks are addressed in particular. Security management is achieved through a new technique of active connection management with authentica... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Systems performance measurement on PCI Pamette

    Publication Year: 1997, Page(s):125 - 133
    Cited by:  Papers (13)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (804 KB)

    We describe the use of a reconfigurable board to obtain information on the performance that can be expected on particular systems. Our goal is to use the reconfigurability, of the board's interface to test a system and discover not only the maximum bandwidth and best latency attainable, but also the way to reliably achieve these figures. The board we present uses the now widespread PCI bus. PCI is... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Laser defect correction applications to FPGA based custom computers

    Publication Year: 1997, Page(s):240 - 241
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (172 KB)

    The complexity and speed of monolithic FPGA based custom computers has been set by the presence of defective sections which limit chip area. Test FPGAs show that laser link defect avoidance routing around flawed blocks generates delays <50% of active switches, making the error cell distribution nearly invisible View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Compilation tools for run-time reconfigurable designs

    Publication Year: 1997, Page(s):56 - 65
    Cited by:  Papers (30)  |  Patents (41)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (828 KB)

    This paper describes a framework and tools for automating the production of designs which can be partially reconfigured at run time. The tools include: a partial evaluator, which produces configuration files for a given design, where the number of configurations can be minimised by a process, known as compile-time sequencing; an incremental configuration calculator, which takes the output of the p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Acceleration of an FPGA router

    Publication Year: 1997, Page(s):175 - 181
    Cited by:  Papers (8)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (544 KB)

    The authors describe their experience and progress in accelerating an FPGA router. Placement and routing is undoubtedly the most time-consuming process in automatic chip design or configuring programmable logic devices as reconfigurable computing elements. Their goal is to accelerate routing of FPGAs by 10 fold with a combination of processor clusters and hardware acceleration. Coarse-grain parall... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Comparison of arithmetic architectures for Reed-Solomon decoders in reconfigurable hardware

    Publication Year: 1997, Page(s):219 - 225
    Cited by:  Papers (11)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (584 KB)

    Reed-Solomon (RS) error correction codes are being widely used in modern communication systems such as compact disk players or satellite communication links. RS codes rely on arithmetic in finite, or Galois fields. The specific field GF(28) is of central importance for many practical systems. The most costly, and thus most critical, elementary operations in RS decoders are multiplicatio... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Efficient implementation of the DCT on custom computers

    Publication Year: 1997, Page(s):244 - 245
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (152 KB)

    The discrete cosine transform (DCT) is a key step in many image and video coding applications, and its efficient implementation has been extensively studied for software implementations and for custom VLSI. We analyse the use of the distributed arithmetic algorithm for the efficient implementation of the DCT in reconfigurable logic View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The Chimaera reconfigurable functional unit

    Publication Year: 1997, Page(s):87 - 96
    Cited by:  Papers (88)  |  Patents (23)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (916 KB)

    By strictly separating reconfigurable logic from their host processor, current custom computing systems suffer from a significant communication bottleneck. In this paper we describe Chimaera, a system that overcomes this bottleneck by integrating reconfigurable logic into the host processor itself with direct access to the host processor's register file, the system enables the creation of multi-op... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The RAW benchmark suite: computation structures for general purpose computing

    Publication Year: 1997, Page(s):134 - 143
    Cited by:  Papers (24)  |  Patents (15)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (876 KB)

    The RAW benchmark suite consists of twelve programs designed to facilitate comparing, validating, and improving reconfigurable computing systems. These benchmarks run the gamut of algorithms found in general purpose computing, including sorting, matrix operations, and graph algorithms. The suite includes an architecture-independent compilation framework, Raw Computation Structures (RawCS), to expr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A dynamic reconfiguration run-time system

    Publication Year: 1997, Page(s):66 - 75
    Cited by:  Papers (29)  |  Patents (29)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1008 KB)

    The feasibility of run-time reconfiguration of FPGAs has been established by a large number of case studies. However, these systems have typically involved an ad hoc combination of hardware and software. The software that manages the dynamic reconfiguration is typically specialised to one application and one hardware configuration. We present three different applications of dynamic reconfiguration... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fault simulation on reconfigurable hardware

    Publication Year: 1997, Page(s):182 - 190
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (684 KB)

    The authors introduce a new approach to fault simulation, using reconfigurable hardware to implement a critical path tracing algorithm. The performance estimate shows that the approach is at least on order of magnitude faster than serial fault emulation used in prior work View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.