Early Access ArticlesEarly Access articles are made available in advance of the final electronic or print versions. Early Access articles are peer reviewed but may not be fully edited. They are fully citable from the moment they appear in IEEE Xplore.
Web browsing is an important application domain, but it imposes a significant power burden on mobile devices. While the heterogeneous multi-core design offers the potential for energy-efficient computing, existing web browsers fail to exploit the hardware to optimize mobile web browsing. Our work aims to offer a better way to optimize web browsing on heterogeneous mobile devices. We achieve this b... View full abstract»
Many modern embedded CPUs prefer a Non-Coherent DMA (NC-DMA) over a Coherent DMA (C-DMA) because of simplicity. The NC DMA, however, requires a CPU to manually invalidate or flush a wide range of cache space. Especially when an I/O device writes data to a main memory region, the CPU needs to invalidate the cache space corresponding to the same memory regiontwice (1) to prevent dirty cache lines fr... View full abstract»
Dynamic scripting languages become very popular for high productivity. However, many of these languages have significant runtime overheads because they employ interpreter-based virtual machines. One of the major overheads for the interpreter is derived from operand accesses, which significantly increase memory accesses. We propose VMOR, microarchitectural support for the operand accesses in the in... View full abstract»
Hardware-based DRAM cache techniques for GPGPUs propose to use GPU DRAM as a cache of the host (system) memory. However, these approaches do not exploit the opportunity of allocating store-before-load data (data that is written before being read by GPU cores) on GPU DRAM that would save multiple CPU-GPU transactions. In this context, we propose ReDRAM, a novel memory allocation strategy for GPGPUs... View full abstract»
The use of hardware encryption and new memory technologies such as phase change memory (PCM) are gaining popularity in a variety of server applications. While PCM provides energy and density advantages over conventional DRAM memory, it faces endurance challenges. Such challenges are exacerbated when employing memory encryption as the stored data is essentially randomized, losing data locality and ... View full abstract»
The energy-crisis of computing leads to even more sophisticated power management. To facilitate dynamic yet accurate power management, hardware activity has to be tracked and processed at a very fine granularity in both space and time. Specialized activity monitors in the form of performance counters or sensors dispersed across chip serve the purpose. The abundance of such activity monitors, their... View full abstract»
Decoupled look-ahead (DLA) architecture has been shown to be an effective way to improve single-thread performance. However, a default implementation requires an additional core. While an SMT flavor is possible, a naive implementation is inefficient and thus slow. In this paper, we propose an optimized implementation called Bootstrapping that makes DLA just as effective on a single (SMT) core as u... View full abstract»
Genomics is the key to enable the personal customization of medical care. How to fast and energy-efficiently analyze the huge amounts of genomic sequence data generated by next generation sequencing technologies has become one of the most significant challenges facing genomics today. Existing hardware platforms achieve low genome sequencing throughput with significant hardware and power overhead. ... View full abstract»
Aims & Scope
IEEE Computer Architecture Letters is a rigorously peer-reviewed forum for publishing early, high-impact results in the areas of uni- and multiprocessor computer systems, computer architecture, microarchitecture, workload characterization, performance evaluation and simulation techniques, and power-aware computing.
Meet Our Editors
Daniel J. Sorin
Electrical & Computer Engineering
PO Box 90291
Durham, NC 27708