Cool-Cache: A Compiler-Enabled Energy Efficient Data Caching Framework for Embedded/Multimedia Processors

The unique characteristics of multimedia/embedded applications dictate media-sensitive architectural and compiler approaches to reduce the power consumption of the data cache. Our goal is exploring energy savings for embedded/multimedia workloads without sacrificing performance. Here, we present two complementary media-sensitive energy-saving techniques that leverage static information.

Attachment	Size
PDF	814.81 KB

An Analysis of Scalar Memory Accesses in Embedded and Multimedia Systems

In an earlier paper about the FlexCache Project, we described our vision of a multipartitioned cache where memory accesses are separated based on their static predictability and memory footprint, and managed with various compiler controlled techniques supported by instruction set architecture extensions, or with traditional hardware control.

Attachment	Size
PDF	193.45 KB

Cool-Fetch: Compiler-Enabled Power-Aware Fetch Throttling

In this paper, we present an architecture-compiler based approach to reduce energy consumption in the processor. While we mainly target the fetch unit, an important side-effect of our approach is that we obtain energy savings in many other parts in the processor. The explanation is that the fetch unit often runs substantially ahead of execution, bringing in instructions to different stages in the processor that may never be executed.

Attachment	Size
PDF	152.85 KB

Cool-Mem: Combining Statically Speculative Memory Accessing with Selective Address Translation for Energy Efficiency

This paper presents Cool-Mem, a family of memory system architectures that integrate conventional memory system mechanisms, energy-aware address translation, and compilerenabled cache disambiguation techniques, to reduce energy consumption in general purpose architectures.

Attachment	Size
PDF	233.23 KB

The Minimax Cache: An Energy-Efficient Framework for Media Processors

This work is based on our philosophy of providing interlayer system-level power awareness in computing systems. Here, we couple this approach with our vision of multipartitioned memory systems, where memory accesses are separated based on their static predictability and memory footprint and managed with various compiler controlled techniques. We show that media applications are mapped more efficiently when scalar memory accesses are redirected to a minicache.

Attachment	Size
PDF	78.38 KB

Cool-Cache for Hot Multimedia

We claim that the unique characteristics of multimedia applications dictate media-sensitive architectural and compiler approaches to reduce the power consumption of the data cache. Our motivation is exploring energy savings for real-time multimedia workloads without sacrificing performance. Here, we present two complementary mediasensitive energy-saving techniques that leverage static information.

Attachment	Size
PDF	947.86 KB

On Memory Behavior of Scalars in Embedded Multimedia Systems

In an earlier paper about the FlexCache project, we described our vision of a multipartitioned cache where memory accesses are separated based on their static predictability and memory footprint, and managed with various compiler controlled techniques supported by instruction set architecture extensions, or with traditional hardware control.

Attachment	Size
PDF	186.06 KB

LoGPC: Modeling Network Contention in Message-Passing Programs

In many real applications, for example, those with frequent and irregular communication patterns or those using large messages, network contention and contention for message processing resources can be a significant part of the total execution time.This paper presents a new cost model, called LoGPC, that extends the LogP [9] and LogGP [4] models to account for the impact of network contention and network interface DMA behavior on the performance of message passing programs.

Attachment	Size
PDF	509.34 KB

SimpleFit: a Framework for Analyzing Design Tradeoffs in Raw Architectures

The semiconductor industry roadmap projects that advances in VLSI technology will permit more than one billion transistors on a chip by the year 2010. The MIT Raw microprocessor is a proposed architecture that strives to exploit these chip-level resources by implementing thousands of tiles, each comprising a processing element and a small amount of memory, coupled by a static two-dimensional interconnect. A compiler partitions fine-grain instruction-level parallelism across the tiles and statically schedules intertile communication over the interconnect.

Attachment	Size
PDF	1.81 MB

Hot Pages: Software Caching for Raw Microprocessors

This paper describes Hot Pages, a software based solution for managing on-chip data on the MIT Raw Machine, a scalable, parallel,microprocessor architecture. This software system transparently manages the mapping between the program address space

Attachment	Size
PDF	106.13 KB