Search
Now showing items 1-6 of 6
Symbiotic Cache Resizing for CMPs with Shared LLC
(2013-09-11)
This paper investigates the problem of finding the optimal sizes of private caches and a shared LLC in CMPs. Resizing private and shared caches in modern CMPs is one way to squeeze wasteful power consumption out of ...
Memory Performance Analysis for Parallel Programs Using Concurrent Reuse Distance
(2010-10-05)
Performance on multicore processors is determined largely by on-chip
cache. Computer architects have conducted numerous studies in the past
that vary core count and cache capacity as well as problem size to
understand ...
Studying Directory Access Patterns via Reuse Distance Analysis and Evaluating Their Impact on Multi-Level Directory Caches
(2014-01-13)
The trend for multicore CPUs is towards increasing core count. One of
the key limiters to scaling will be the on-chip directory cache. Our
work investigates moving portions of the directory away from the cores,
perhaps ...
Understanding Multicore Cache Behavior of Loop-based Parallel Programs via Reuse Distance Analysis
(2012-01-17)
Understanding multicore memory behavior is crucial, but can be
challenging due to the cache hierarchies employed in modern CPUs. In
today's hierarchies, performance is determined by complex thread
interactions, such as ...
Exploiting Multi-Loop Parallelism on Heterogeneous Microprocessors
(2016-11-10)
Heterogeneous microprocessors integrate CPUs and GPUs on the same chip,
providing fast CPU-GPU communication and enabling cores to compute on
data "in place." These advantages will permit integrated GPUs to exploit
a ...
Design and Evaluation of Monolithic Computers Implemented Using Crossbar ReRAM
(2019-07-16)
A monolithic computer is an emerging architecture in which a multicore CPU and a high-capacity main memory system are all integrated in a single die. We believe such architectures will be possible in the near future due ...