Feasibility Study of Scaling an XMT Many-Core

dc.contributor.authorO'Brien, Sean
dc.contributor.authorVishkin, Uzi
dc.contributor.authorEdwards, James
dc.contributor.authorWaks, Edo
dc.contributor.authorYang, Bao
dc.date.accessioned2015-02-26T22:25:37Z
dc.date.available2015-02-26T22:25:37Z
dc.date.issued2015-01-19
dc.description.abstractThe reason for recent focus on communication avoidance is that high rates of data movement become infeasible due to excessive power dissipation. However, shifting the responsibility of minimizing data movement to the parallel algorithm designer comes at significant costs to programmer’s productivity, as well as: (i) reduced speedups and (ii) the risk of repelling application developers from adopting parallelism. The UMD Explicit Multi-Threading (XMT) framework has demonstrated advantages on ease of parallel programming through its support of PRAM-like programming, combined with strong, often unprecedented speedups. Such programming and speedups involve considerable data movement between processors and shared memory. Another reason that XMT is a good test case for a study of data movement is that XMT permits isolation and direct study of most of its data movement (and its power dissipation). Our new results demonstrate that an XMT single-chip many-core processor with tens of thousands of cores and a high throughput network on chip is thermally feasible, though at some cost. This leads to a perhaps game-changing outcome: instead of imposing upfront strict restrictions on data movement, as advocated in a recent report from the National Academies, opt for due diligence that accounts for the full impact on cost. For example, does the increased cost due to communication avoidance (including programmer’s productivity, reduced speedups and desertion risk) indeed offset the cost of the solution we present? More specifically, we investigate in this paper the design of an XMT many-core for 3D VLSI with microfluidic cooling. We used state-of-the-art simulation tools to model the power and thermal properties of such an architecture with 8k to 64k lightweight cores, requiring between 2 and 8 silicon layers. Inter-chip communication using silicon compatible photonics is also considered. We found that, with the use of microfluidic cooling, power dissipation becomes a cost issue rather than a feasibility constraint. Robustness of the results is also discussed.en_US
dc.description.sponsorshipDARPA, NSF, NIHen_US
dc.identifierhttps://doi.org/10.13016/M2XK7J
dc.identifier.urihttp://hdl.handle.net/1903/16316
dc.language.isoen_USen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectdata movementen_US
dc.subjectparallel algorithmsen_US
dc.subjectPRAMen_US
dc.subjectparallel architectureen_US
dc.subjectmany-coreen_US
dc.subject3D VLSIen_US
dc.subjectmicrofluidic coolingen_US
dc.subjectsilicon photonicsen_US
dc.titleFeasibility Study of Scaling an XMT Many-Coreen_US
dc.typeOtheren_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SPAA15submission.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: