XMT-M: A Scalable Decentralized Processor

dc.contributor.authorBerkovich, Efraimen_US
dc.contributor.authorNuzman, Josephen_US
dc.contributor.authorFranklin, Manojen_US
dc.contributor.authorJacob, Bruceen_US
dc.contributor.authorVishkin, Uzien_US
dc.date.accessioned2004-05-31T22:59:28Z
dc.date.available2004-05-31T22:59:28Z
dc.date.created1999-09en_US
dc.date.issued1999-10-09en_US
dc.description.abstractA defining challenge for research in computer science and engineering has been the ongoing quest for reducing the completion time of a single computation task. Even outside the parallel processing communities, there is little doubt that the key to further progress in this quest is to do parallel processing of some kind. A recently proposed parallel processing framework that spans the entire spectrum from (parallel) algorithms to architecture to implementation is the explicit multi-threading (XMT) framework. This framework provides: (i) simple and natural parallel algorithms for essentially every general-purpose application, including notoriously difficult irregular integer applications, and (ii) a multi-threaded programming model for these algorithms which allows an ``independence-of-order'' semantics: every thread can proceed at its own speed, independent of other concurrent threads. To the extent possible, the XMT framework uses established ideas in parallel processing. This paper presents XMT-M, a microarchitecture implementation of the XMT model that is possible with current technology. XMT-M offers an engineering design point that addresses four concerns: buildability, programmability, performance, and scalability. The XMT-M hardware is geared to execute multiple threads in parallel on a single chip: relying on very few new gadgets, it can execute parallel threads without busy-waits! Existing code can be run on XMT-M as a single thread without any modifications, thereby providing backward compatibility for commercial acceptance. Simulation-based studies of XMT-M demonstrate considerable improvements in performance relative to the best serial processor even for small, and therefore practical, input sizes. (Also cross-referenced as UMIACS-TR-99-55)en_US
dc.format.extent1319986 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1030
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4061en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-99-55en_US
dc.titleXMT-M: A Scalable Decentralized Processoren_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4061.ps
Size:
1.26 MB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4061.pdf
Size:
269.73 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4061.ps