Designing Practical Efficient Algorithms for Symmetric Multiprocessors

dc.contributor.authorHelman, David R.en_US
dc.contributor.authorJaJa, Josephen_US
dc.date.accessioned2004-05-31T22:52:22Z
dc.date.available2004-05-31T22:52:22Z
dc.date.created1998-10en_US
dc.date.issued1998-10-15en_US
dc.description.abstractSymmetric multiprocessors (SMPs) dominate the high-end server market and are currently the primary candidate for constructing large scale multiprocessor systems. Yet, the design of efficient parallel algorithms for this platform currently poses several challenges. In this paper, we present a computational model for designing efficient algorithms for symmetric multiprocessors. We then use this model to create efficient solutions to two widely different types of problems - linked list prefix computations and generalized sorting. Our novel algorithm for prefix computations builds upon the sparse ruling set approach of Reid-Miller and Blelloch. Besides being somewhat simpler and requiring nearly half the number of memory accesses, we can bound our complexity with high probability instead of merely on average. Our algorithm for generalized sorting is a modification of our algorithm for sorting by regular sampling on distributed memory architectures. The algorithm is a stable sort which appears to be asymptotically faster than any of the published algorithms for SMPs. Both of our algorithms were implemented in C using POSIX threads and run on three symmetric multiprocessors - the DEC AlphaServer, the Silicon Graphics Power Challenge, and the HP-Convex Exemplar. We ran our code for each algorithm using a variety of benchmarks which we identified to examine the dependence of our algorithm on memory access patterns. In spite of the fact that the processors must compete for access to main memory, both algorithms still yielded scalable performance up to 16 processors, which was the largest platform available to us. For some problems, our prefix computation algorithm actually matched or exceeded the performance of the best sequential solution using only a single thread. Similarly, our generalized sorting algorithm always beat the performance of sequential merge sort by at least an order of magnitude, even with a single thread. (Also cross-referenced as UMIACS-TR-98-44)en_US
dc.format.extent842124 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/961
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-3928en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-98-44en_US
dc.titleDesigning Practical Efficient Algorithms for Symmetric Multiprocessorsen_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
CS-TR-3928.ps
Size:
822.39 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-3928.pdf
Size:
290.32 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-3928.ps