Optimizing for a Many-Core Architecture without Compromising Ease-of-Programming

dc.contributor.advisor: Vishkin, Uzi
dc.contributor.advisor: Barua, Rajeev
dc.contributor.author: Caragea, George Constantin
dc.contributor.department: Computer Science
dc.contributor.publisher: Digital Repository at the University of Maryland
dc.contributor.publisher: University of Maryland (College Park, Md.)
dc.date.accessioned: 2011-10-08T06:31:11Z
dc.date.available: 2011-10-08T06:31:11Z
dc.date.issued: 2011
dc.description.abstract: Faced with nearly stagnant clock speed advances, chip manufacturers have turned to parallelism as the source of continuing performance improvements. But even though numerous parallel architectures have already been brought to market, a universally accepted methodology for programming them for general-purpose applications has yet to emerge. Existing solutions tend to be hardware-specific, which makes them difficult to use for the majority of application programmers and domain experts and provides no scalability guarantees for future generations of the hardware. This dissertation advances the validation of the following thesis: it is possible to develop efficient general-purpose programs for a many-core platform using a model recognized for its simplicity. To prove this thesis, we refer to the eXplicit Multi-Threading (XMT) architecture designed and built at the University of Maryland. XMT is an attempt at re-inventing parallel computing with a solid theoretical foundation and an aggressively scalable design. Algorithmically, XMT is inspired by the PRAM (Parallel Random Access Machine) model, and the architecture design focuses on reducing inter-task communication and synchronization overheads while providing an easy-to-program parallel model.

This thesis builds upon the existing XMT infrastructure to improve support for efficient execution, with a focus on ease-of-programming. Our contributions aim to reduce the programmer's effort in developing XMT applications and to improve overall performance. More concretely, we: (1) present a work-flow guiding programmers to produce efficient parallel solutions starting from a high-level problem; (2) introduce an analytical performance model for XMT programs and provide a methodology to project running time from an implementation; (3) propose and evaluate RAP, an improved resource-aware compiler loop prefetching algorithm targeted at fine-grained many-core architectures, demonstrating performance improvements of up to 34.79% on average over the GCC loop prefetching implementation and up to 24.61% on average over a simple hardware prefetching scheme; and (4) implement a number of parallel benchmarks and evaluate the overall performance of XMT relative to existing serial and parallel solutions, showing speedups of up to 13.89x vs. a serial processor and 8.10x vs. parallel code optimized for an existing many-core architecture (GPU).

We also discuss the implementation and optimization of the Max-Flow algorithm on XMT, a problem that is among the more advanced in the parallel algorithms community in terms of complexity, benchmarking, and research interest. We demonstrate better speedups relative to a best serial solution than previous attempts on other parallel platforms. (An illustrative XMTC-style code sketch of the spawn-join programming model appears after the metadata fields below.)
dc.identifier.uri: http://hdl.handle.net/1903/12062
dc.subject.pqcontrolled: Computer science
dc.subject.pqcontrolled: Computer engineering
dc.subject.pquncontrolled: benchmark
dc.subject.pquncontrolled: compiler
dc.subject.pquncontrolled: maxflow
dc.subject.pquncontrolled: parallel
dc.subject.pquncontrolled: PRAM
dc.subject.pquncontrolled: prefetching
dc.title: Optimizing for a Many-Core Architecture without Compromising Ease-of-Programming
dc.type: Dissertation
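
To ground the ease-of-programming claim in the abstract, the following is a minimal, illustrative sketch in XMTC-like C of a PRAM-style array-compaction step. It is not code from the dissertation; the spawn(low, high) construct, the per-thread ID $, the ps prefix-sum primitive, and the arrays A and B of length n are assumptions made here for exposition.

    /* Illustrative XMTC-style sketch (assumed syntax, not from the dissertation).
     * spawn(low, high) launches one virtual thread per index in [low, high];
     * $ names the current thread's index; ps(inc, base) atomically adds inc to
     * the shared counter base and returns base's previous value in inc.
     * The block compacts the non-zero elements of A[0..n-1] into B. */
    int base = 0;                 /* next free slot in the output array B */
    spawn(0, n - 1) {
        if (A[$] != 0) {
            int slot = 1;         /* request one output slot */
            ps(slot, base);       /* slot now holds the claimed index */
            B[slot] = A[$];
        }
    }
    /* Implicit join: all threads finish before execution continues here,
     * and base then equals the number of elements copied into B. */

The point of the sketch is that the lock-free slot assignment and the barrier at the end of the spawn block come from the language itself rather than from hand-written synchronization, which is the kind of programming simplicity the abstract refers to.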

Files

Original bundle
Name: Caragea_umd_0117E_12318.pdf
Size: 1.23 MB
Format: Adobe Portable Document Format