Fast Multipole Methods on Graphics Processors

dc.contributor.authorGumerov, Nail A.
dc.contributor.authorDuraiswami, Ramani
dc.date.accessioned2008-03-07T14:12:17Z
dc.date.available2008-03-07T14:12:17Z
dc.date.issued2007-11
dc.description.abstractThe Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at points distributed inside a computational domain at a large number of evaluation points to a specified accuracy $\epsilon$. The method scales as $O (N)$ in both time and memory compared to the direct method with complexity $O(N^2)$, which allows the solution of larger problems with given resources. Graphical processing units (GPU) are now increasingly viewed as data parallel compute coprocessors that can provide significant computational performance at low price. We describe acceleration of the FMM using the data parallel GPU architecture. The FMM has a complex hierarchical (adaptive) structure, which is not easily implemented on data parallel processors. We described strategies for parallelization of all components of the FMM, develop a model to explain the performance of the algorithm on the GPU architecture; and determined optimal settings for the FMM on the GPU, which are different from those on usual CPUs. Some innovations in the FMM algorithm, including the use of modified stencils, real polynomial basis functions for the Laplace kernel, and decompositions of the translation operators, are also described. We obtained accelerations of the Laplace kernel FMMon a single NVIDIA GeForce 8800 GTX GPU in the range of 30-60 compared to a serial CPU FMM implementation. For a problem with a million sources, the summations involved are performed in approximately one second. This performance is equivalent to solving of the same problem at a 43 Teraflop rate if we use straightforward summation.en
dc.format.extent569646 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.citationFantalgo LLC, Technical Report 2007-2en
dc.identifier.urihttp://hdl.handle.net/1903/7549
dc.language.isoen_USen
dc.relation.isAvailableAtCollege of Computer, Mathematical & Physical Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectFast Multipole Methodsen
dc.subjectGraphics processorsen
dc.subjectGPUsen
dc.subjectCoulomb kernelen
dc.titleFast Multipole Methods on Graphics Processorsen
dc.typePreprinten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
paper_gpu_fmm_revised_final.pdf
Size:
556.29 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.81 KB
Format:
Item-specific license agreed upon to submission
Description: