Research And Application Of Parallel Computing Algorithms For Statistical Phylogenetic Inference

dc.contributor.advisorCummings, Michael Pen_US
dc.contributor.authorAyres, Daniel L.en_US
dc.contributor.departmentBiologyen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2017-09-14T05:41:36Z
dc.date.available2017-09-14T05:41:36Z
dc.date.issued2017en_US
dc.description.abstractEstimating the evolutionary history of organisms, phylogenetic inference, is a critical step in many analyses involving biological sequence data such as DNA. The likelihood calculations at the heart of the most effective methods for statistical phylogenetic analyses are extremely computationally intensive, and hence these analyses become a bottleneck in many studies. Recent progress in computer hardware, specifically the increase in pervasiveness of highly parallel, many-core processors has created opportunities for new approaches to computationally intensive methods, such as those in phylogenetic inference. We have developed an open source library, BEAGLE, which uses parallel computing methods to greatly accelerate statistical phylogenetic inference, for both maximum likelihood and Bayesian approaches. BEAGLE defines a uniform application programming interface and includes a collection of efficient implementations that use NVIDIA CUDA, OpenCL, and C++ threading frameworks for evaluating likelihoods under a wide variety of evolutionary models, on GPUs as well as on multi-core CPUs. BEAGLE employs a number of different parallelization techniques for phylogenetic inference, at different granularity levels and for distinct processor architectures. On CUDA and OpenCL devices, the library enables concurrent computation of site likelihoods, data subsets, and independent subtrees. The general design features of the library also provide a model for software development using parallel computing frameworks that is applicable to other domains. BEAGLE has been integrated with some of the leading programs in the field, such as MrBayes and BEAST, and is used in a diverse range of evolutionary studies, including those of disease causing viruses. The library can provide significant performance gains, with the exact increase in performance depending on the specific properties of the data set, evolutionary model, and hardware. In general, nucleotide analyses are accelerated on the order of 10-fold and codon analyses on the order of 100-fold.en_US
dc.identifierhttps://doi.org/10.13016/M2FQ9Q584
dc.identifier.urihttp://hdl.handle.net/1903/19951
dc.language.isoenen_US
dc.subject.pqcontrolledBioinformaticsen_US
dc.subject.pqcontrolledComputer scienceen_US
dc.subject.pquncontrolledBayes methodsen_US
dc.subject.pquncontrolledBiology computingen_US
dc.subject.pquncontrolledEvolution (biology)en_US
dc.subject.pquncontrolledMaximum likelihood estimationen_US
dc.subject.pquncontrolledParallel programmingen_US
dc.subject.pquncontrolledPhylogenyen_US
dc.titleResearch And Application Of Parallel Computing Algorithms For Statistical Phylogenetic Inferenceen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ayres_umd_0117E_18316.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format