Dinoflagellate Genomic Organization and Phylogenetic Marker Discovery Utilizing Deep Sequencing Data

dc.contributor.advisorDelwiche, Charles Fen_US
dc.contributor.authorMendez, Gregory Scotten_US
dc.contributor.departmentCell Biology & Molecular Geneticsen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2016-09-03T05:44:32Z
dc.date.available2016-09-03T05:44:32Z
dc.date.issued2016en_US
dc.description.abstractDinoflagellates possess large genomes in which most genes are present in many copies. This has made studies of their genomic organization and phylogenetics challenging. Recent advances in sequencing technology have made deep sequencing of dinoflagellate transcriptomes feasible. This dissertation investigates the genomic organization of dinoflagellates to better understand the challenges of assembling dinoflagellate transcriptomic and genomic data from short read sequencing methods, and develops new techniques that utilize deep sequencing data to identify orthologous genes across a diverse set of taxa. To better understand the genomic organization of dinoflagellates, a genomic cosmid clone of the tandemly repeated gene Alchohol Dehydrogenase (AHD) was sequenced and analyzed. The organization of this clone was found to be counter to prevailing hypotheses of genomic organization in dinoflagellates. Further, a new non-canonical splicing motif was described that could greatly improve the automated modeling and annotation of genomic data. A custom phylogenetic marker discovery pipeline, incorporating methods that leverage the statistical power of large data sets was written. A case study on Stramenopiles was undertaken to test the utility in resolving relationships between known groups as well as the phylogenetic affinity of seven unknown taxa. The pipeline generated a set of 373 genes useful as phylogenetic markers that successfully resolved relationships among the major groups of Stramenopiles, and placed all unknown taxa on the tree with strong bootstrap support. This pipeline was then used to discover 668 genes useful as phylogenetic markers in dinoflagellates. Phylogenetic analysis of 58 dinoflagellates, using this set of markers, produced a phylogeny with good support of all branches. The Suessiales were found to be sister to the Peridinales. The Prorocentrales formed a monophyletic group with the Dinophysiales that was sister to the Gonyaulacales. The Gymnodinales was found to be paraphyletic, forming three monophyletic groups. While this pipeline was used to find phylogenetic markers, it will likely also be useful for finding orthologs of interest for other purposes, for the discovery of horizontally transferred genes, and for the separation of sequences in metagenomic data sets.en_US
dc.identifierhttps://doi.org/10.13016/M2RV37
dc.identifier.urihttp://hdl.handle.net/1903/18615
dc.language.isoenen_US
dc.subject.pqcontrolledBiologyen_US
dc.subject.pqcontrolledBioinformaticsen_US
dc.subject.pqcontrolledMolecular biologyen_US
dc.subject.pquncontrolledalgaeen_US
dc.subject.pquncontrolledbioinformaticsen_US
dc.subject.pquncontrolleddinoflagellateen_US
dc.subject.pquncontrolledphylogeneticsen_US
dc.subject.pquncontrolledstramenopileen_US
dc.subject.pquncontrolledtranscriptomicsen_US
dc.titleDinoflagellate Genomic Organization and Phylogenetic Marker Discovery Utilizing Deep Sequencing Dataen_US
dc.typeDissertationen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mendez_umd_0117E_17496.pdf
Size:
5.01 MB
Format:
Adobe Portable Document Format