An improved genome reference for the African cichlid, Metriaclima zebra

dc.contributor.authorConte, Matthew A.
dc.contributor.authorKocher, Thomas D.
dc.date.accessioned2017-08-30T15:39:56Z
dc.date.available2017-08-30T15:39:56Z
dc.date.issued2015
dc.descriptionFunding for Open Access provided by the UMD Libraries Open Access Publishing Fund.en_US
dc.description.abstractBackground: Problems associated with using draft genome assemblies are well documented and have become more pronounced with the use of short read data for de novo genome assembly. We set out to improve the draft genome assembly of the African cichlid fish, Metriaclima zebra, using a set of Pacific Biosciences SMRT sequencing reads corresponding to 16.5x coverage of the genome. Here we characterize the improvements that these long reads allowed us to make to the state-of-the-art draft genome previously assembled from short read data. Results: Our new assembly closed 68 % of the existing gaps and added 90.6Mbp of new non-gap sequence to the existing draft assembly of M. zebra. Comparison of the new assembly to the sequence of several bacterial artificial chromosome clones confirmed the accuracy of the new assembly. The closure of sequence gaps revealed thousands of new exons, allowing significant improvement in gene models. We corrected one known misassembly, and identified and fixed other likely misassemblies. 63.5 Mbp (70 %) of the new sequence was classified as repetitive and the new sequence allowed for the assembly of many more transposable elements. Conclusions: Our improvements to the M. zebra draft genome suggest that a reasonable investment in long reads could greatly improve many comparable vertebrate draft genome assemblies.en_US
dc.description.urihttps://doi.org/10.1186/s12864-015-1930-5
dc.identifierhttps://doi.org/10.13016/M2736M25W
dc.identifier.citationConte, M.A., Kocher, T.D. An improved genome reference for the African cichlid, Metriaclima zebra . BMC Genomics 16, 724 (2015).en_US
dc.identifier.urihttp://hdl.handle.net/1903/19673
dc.language.isoen_USen_US
dc.publisherBioMed Centralen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Physical Sciencesen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtBiologyen_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectAfrican cichlid fishen_US
dc.subjectGenome assemblyen_US
dc.subjectPacific Biosciences SMRT sequencingen_US
dc.subjectTransposable elementsen_US
dc.titleAn improved genome reference for the African cichlid, Metriaclima zebraen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s12864-015-1930-5.pdf
Size:
2.76 MB
Format:
Adobe Portable Document Format
Description: