Exploiting sparseness in de novo genome assembly

dc.contributor.authorYe, Chengxi
dc.contributor.authorSam Ma, Zhanshan
dc.contributor.authorCannon, Charles H
dc.contributor.authorPop, Mihai
dc.contributor.authorYu, Douglas W
dc.date.accessioned2021-10-28T16:50:48Z
dc.date.available2021-10-28T16:50:48Z
dc.date.issued2012-04-19
dc.description.abstractThe very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the observed k- mers as nodes and the links between these nodes allows the de novo assembly of even moderately-sized genomes (~500 M) on a typical laptop computer. We implement this sparse graph concept in a proof-of-principle software package, SparseAssembler, utilizing a new sparse k- mer graph structure evolved from the de Bruijn graph. We test our SparseAssembler with both simulated and real data, achieving ~90% memory savings and retaining high assembly accuracy, without sacrificing speed in comparison to existing de novo assemblers.en_US
dc.description.urihttps://doi.org/10.1186/1471-2105-13-S6-S1
dc.identifierhttps://doi.org/10.13016/6f48-znm9
dc.identifier.citationYe, C., Ma, Z.S., Cannon, C.H. et al. Exploiting sparseness in de novo genome assembly. BMC Bioinformatics 13, S1 (2012).en_US
dc.identifier.urihttp://hdl.handle.net/1903/28070
dc.language.isoen_USen_US
dc.publisherSpringer Natureen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectGenome Assemblyen_US
dc.subjectMemory Requirementen_US
dc.subjectSequencing Erroren_US
dc.subjectMemory Usageen_US
dc.subjectSparse Graphen_US
dc.titleExploiting sparseness in de novo genome assemblyen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1471-2105-13-S6-S1.pdf
Size:
796.71 KB
Format:
Adobe Portable Document Format
Description: