AGORA: Assembly Guided by Optical Restriction Alignment

dc.contributor.authorLin, Henry
dc.contributor.authorGoldstein, Steve
dc.contributor.authorMendelowitz, Lee
dc.contributor.authorShiguo, Zhou
dc.contributor.authorWetzel, Joshua
dc.contributor.authorSchwartz, David C
dc.contributor.authorPop, Mihai
dc.date.accessioned2013-01-10T21:44:44Z
dc.date.available2013-01-10T21:44:44Z
dc.date.issued2012-08-02
dc.description.abstractBackground: Genome assembly is difficult due to repeated sequences within the genome, which create ambiguities and cause the final assembly to be broken up into many separate sequences (contigs). Long range linking information, such as mate-pairs or mapping data, is necessary to help assembly software resolve repeats, thereby leading to a more complete reconstruction of genomes. Prior work has used optical maps for validating assemblies and scaffolding contigs, after an initial assembly has been produced. However, optical maps have not previously been used within the genome assembly process. Here, we use optical map information within the popular de Bruijn graph assembly paradigm to eliminate paths in the de Bruijn graph which are not consistent with the optical map and help determine the correct reconstruction of the genome. Results: We developed a new algorithm called AGORA: Assembly Guided by Optical Restriction Alignment. AGORA is the first algorithm to use optical map information directly within the de Bruijn graph framework to help produce an accurate assembly of a genome that is consistent with the optical map information provided. Our simulations on bacterial genomes show that AGORA is effective at producing assemblies closely matching the reference sequences. Additionally, we show that noise in the optical map can have a strong impact on the final assembly quality for some complex genomes, and we also measure how various characteristics of the starting de Bruijn graph may impact the quality of the final assembly. Lastly, we show that a proper choice of restriction enzyme for the optical map may substantially improve the quality of the final assembly. Conclusions: Our work shows that optical maps can be used effectively to assemble genomes within the de Bruijn graph assembly framework. Our experiments also provide insights into the characteristics of the mapping data that most affect the performance of our algorithm, indicating the potential benefit of more accurate optical mapping technologies, such as nano-coding.en_US
dc.description.urihttps://doi.org/10.1186/1471-2105-13-189
dc.identifier.citationLin, H.C., Goldstein, S., Mendelowitz, L. et al. AGORA: Assembly Guided by Optical Restriction Alignment. BMC Bioinformatics 13, 189 (2012).en_US
dc.identifier.urihttp://hdl.handle.net/1903/13386
dc.language.isoen_USen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectgenome assemblyen_US
dc.subjectoptical mapsen_US
dc.titleAGORA: Assembly Guided by Optical Restriction Alignmenten_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Lin, et al.pdf
Size:
913.55 KB
Format:
Adobe Portable Document Format