Critical Assessment of Short-Read Assemblers for the Metagenomic Identification of Foodborne and Waterborne Pathogens Using Simulated Bacterial Communities

dc.contributor.authorChen, Zhao
dc.contributor.authorMeng, Jianghong
dc.date.accessioned2023-10-16T18:22:48Z
dc.date.available2023-10-16T18:22:48Z
dc.date.issued2022-12-06
dc.description.abstractMetagenomics offers the highest level of strain discrimination of bacterial pathogens from complex food and water microbiota. With the rapid evolvement of assembly algorithms, defining an optimal assembler based on the performance in the metagenomic identification of foodborne and waterborne pathogens is warranted. We aimed to benchmark short-read assemblers for the metagenomic identification of foodborne and waterborne pathogens using simulated bacterial communities. Bacterial communities on fresh spinach and in surface water were simulated by generating paired-end short reads of Illumina HiSeq, MiSeq, and NovaSeq at different sequencing depths. Multidrug-resistant Salmonella Indiana SI43 and Pseudomonas aeruginosa PAO1 were included in the simulated communities on fresh spinach and in surface water, respectively. ABySS, IDBA-UD, MaSuRCA, MEGAHIT, metaSPAdes, and Ray Meta were benchmarked in terms of assembly quality, identifications of plasmids, virulence genes, Salmonella pathogenicity island, antimicrobial resistance genes, chromosomal point mutations, serotyping, multilocus sequence typing, and whole-genome phylogeny. Overall, MEGHIT, metaSPAdes, and Ray Meta were more effective for metagenomic identification. We did not obtain an optimal assembler when using the extracted reads classified as Salmonella or P. aeruginosa for downstream genomic analyses, but the extracted reads showed consistent phylogenetic topology with the reference genome when they were aligned with Salmonella or P. aeruginosa strains. In most cases, HiSeq, MiSeq, and NovaSeq were comparable at the same sequencing depth, while higher sequencing depths generally led to more accurate results. As assembly algorithms advance and mature, the evaluation of assemblers should be a continuous process.
dc.description.urihttps://doi.org/10.3390/microorganisms10122416
dc.identifierhttps://doi.org/10.13016/dspace/gloj-2xle
dc.identifier.citationChen, Z.; Meng, J. Critical Assessment of Short-Read Assemblers for the Metagenomic Identification of Foodborne and Waterborne Pathogens Using Simulated Bacterial Communities. Microorganisms 2022, 10, 2416.
dc.identifier.urihttp://hdl.handle.net/1903/31023
dc.language.isoen_US
dc.publisherMDPI
dc.relation.isAvailableAtCollege of Agriculture & Natural Resourcesen_us
dc.relation.isAvailableAtNutrition & Food Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectillumina sequencing
dc.subjectmetagenome assembly
dc.subjectshort-read aseembly
dc.subjectshort-read assembler
dc.subjectfoodborne pathogen
dc.subjectwaterborne pathogen
dc.subjectspinach
dc.subjectsurface water
dc.titleCritical Assessment of Short-Read Assemblers for the Metagenomic Identification of Foodborne and Waterborne Pathogens Using Simulated Bacterial Communities
dc.typeArticle
local.equitableAccessSubmissionNo

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
microorganisms-10-02416.pdf
Size:
2.29 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.55 KB
Format:
Item-specific license agreed upon to submission
Description: