APPLICATION OF MACHINE LEARNING AND BIOINFORMATICS TO IMPROVE SEAFOOD SAFETY ASSOCIATED WITH VIBRIO SPP.

dc.contributor.advisor: Pradhan, Abani (en_US)
dc.contributor.author: Feng, Shuyi (en_US)
dc.contributor.department: Food Science (en_US)
dc.contributor.publisher: Digital Repository at the University of Maryland (en_US)
dc.contributor.publisher: University of Maryland (College Park, Md.) (en_US)
dc.date.accessioned: 2026-01-28T06:32:42Z
dc.date.issued: 2025 (en_US)
dc.description.abstract: With the increasing availability of whole genome sequencing data for foodborne pathogens, bioinformatics and machine learning have reshaped food safety and public health practice with improved accuracy and efficiency. At the same time, ongoing changes in climatic and environmental conditions are believed to have a profound impact on ocean and seafood safety. Vibrio spp., particularly Vibrio parahaemolyticus and Vibrio vulnificus, are the leading causes of illnesses and outbreaks linked to seafood. Projected changes in climate patterns could expand the distribution of Vibrio spp. both geographically and seasonally, enhance their pathogenicity, and contribute to the development of antimicrobial resistance (AMR) in Vibrio spp., increasing the risks to public health. Therefore, the overarching goal of this dissertation was to explore the potential of bioinformatics and machine learning to improve seafood safety associated with Vibrio spp. under a changing climate. Specifically, regression models were developed using six different machine learning algorithms to predict the concentrations of total and pathogenic V. parahaemolyticus and V. vulnificus in seawater and oyster samples, based on environmental conditions. Robust models were obtained for forecasting levels of total and pathogenic V. parahaemolyticus and V. vulnificus in seawater samples and levels of pathogenic V. parahaemolyticus in oyster samples. Moreover, by coupling pangenome analysis with machine learning classification models, we characterized and differentiated the genomic profiles of V. parahaemolyticus isolated from different sources (environmental, seafood, and clinical) in terms of survival, virulence, and antimicrobial resistance.
Apart from identifying significant survival- and AMR-related gene patterns, we also identified the genes most influential in differentiating seafood and clinical isolates; these genes code for key virulence factors (thermostable direct haemolysin (TDH), TDH-related haemolysin, type III secretion system, and alpha-hemolysin). In addition, we investigated how the choice of bioinformatics pipeline (pangenome, core genome multilocus sequence typing (cgMLST), or whole genome multilocus sequence typing) affects the downstream analysis, namely machine learning models for source attribution of V. parahaemolyticus. cgMLST was identified as the optimal choice considering both pipeline efficiency and model accuracy. Overall, this dissertation advances the use of bioinformatics and machine learning techniques to improve seafood safety. (en_US)
dc.identifier: https://doi.org/10.13016/oxfj-ji98
dc.identifier.uri: http://hdl.handle.net/1903/35116
dc.language.iso: en (en_US)
dc.subject.pqcontrolled: Food science (en_US)
dc.title: APPLICATION OF MACHINE LEARNING AND BIOINFORMATICS TO IMPROVE SEAFOOD SAFETY ASSOCIATED WITH VIBRIO SPP. (en_US)
dc.type: Dissertation (en_US)

Files

Original bundle

Name: Feng_umd_0117E_25707.pdf
Size: 2.59 MB
Format: Adobe Portable Document Format
(RESTRICTED ACCESS)