Topological Skeletonization Scripts for Assessing Estimated Malaria Parasite Migration Contours Under Sparse Sampling

dc.contributor.advisorStewart, Kathleen
dc.contributor.authorStewart, Kathleen
dc.contributor.authorLi, Yao
dc.date.accessioned2025-06-04T15:58:50Z
dc.date.issued2025-05
dc.descriptionWe have developed a sample filtering framework that applies topological skeletons for a contour region and kernel density estimation based on malaria parasite sampling locations to assess malaria parasite sample density for each contour and filter contour regions with lower sample density. The Topological Skeletonization Scripts package provides two reproducible Jupyter notebooks that convert Effective-Migration-Surface (EEMS) contour rasters into vector-ready topological skeletons.
dc.description.abstractMalaria parasite gene flow simulation models that utilize georeferenced genomic data have become key tools for inferring parasite migration patterns and diversity through simulation-based approaches. However, the accuracy of the models’ outputs can be compromised when sampling locations are sparse, leading to uncertainty in the results. We have developed a sample filtering framework that applies topological skeletons for a contour region and kernel density estimation based on malaria parasite sampling locations to assess malaria parasite sample density for each contour and filter contour regions with lower sample density. The Topological Skeletonization Scripts package provides two reproducible Jupyter notebooks that convert Effective-Migration-Surface (EEMS) contour rasters into vector-ready topological skeletons. The topo_ske3spots.ipynb script operates on a simulated hexagon-shaped barrier example scenario produced using msprime, a population genetic simulator of ancestry and DNA sequence evolution, while the topo_ske_cam.ipynb notebook processes empirical Plasmodium falciparum malaria parasites collected in Cambodia from 2008 to 2013 (data archived as part of the MalariaGen Community Project). Each notebook implements automated thresholding, morphological cleaning, medial-axis skeletonization (scikit-image), and exports the results as aligned GeoTIFFs, GeoJSON polylines, and diagnostic plots. Developed in Python 3.9 and tested on macOS 10.15 under a Miniconda-managed environment, the workflows utilize open-source libraries (GDAL/rasterio, GeoPandas/Shapely, NumPy, OpenCV).
dc.description.sponsorshipNational Science Foundation under Grant no. BCS2049805
dc.identifierhttps://doi.org/10.13016/nylf-gdne
dc.identifier.urihttp://hdl.handle.net/1903/33904
dc.language.isoen_US
dc.relation.isAvailableAtCollege of Behavioral & Social Sciencesen_us
dc.relation.isAvailableAtGeographyen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/
dc.subjectmalaria
dc.subjectspatial sampling
dc.subjectparasite migration estimation
dc.subjectparasite migration contours
dc.subjecttopological skeletonization
dc.titleTopological Skeletonization Scripts for Assessing Estimated Malaria Parasite Migration Contours Under Sparse Sampling
dc.typeSoftware

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
scripts (1).zip
Size:
11.33 MB
Format:
Unknown data format
Loading...
Thumbnail Image
Name:
README_updated (1).txt
Size:
3.25 KB
Format:
Plain Text