Biology Theses and Dissertations
Permanent URI for this collectionhttp://hdl.handle.net/1903/2749
Browse
2 results
Search Results
Item ECOLOGICAL APPLICATIONS OF MACHINE LEARNING TO DIGITIZED NATURAL HISTORY DATA(2022) Robillard, Alexander John; Rowe, Christopher; Bailey, Helen; Marine-Estuarine-Environmental Sciences; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)Natural history collections are a valuable resource for assessment of biodiversity and species decline. Over the past few decades, digitization of specimens has increased the accessibility and value of these collections. As such the number and size of these digitized data sets have outpaced the tools needed to evaluate them. To address this, researchers have turned to machine learning to automate data-driven decisions. Specifically, applications of deep learning to complex ecological problems is becoming more common. As such, this dissertation aims to contribute to this trend by addressing, in three distinct chapters, conservation, evolutionary and ecological questions using deep learning models. For example, in the first chapter we focus on current regulations prohibiting the sale and distribution of hawksbill sea turtle derived products, which continues internationally in physical and online marketplaces. To curb the sale of illegal tortoiseshell, application of new technologies like convolutional neural networks (CNNs) is needed. Therein we describe a curated data set (n = 4,428) which was used to develop a CNN application we are calling “SEE Shell”, which can identify real and faux hawksbill derived products from image data. Developed on a MobileNetV2 using TensorFlow, SEE Shell was tested against a validation (n = 665) and test (n = 649) set where it achieved an accuracy between 82.6-92.2% correctness depending on the certainty threshold used. We expect SEE Shell will give potential buyers more agency in their purchasing decision, in addition to enabling retailers to rapidly filter their online marketplaces. In the second chapter we focus on recent research which utilized geometric morphometrics, associated genetic data, and Principal Component Analysis to successfully delineate Chelonia mydas (green sea turtle) morphotypes from carapace measurements. Therein we demonstrate a similar, yet more rapid approach to this analysis using computer vision models. We applied a U-Net to isolate carapace pixels of (n = 204) of juvenile C. mydas from multiple foraging grounds across the Eastern Pacific, Western Pacific, and Western Atlantic. These images were then sorted based on general alignment (shape) and coloration of the pixels within the image using a pre-trained computer vision model (MobileNetV2). The dimensions of these data were then reduced and projected using Universal Manifold Approximation and Projection. Associated vectors were then compared to simple genetic distance using a Mantel test. Data points were then labeled post-hoc for exploratory analysis. We found clear congruence between carapace morphology and genetic distance between haplotypes, suggesting that our image data have biological relevance. Our findings also suggest that carapace morphotype is associated with specific haplotypes within C. mydas. Our cluster analysis (k = 3) corroborates past research which suggests there are at least three morphotypes from across the Eastern Pacific, Western Pacific, and Western Atlantic. Finally, within the third chapter we discuss the sharp increase in agricultural and infrastructure development and the paucity of widespread data available to support conservation management decisions around the Amazon. To address these issues, we outline a more rapid and accurate tool for identifying fish fauna in the world's largest freshwater ecosystem, the Amazon. Current strategies for identification of freshwater fishes require high levels of training and taxonomic expertise for morphological identification or genetic testing for species recognition at a molecular level. To overcome these challenges, we built an image masking model (U-Net) and a CNN to mask and classify Amazonian fish in photographs. Fish used to generate training data were collected and photographed in tributaries in seasonally flooded forests of the upper Morona River valley in Loreto, Peru in 2018 and 2019. Species identifications in the training images (n = 3,068) were verified by expert ichthyologists. These images were supplemented with photographs taken of additional Amazonian fish specimens housed in the ichthyological collection of the Smithsonian’s National Museum of Natural History. We generated a CNN model that identified 33 genera of fishes with a mean accuracy of 97.9%. Wider availability of accurate freshwater fish image recognition tools, such as the one described here, will enable fishermen, local communities, and citizen scientists to more effectively participate in collecting and sharing data from their territories to inform policy and management decisions that impact them directly.Item Diversity, Invasibility, and Resource Use in Marine Fouling Communities of San Francisco Bay(2011) Altman, Safra; Reaka, Marjorie L; Hines, Anson H; Behavior, Ecology, Evolution and Systematics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)Invasive species threaten the biodiversity of estuaries worldwide. To examine the relationships between biodiversity, invasibility, and invasion success, I conducted field surveys and experiments in San Francisco Bay marine fouling communities, including 1) surveys to estimate alpha, gamma, and beta diversity of native, non-native and cryptogenic components of the community; 2) experiments to assess the influence of diversity and resource availability on short-term recruitment of novel non-indigenous species (NIS) into test communities and subsequent community development over time; and 3) an experiment to explore the role of facilitative interactions of NIS in the diversity-invasibility relationship. Surveys (10-24 sites) showed that non-native alpha diversity was significantly greater than native or cryptogenic alpha diversity, beta diversity was significantly greater for native and cryptogenic species than for NIS, and gamma diversity was similar for NIS and native species. These results indicate that native species had high turn over from site to site while NIS were spread throughout the Bay. Experiments showed that on short time scales (2-4 weeks), the effect of initial diversity on the density of recruitment of NIS was significant and negative, with no effect of resource level (increased open space). Changes in community composition over time (2-24 weeks) also indicated significant inverse relationships between percent cover of NIS and diversity of the initial community with no evidence of a resource effect. Abundant NIS occupied less space in communities with higher initial diversity. However, the same NIS occupied (i.e., had invaded) all experimental communities regardless of starting diversity. Additional experiments revealed that recruitment to secondary substrates did not vary significantly with invasive species diversity or resource availability. When total recruitment to primary and secondary substrates were combined, there was no longer a significant relationship between diversity and recruitment. Analysis of secondary settlement patterns revealed that some NIS, such as Bugula neritina, were facilitating recruitment and settlement of additional NIS. In contrast, other species, such as Clathria prolifera and Botryllus schlosseri, inhibited secondary settlement of NIS. The influence of diversity and primary resource availability on secondary settlement did not appear to affect settlement on facilitative species, but reduced settlement on inhibitive species.