EVOLUTION OF THE CRISPR IMMUNE SYSTEM FROM ECOLOGICAL TO MOLECULAR SCALES
dc.contributor.advisor | Johnson, Philip LF | en_US |
dc.contributor.author | Xiao, Wei | en_US |
dc.contributor.department | Biology | en_US |
dc.contributor.publisher | Digital Repository at the University of Maryland | en_US |
dc.contributor.publisher | University of Maryland (College Park, Md.) | en_US |
dc.date.accessioned | 2024-06-29T05:42:43Z | |
dc.date.available | 2024-06-29T05:42:43Z | |
dc.date.issued | 2024 | en_US |
dc.description.abstract | Bacteria and archaea inhabit environments that constantly face viral infections and other external genetic threats. They have evolved an arsenal of defense strategies to protect themselves. My research delves into the CRISPR immune system, the only known adaptive immune system of prokaryotes. My work explores three different dimensions of the CRISPR immune system, ranging from ecological to molecular scales.From an evolutionary perspective, CRISPR is widely distributed across the prokaryotic tree, underscoring its immune effectiveness. However, the CRISPR distribution is uneven and some lineages are devoid of CRISPR. Here, I identify two ecological drivers of the CRISPR immune system. By analyzing both 16S rRNA data and metagenomic data, I find the CRISPR system is favored in less abundant prokaryotes in the saltwater environment and higher diverse prokaryote communities in the human oral environment. On the molecular level, the CRISPR system selects and cleaves its “favorite” DNA segments (also known as “spacers”) from invading viral genomes to form immune memories. I explore how the spacer sequence composition affects its acquisition rate by the CRISPR system. I develop a convolutional neural network model to predict the spacer acquisition rate based on the spacer sequence composition in natural microbial communities. The model interpretation reveals that the PAM-proximal end of the spacer is more important in predicting the spacer abundance, which is consistent with previous findings from controlled experimental studies. Combining these scales, CRISPR repeat sequences coevolve with the rest of the genome. Thus, I explore the potential of utilizing CRISPR repeat sequences for taxonomy profiling. I find a strong relationship between unique repeat sequences and taxonomy in both the RefSeq database and a human metagenomic dataset. Then I show high accuracy when utilizing repeat sequences in taxonomy annotation of human metagenomic contigs. This novel method not only aids in annotating CRISPR arrays but also introduces a novel tool for metagenomic sequence annotation. | en_US |
dc.identifier | https://doi.org/10.13016/hbs4-bxob | |
dc.identifier.uri | http://hdl.handle.net/1903/32879 | |
dc.language.iso | en | en_US |
dc.subject.pqcontrolled | Biology | en_US |
dc.subject.pqcontrolled | Bioinformatics | en_US |
dc.subject.pqcontrolled | Microbiology | en_US |
dc.subject.pquncontrolled | CRISPR | en_US |
dc.subject.pquncontrolled | Ecology | en_US |
dc.subject.pquncontrolled | Immune system | en_US |
dc.subject.pquncontrolled | Machine Learning | en_US |
dc.subject.pquncontrolled | Prokaryotes | en_US |
dc.title | EVOLUTION OF THE CRISPR IMMUNE SYSTEM FROM ECOLOGICAL TO MOLECULAR SCALES | en_US |
dc.type | Dissertation | en_US |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Xiao_umd_0117E_24149.pdf
- Size:
- 4.32 MB
- Format:
- Adobe Portable Document Format
(RESTRICTED ACCESS)