Trust through Transparency: Towards Reliable AI for All

dc.contributor.advisorFeizi, Soheilen_US
dc.contributor.authorMoayeri, Mazdaen_US
dc.contributor.departmentComputer Scienceen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2025-08-08T11:43:57Z
dc.date.issued2025en_US
dc.description.abstractSeemingly performant models can break down in unexpected and uneven ways, from image classifiers failing to recognize an otter out of water, to LLMs being nearly 3 times worse at recalling facts about Somalia than Sweden. In this dissertation, I’ll detail interpretability techniques to scalably illuminate and efficiently intervene on discovered model deficiencies. First, I’ll present evidence for pervasive reliance on spurious correlations by vision models, by way of carefully constructed benchmarks. Then, I’ll automate these approaches, demonstrating the power of leveraging auxiliary models to more efficiently organize data, towards uncovering and articulating subsets where models struggle. Finally, I’ll show how these same techniques can be applied to mitigate instances of real-world geographic disparities and even tackle sociotechnical challenges like artistic copyright infringement. In general, it can be difficult to trust what we do not fully understand, especially when unexpected failures arise. By scalably identifying failure modes before they cause harm, we enhance transparency around model abilities and limitations, thus better informing when models can be trusted to work reliably for all.en_US
dc.identifierhttps://doi.org/10.13016/awz7-oxld
dc.identifier.urihttp://hdl.handle.net/1903/34092
dc.language.isoenen_US
dc.subject.pqcontrolledArtificial intelligenceen_US
dc.subject.pqcontrolledComputer scienceen_US
dc.subject.pquncontrolledcomputer visionen_US
dc.subject.pquncontrolleddeep learningen_US
dc.subject.pquncontrolledfairnessen_US
dc.subject.pquncontrolledrobustnessen_US
dc.subject.pquncontrolledspurious correlationsen_US
dc.subject.pquncontrolledvision language modelsen_US
dc.titleTrust through Transparency: Towards Reliable AI for Allen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Moayeri_umd_0117E_24939.pdf
Size:
49.77 MB
Format:
Adobe Portable Document Format