A Multifaceted Quantification of Bias in Large Language Models

dc.contributor.advisor: Daumé III, Hal
dc.contributor.author: Sotnikova, Anna
dc.contributor.department: Applied Mathematics and Scientific Computation
dc.contributor.publisher: Digital Repository at the University of Maryland
dc.contributor.publisher: University of Maryland (College Park, Md.)
dc.date.accessioned: 2024-02-14T06:35:43Z
dc.date.available: 2024-02-14T06:35:43Z
dc.date.issued: 2023
dc.description.abstract: Language models are developing rapidly, demonstrating impressive capabilities in comprehending, generating, and manipulating text. As they advance, they unlock diverse applications across various domains and become increasingly integrated into our daily lives. Nevertheless, these models, trained on vast and unfiltered datasets, come with a range of potential drawbacks and ethical issues. One significant concern is the amplification of biases present in the training data, which can surface as stereotypes and reinforce societal injustices when language models are deployed. In this work, we propose methods to quantify biases in large language models. We examine stereotypical associations for a wide variety of social groups characterized by both single and intersectional identities. Additionally, we propose a framework for measuring stereotype leakage across languages within multilingual large language models. Finally, we introduce an algorithm that optimizes human data collection under conditions of high human disagreement.
dc.identifier: https://doi.org/10.13016/0eex-wnsj
dc.identifier.uri: http://hdl.handle.net/1903/31722
dc.language.iso: en
dc.subject.pqcontrolled: Applied mathematics
dc.subject.pqcontrolled: Computer science
dc.subject.pquncontrolled: Artificial Intelligence
dc.subject.pquncontrolled: Ethics
dc.subject.pquncontrolled: Large Language Models
dc.subject.pquncontrolled: Natural Language Processing
dc.subject.pquncontrolled: Stereotypes
dc.title: A Multifaceted Quantification of Bias in Large Language Models
dc.type: Dissertation

Files

Original bundle
Name: Sotnikova_umd_0117E_23859.pdf
Size: 2.83 MB
Format: Adobe Portable Document Format