Developing and Measuring Latent Constructs in Text

dc.contributor.advisorResnik, Philipen_US
dc.contributor.authorHoyle, Alexander Miserlisen_US
dc.contributor.departmentComputer Scienceen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2025-02-07T06:32:24Z
dc.date.issued2024en_US
dc.description.abstractConstructs---like inflation, populism, or paranoia---are of fundamental concern to social science. Constructs are the vocabulary over which theory operates, and so a central activity is the development and measurement of latent constructs from observable data. Although the social sciences comprise fields with different epistemological norms, they share a concern for valid operationalizations that transparently map between data and measure. Economists at the US Bureau of Labor Statistics, for example, follow a hundred-page handbook to sample the egg prices that constitute the Consumer Price Index; Clinical psychologists rely on suites of psychometric tests to diagnose schizophrenia. In many fields, this observable data takes the form of language: as a social phenomenon, language data can encode many of the latent social constructs that people care about. Commensurate with both increasing sophistication in language technologies and amounts of available data, there has thus emerged a "text-as-data" paradigm aimed at "amplifying and augmenting" the analyses that compose research. At the same time, Natural Language Processing (NLP), the field from which analysis tools originate, has often remained separate from real-world problems and guiding theories---as least when it comes to social science. Instead, it focuses on atomized tasks under the assumption that progress on low-level language aspects will generalize to higher-level problems that involve overlapping elements. This dissertation focuses on NLP methods and evaluations that facilitate the development and measurement of latent constructs from natural language, while remaining sensitive to social sciences' need for interpretability and validity.en_US
dc.identifierhttps://doi.org/10.13016/jdlq-leek
dc.identifier.urihttp://hdl.handle.net/1903/33755
dc.language.isoenen_US
dc.subject.pqcontrolledComputer scienceen_US
dc.subject.pqcontrolledArtificial intelligenceen_US
dc.subject.pqcontrolledInformation scienceen_US
dc.subject.pquncontrolledComputational text analysisen_US
dc.subject.pquncontrolledContent analysisen_US
dc.subject.pquncontrolledMachine learningen_US
dc.subject.pquncontrolledNatural language processingen_US
dc.subject.pquncontrolledSocial scienceen_US
dc.subject.pquncontrolledTopic modelsen_US
dc.titleDeveloping and Measuring Latent Constructs in Texten_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Hoyle_umd_0117E_24694.pdf
Size:
5.84 MB
Format:
Adobe Portable Document Format
Download
(RESTRICTED ACCESS)