Using LLMs to score Hierarchical Organization of Narrative Recall

dc.contributor.advisorBolger, Donald
dc.contributor.authorYou, Hannah
dc.contributor.authorWorthy, Jaylen
dc.contributor.authorRickles, Ben
dc.date.accessioned2024-04-15T17:02:59Z
dc.date.available2024-04-15T17:02:59Z
dc.date.issued2024
dc.description.abstractResearch has suggested that the ability to comprehend narratives relies on hierarchical organization of memory. Disorders like dementia can be detected by measures of narrative recall, however these often rely on human graders. Here we explored how LLMs can be leveraged to measure hierarchical organization of narrative recall (HONR). We had participants (n=4) recall two, 45-50 sentence stories. We then submitted the recalls (n=8) to a two-part process where we first asked chatGPT to connect the sentences in the recall to the sentences in the actual stories. Next we took the output from ChatGPT and devised two measures of HONR: Event Bunching and Linearity as proxies for Local and Global Coherence, respectively. We compared a sample of human scored responses to those semi-automated responses and found a high interrater reliability, suggesting LLMs can reliably answer questions about narrative structure.
dc.identifierhttps://doi.org/10.13016/u7me-ectv
dc.identifier.urihttp://hdl.handle.net/1903/32461
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Maryland
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md)
dc.relation.isAvailableAtOffice of Undergraduate Research
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/
dc.titleUsing LLMs to score Hierarchical Organization of Narrative Recall
dc.typePresentation
local.equitableAccessSubmissionNo

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
URD_2024_LLMs.pptx.pdf
Size:
732.35 KB
Format:
Adobe Portable Document Format