Modeling Language Development: How Machine Learning can Enhance Analysis of the Language Environment
dc.contributor.advisor | Huang, Yi Ting | |
dc.contributor.advisor | Newman, Rochelle | |
dc.contributor.advisor | Domanski, Sophie | |
dc.contributor.author | Harvey, James | |
dc.date.accessioned | 2024-12-20T16:04:41Z | |
dc.date.available | 2024-12-20T16:04:41Z | |
dc.date.issued | 2024-12-18 | |
dc.description.abstract | Language sampling elicits a representative picture of a child’s language and provides methods for assessing functional communication beyond what is offered by standardized tests. Naturalistic sampling reduces time costs, and offers an ideal way to assess differences in home language associated with differences in socioeconomic status (SES). Unfortunately, naturalistic dense recordings present challenges in terms of how to scale analysis and extract meaningful information. This study investigates the application and analysis of the Language ENvironment Analysis system (LENA) for sampling home language using technology-assisted transcription and topic modeling. To evaluate the efficacy of transcription, segments were selected in reference to their amount of meaningful speech as measured by LENA, and transcribed by Whisper, OpenAI’s automatic speech recognition software. Research assistants trimmed text files to retain available adult language separated by utterance. Results suggest that this method of sampling, technology-assisted transcription, and automated analysis of traditional language metrics reproduces expected associations between parental input, SES, and standardized child vocabulary size. Topic models did not identify activity contexts, likely due to the nature of the input. This research presents a validated pipeline to produce dense representative data that utilizes modern approaches to reduce traditional time costs. | |
dc.identifier | https://doi.org/10.13016/dqfi-fjzy | |
dc.identifier.uri | http://hdl.handle.net/1903/33554 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | Department of Hearing & Speech Sciences | |
dc.relation.isAvailableAt | College of Behavioral and Social Sciences | |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | |
dc.relation.isAvailableAt | University of Maryland (College Park, Md) | |
dc.rights | Attribution-NoDerivs 3.0 United States | en |
dc.rights.uri | http://creativecommons.org/licenses/by-nd/3.0/us/ | |
dc.title | Modeling Language Development: How Machine Learning can Enhance Analysis of the Language Environment | |
dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Honors Thesis Final.pdf
- Size:
- 1.06 MB
- Format:
- Adobe Portable Document Format