Detection and Recognition of Asynchronous Auditory/Visual Speech: Effects of Age, Hearing Loss, and Talker Accent
External Link(s)
https://doi.org/10.3389/fpsyg.2021.772867
Date
2022-01
Author
Gordon-Salant, Sandra
Schwartz, Maya
Oppler, Kelsey
Yeni-Komshian, Grace
Abstract
This investigation examined age-related differences in auditory-visual (AV) integration as reflected in perceptual judgments of temporally misaligned AV English sentences spoken by native English and native Spanish talkers. In the detection task, it was expected that slowed auditory temporal processing of older participants, relative to younger participants, would be manifest as a shift in the range over which participants would judge asynchronous stimuli as synchronous (referred to as the “AV simultaneity window”). The older participants were also expected to exhibit greater declines in speech recognition for asynchronous AV stimuli than younger participants. Talker accent was hypothesized to influence listener performance, with older listeners exhibiting a greater narrowing of the AV simultaneity window and much poorer recognition of asynchronous AV foreign-accented speech compared to younger listeners. Participant groups included younger and older participants with normal hearing and older participants with hearing loss. Stimuli were video recordings of sentences produced by native English and native Spanish talkers. The video recordings were altered in 50 ms steps by delaying either the audio or video onset. Participants performed a detection task in which they judged whether the sentences were synchronous or asynchronous, and performed a recognition task for multiple synchronous and asynchronous conditions. Both the detection and recognition tasks were conducted at the individualized signal-to-noise ratio (SNR) corresponding to approximately 70% correct speech recognition performance for synchronous AV sentences. Older listeners with and without hearing loss generally showed wider AV simultaneity windows than younger listeners, possibly reflecting slowed auditory temporal processing in auditory lead conditions and reduced sensitivity to asynchrony in auditory lag conditions.
However, older and younger listeners were affected similarly by misalignment of auditory and visual signal onsets on the speech recognition task. This suggests that older listeners are negatively impacted by temporal misalignments for speech recognition, even when they do not notice that the stimuli are asynchronous. Overall, the findings show that when listener performance is equated for simultaneous AV speech signals, age effects are apparent in detection judgments but not in recognition of asynchronous speech.
Rights
CC0 1.0 Universal
http://creativecommons.org/publicdomain/zero/1.0/
The following license files are associated with this item:
- Creative Commons
Related items
Showing items related by title, author, creator and subject.
- Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings
  Zhou, Jingting (2011)
  In this work many versions of CELP codecs are explored, and an observation is made that different codebooks are used to encode noisy part of residual. Taking advantage of noise patterns they generated, an algorithm was ...
- Discrimination of Speech From Non-Speech Based on Multiscale Spectro-Temporal Modulations
  Mesgarani, Nima (2005-05-16)
  We describe a content-based audio classification algorithm based on novel multiscale spectrotemporal modulation features inspired by a model of auditory cortical processing. The task explored is to discriminate speech from ...
- Infant-Directed Speech: Maternal Pitch Variability, Rate of Speech, and Child Language Outcomes
  Raneri, Daniele Palma (2015)
  Research regarding the influence of specific features typical of infant-directed speech (IDS) and their potential role in facilitating children’s language development is still needed. Very little evidence links features ...