TEMPORAL CODING OF SPEECH IN HUMAN AUDITORY CORTEX

dc.contributor.advisor: Simon, Jonathan Z
dc.contributor.author: Ding, Nai
dc.contributor.department: Electrical Engineering
dc.contributor.publisher: Digital Repository at the University of Maryland
dc.contributor.publisher: University of Maryland (College Park, Md.)
dc.date.accessioned: 2012-10-10T11:11:56Z
dc.date.available: 2012-10-10T11:11:56Z
dc.date.issued: 2012
dc.description.abstract: Human listeners can reliably recognize speech in complex listening environments. The underlying neural mechanisms, however, remain unclear and cannot yet be emulated by any artificial system. In this dissertation, we study how speech is represented in the human auditory cortex and how that neural representation contributes to reliable speech recognition. Cortical activity from normal-hearing human subjects is noninvasively recorded using magnetoencephalography while they listen to natural speech. It is first demonstrated that neural activity from auditory cortex is precisely synchronized to the slow temporal modulations of speech when the speech signal is presented in a quiet listening environment. How this neural representation is affected by acoustic interference is then investigated. Acoustic interference degrades speech perception via two mechanisms, informational masking and energetic masking, which are addressed respectively by using a competing speech stream and stationary noise as the interfering sound. When two speech streams are presented simultaneously, cortical activity is predominantly synchronized to the speech stream the listener attends to, even if the unattended, competing speech stream is 8 dB more intense. When speech is presented together with spectrally matched stationary noise, cortical activity remains precisely synchronized to the temporal modulations of speech until the noise is 9 dB more intense. Critically, the accuracy of neural synchronization to speech predicts how well individual listeners can understand speech in noise. Further analysis reveals that two neural sources contribute to speech-synchronized cortical activity, one with a shorter response latency of about 50 ms and the other with a longer response latency of about 100 ms. The longer-latency component, but not the shorter-latency component, shows selectivity to the attended speech and invariance to background noise, indicating a transition within auditory cortex from encoding the acoustic scene to encoding the behaviorally important auditory object. Taken together, we have demonstrated that during natural speech comprehension, neural activity in the human auditory cortex is precisely synchronized to the slow temporal modulations of speech. This neural synchronization is robust to acoustic interference, whether speech or noise, and therefore provides a strong candidate for the neural basis of speech recognition that is invariant to the acoustic background.
dc.identifier.uri: http://hdl.handle.net/1903/12988
dc.subject.pqcontrolled: Engineering
dc.subject.pqcontrolled: Electrical engineering
dc.subject.pqcontrolled: Neurosciences
dc.subject.pquncontrolled: auditory scene analysis
dc.subject.pquncontrolled: human auditory cortex
dc.subject.pquncontrolled: magnetoencephalography (MEG)
dc.subject.pquncontrolled: spectro-temporal response function (STRF)
dc.subject.pquncontrolled: speech
dc.subject.pquncontrolled: temporal processing
dc.title: TEMPORAL CODING OF SPEECH IN HUMAN AUDITORY CORTEX
dc.type: Dissertation
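
To make the envelope-synchronization measure described in the abstract concrete, the following is a minimal, illustrative sketch, not the dissertation's actual analysis pipeline (which uses spectro-temporal response functions, as the keywords indicate). It assumes a hypothetical speech waveform `audio` and a single preprocessed MEG channel `meg` resampled to a common rate; the slow temporal modulations are extracted as a low-pass-filtered Hilbert envelope and compared with the neural response over a range of plausible latencies.

# Illustrative sketch only: quantify how well one MEG channel tracks the slow
# temporal modulations (envelope) of speech. Variable names and parameter
# values are assumptions, not the dissertation's reported settings.
import numpy as np
from scipy.signal import hilbert, butter, filtfilt, resample

FS = 200  # assumed common sampling rate (Hz) for envelope and MEG signals

def slow_envelope(audio, fs_audio, fs_out=FS, cutoff_hz=8.0):
    """Broadband envelope via the Hilbert transform, low-pass filtered to keep
    only the slow (< ~8 Hz) modulations, then downsampled to fs_out."""
    env = np.abs(hilbert(audio))
    b, a = butter(4, cutoff_hz / (fs_audio / 2), btype="low")
    env = filtfilt(b, a, env)
    n_out = int(round(len(env) * fs_out / fs_audio))
    return resample(env, n_out)

def synchronization(env, meg, max_lag_ms=250, fs=FS):
    """Peak Pearson correlation between the speech envelope and the MEG
    response across candidate neural latencies from 0 to max_lag_ms."""
    max_lag = int(max_lag_ms / 1000 * fs)
    best = 0.0
    for lag in range(max_lag + 1):
        x = env[: len(env) - lag] if lag else env
        y = meg[lag : lag + len(x)]
        n = min(len(x), len(y))
        r = np.corrcoef(x[:n], y[:n])[0, 1]
        best = max(best, abs(r))
    return best

# Example usage with hypothetical data:
# env = slow_envelope(audio, fs_audio=16000)
# r = synchronization(env, meg_channel)

A lagged correlation is only the simplest way to express "synchronization to the slow temporal modulations of speech"; the dissertation's findings (attentional selectivity, noise invariance, the ~50 ms and ~100 ms response components) rest on more complete source-space and STRF analyses.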

Files

Original bundle
Name: Ding_umd_0117E_13274.pdf
Size: 5.03 MB
Format: Adobe Portable Document Format