Representation of speech in the primary auditory cortex and its implications for robust speech processing

Mesgarani, Nima

Representation of speech in the primary auditory cortex and its implications for robust speech processing

dc.contributor.advisor	Shamma, Shihab	en_US
dc.contributor.author	Mesgarani, Nima	en_US
dc.contributor.department	Electrical Engineering	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2008-10-11T05:52:07Z
dc.date.available	2008-10-11T05:52:07Z
dc.date.issued	2008-08-05	en_US
dc.description.abstract	Speech has evolved as a primary form of communication between humans. This most used means of communication has been the subject of intense study for years, but there is still a lot that we do not know about it. It is an oft repeated fact, that even the performance of the best speech processing algorithms still lags far behind that of the average human, It seems inescapable that unless we know more about the way the brain performs this task, our machines can not go much further. This thesis focuses on the question of speech representation in the brain, both from a physiological and technological perspective. We explore the representation of speech through the encoding of its smallest elements - phonemic features - in the primary auditory cortex. We report on how population of neurons with diverse tuning properties respond discriminately to phonemes resulting in explicit encoding of their parameters. Next, we show that this sparse encoding of the phonemic features is a simple consequence of the linear spectro-temporal properties of the auditory cortical neurons and that a Spectro-Temporal receptive field model can predict similar patterns of activation. This is an important step toward the realization of systems that operate based on the same principles as the cortex. Using an inverse method of reconstruction, we shall also explore the extent to which phonemic features are preserved in the cortical representation of noisy speech. The results suggest that the cortical responses are more robust to noise and that the important features of phonemes are preserved in the cortical representation even in noise. Finally, we explain how a model of this cortical representation can be used for speech processing and enhancement applications to improve their robustness and performance.	en_US
dc.format.extent	15071377 bytes
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/1903/8586
dc.language.iso	en_US
dc.subject.pqcontrolled	Engineering, Electronics and Electrical	en_US
dc.subject.pqcontrolled	Biology, Neuroscience	en_US
dc.subject.pquncontrolled	Auditory Cortex	en_US
dc.subject.pquncontrolled	Auditory Model	en_US
dc.subject.pquncontrolled	Speech	en_US
dc.title	Representation of speech in the primary auditory cortex and its implications for robust speech processing	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: umi-umd-5695.pdf
Size:: 14.37 MB
Format:: Adobe Portable Document Format

Download

Collections

UMD Theses and Dissertations
Electrical & Computer Engineering Theses and Dissertations