Discrimination of Speech From Non-Speech Based on Multiscale Spectro-Temporal Modulations

dc.contributor.advisorShamma, Shihaben_US
dc.contributor.authorMesgarani, Nimaen_US
dc.contributor.departmentElectrical Engineeringen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2006-02-04T06:31:12Z
dc.date.available2006-02-04T06:31:12Z
dc.date.issued2005-05-16en_US
dc.description.abstractWe describe a content-based audio classification algorithm based on novel multiscale spectrotemporal modulation features inspired by a model of auditory cortical processing. The task explored is to discriminate speech from non-speech consisting of animal vocalizations, music and environmental sounds. Although this is a relatively easy task for humans, it is still difficult to automate well, especially in noisy and reverberant environments. The auditory model captures basic processes occurring from the early cochlear stages to the central cortical areas. The model generates a multidimensional spectro-temporal representation of the sound, which is then analyzed by a multi-linear dimensionality reduction technique and classified by a Support Vector Machine (SVM). Generalization of the system to signals in high level of additive noise and reverberation is evaluated and compared to two existing approaches [1] [2]. The results demonstrate the advantages of the auditory model over the other two systems, especially at low SNRs and high reverberation.en_US
dc.format.extent603389 bytes
dc.format.extent603389 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/1903/3044
dc.language.isoen_US
dc.subject.pqcontrolledEngineering, Electronics and Electricalen_US
dc.subject.pquncontrolledSpeech Detectionen_US
dc.subject.pquncontrolledSpectrotemporal modulationsen_US
dc.subject.pquncontrolledtensoren_US
dc.titleDiscrimination of Speech From Non-Speech Based on Multiscale Spectro-Temporal Modulationsen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
umi-umd-2483.pdf
Size:
589.25 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
mainthesis.pdf
Size:
589.25 KB
Format:
Adobe Portable Document Format