Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Auditory Features for Speech Recognition and Enhancement
Details
Automatic speech recognition (ASR) involves the transformation of acoustic speech signal captured by a microphone, a telephone, or other transducers, into a text sequence. It is also known as the recognition of speech by a machine or, by some artificial intelligence. However, in spite of focused research in this field for the past several decades, robust speech recognition with high reliability has not been achieved as it degrades in the presence of speaker variabilities, channel mismatch conditions, and in noisy environments. The superb ability of the human auditory system has motivated researchers to include features of human perception in the speech recognition process. This book investigates the roles of several psychoacoustic features of human hearing in automatic speech recognition in clean and noisy environments and to determine those perceptual features which are relevant for speech recognition applications. The psychoacoustic features which are investigated are perceptual filterbank corresponding to the critical bands, synaptic adaptation, two-tone suppression, dynamic range compression and simultaneous and temporal masking effects.
Autorentext
The authors are with the Univ. of West. Australia (UWA). Research interests of Dr. Haque are speech comm. and auditory modeling. Dr. Togneri is the head of the Signal Proc. Lab at UWA. His research interests are statistical models for speech and spoken language technology. Research interests of Dr. Zaknich are learning theory and neural networks.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09783639183962
- Genre Technik
- Sprache Englisch
- Anzahl Seiten 208
- Herausgeber VDM Verlag Dr. Müller e.K.
- Größe H18mm x B222mm x T151mm
- Jahr 2009
- EAN 9783639183962
- Format Kartonierter Einband (Kt)
- ISBN 978-3-639-18396-2
- Titel Auditory Features for Speech Recognition and Enhancement
- Autor Serajul Haque
- Untertitel Psychoacoustics of human hearing applied to automatic speech recognition