Incorporating Knowledge Sources into Statistical Speech Recognition
Details
The authors address the problem of developing efficient automatic speech recognition systems that maintain a balance between utilizing a wide knowledge of speech variability, while keeping the training manageable and improving speech recognition performance.
Incorporating Knowledge Sources into Statistical Speech Recognition addresses the problem of developing efficient automatic speech recognition (ASR) systems, which maintain a balance between utilizing a wide knowledge of speech variability, while keeping the training / recognition effort feasible and improving speech recognition performance. The book provides an efficient general framework to incorporate additional knowledge sources into state-of-the-art statistical ASR systems. It can be applied to many existing ASR problems with their respective model-based likelihood functions in flexible ways.
Provides an efficient general framework to incorporate additional knowledge sources into state-of-the-art statistical ASR systems Demonstrates applications to existing ASR problems with their respective model-based likelihood functions in flexible ways
Klappentext
Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible.
The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated.
Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.
Inhalt
and Book Overview.- Statistical Speech Recognition.- Graphical Framework to Incorporate Knowledge Sources.- Speech Recognition Using GFIKS.- Conclusions and Future Directions.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09781441946768
- Genre Elektrotechnik
- Auflage Softcover reprint of hardcover 1st edition 2009
- Sprache Englisch
- Lesemotiv Verstehen
- Anzahl Seiten 220
- Größe H235mm x B155mm x T13mm
- Jahr 2010
- EAN 9781441946768
- Format Kartonierter Einband
- ISBN 1441946764
- Veröffentlichung 05.11.2010
- Titel Incorporating Knowledge Sources into Statistical Speech Recognition
- Autor Sakriani Sakti , Wolfgang Minker , Satoshi Nakamura , Konstantin Markov
- Untertitel Lecture Notes in Electrical Engineering 42
- Gewicht 341g
- Herausgeber Springer US