Corpus-Based Methods in Language and Speech Processing
Details
Corpus-based methods lie at the heart of many language and speech processing systems. This book provides an in-depth introduction to these technologies through chapters describing basic statistical modeling techniques for language and speech, the use of Hidden Markov Models in continuous speech recognition, the development of dialogue systems, part-of-speech tagging and partial parsing, data-oriented parsing, and n-gram language modeling.
The book aims to give both a clear overview of the main technologies used in language and speech processing and sufficient mathematics to understand the underlying principles. An extensive bibliography enables topics of interest to be pursued further. Overall, we believe that the book will give newcomers a solid introduction to the field and existing practitioners a concise review of the principal technologies used in state-of-the-art language and speech processing systems.
Corpus-Based Methods in Language and Speech Processing is an initiative of ELSNET, the European Network in Language and Speech. In its activities, ELSNET attaches great importance to the integration of language and speech, both in research and in education. The need for and the potential of this integration are well demonstrated by this publication.
Contents
1 Corpus-Based Statistical Methods in Speech and Language Processing
2 Hidden Markov Models in Speech and Language Processing
3 Spoken Language Dialogue Systems
4 Part-of-Speech Tagging and Partial Parsing
5 Data-Oriented Language Processing
6 Statistical Language Modeling Using Leaving-One-Out
Author information
Further Information
- General Information
- GTIN 09789048148134
- Editors Gerrit Bloothooft, Steve Young
- Language English
- Edition Softcover reprint of hardcover 1st edition 1997
- Size H 235 mm x W 155 mm x D 14 mm
- Year 2010
- EAN 9789048148134
- Format Paperback
- ISBN 9048148138
- Publication date 8 December 2010
- Title Corpus-Based Methods in Language and Speech Processing
- Subtitle Text, Speech and Language Technology 2
- Weight 382 g
- Publisher Springer Netherlands
- Number of pages 248
- Reading motive Understanding
- Genre Linguistics and literary studies