Generation of Text and Speech Corpora
Details
Recent trends in the development of language technology show an unavoidable need for language resources and for acquiring knowledge from them. In this context, corpus-based methods for Bangla language processing are receiving a strong push from laboratories around the world. As a continuation of these efforts, a new Bangla text corpus, BdNC01, and several speech corpora were generated in this work. The texts were collected from the web editions of several leading Bangla newspapers over a long period to avoid time dependency of word frequency. More than eleven million word tokens were collected over six years. The corpus was manually checked and error-corrected each time before being stored in the final repository as ASCII and Unicode text. Using popular words derived from the text corpus, we recorded the largest speech corpora in the Bangla language. They have been specifically designed for research on HMM-based speaker-independent speech recognition.
About the Author
Dr. Md. Farukuzzaman Khan is a professor at Islamic University, Bangladesh. His work now focuses on language technology, chiefly for Bangla. He received his Ph.D. and M.Phil. from Islamic University in 2014 and 2003, respectively. Dr. Khan earned his M.Sc. and B.Sc. in Applied Physics and Electronics from the University of Rajshahi. He was born in Bangladesh on 10 August 1966.
Further Information
- General information
- Language English
- Number of pages 168
- Publisher LAP LAMBERT Academic Publishing
- Weight 268g
- Subtitle For Computer Processing and Recognition of Bangla
- Author Md. Farukuzzaman Khan
- Title Generation of Text and Speech Corpora
- Publication date 14.08.2019
- ISBN 3659777129
- Format Paperback
- EAN 9783659777127
- Year 2019
- Dimensions H220mm x W150mm x D11mm
- GTIN 09783659777127