Automatic POS Tagging of Bhojpuri: A Comparative Study with Hindi
Details
This work is one of the initial experiments towards creating an automatic part-of-speech (POS) tagger for the Bhojpuri language. Bhojpuri is a less-resourced language with little language technology available. This work therefore presents the first large, representative Bhojpuri corpus, of approximately 267,000 tokens from different domains, and a Support Vector Machine (SVM) based POS tagger trained on this corpus. The tagger achieved an accuracy of approximately 87% in this experiment. The work also covers detailed guidelines for annotating a Bhojpuri corpus following the BIS scheme, and a comparative analysis of the performance of Bhojpuri and Hindi POS taggers trained with the SVM model.
About the author
The author is a research scholar at Jawaharlal Nehru University, New Delhi. Her area of expertise is computational linguistics. Her main interests are natural language processing (NLP), corpus collection, and resource creation for less-resourced languages, which are also the main objectives of the present work, an M.Phil. dissertation submitted in 2015.
Further information
- General information
- GTIN 09786138389927
- Language English
- Title Automatic POS Tagging of Bhojpuri: A Comparative Study with Hindi
- Publication date 20.04.2018
- ISBN 6138389921
- Format Paperback
- EAN 9786138389927
- Year 2018
- Dimensions H 220 mm x W 150 mm x D 17 mm
- Author Srishti Singh
- Genre Linguistics and literary studies
- Number of pages 276
- Publisher LAP LAMBERT Academic Publishing
- Weight 429 g