Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Guide to OCR for Arabic Scripts
Details
This detailed overview of Arabic character recognition technology covers pre-processing and feature extraction; HMM-based methods; evaluation of OCR systems; and applications of recognition technology, from historical manuscripts to online Arabic recognition.
This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Topics and features: contains contributions from the leading researchers in the field; with a Foreword by Professor Bente Maegaard of the University of Copenhagen; presents a detailed overview of Arabic character recognition technology, covering a range of different aspects of pre-processing and feature extraction; reviews a broad selection of varying approaches, including HMM-based methods and a recognition system based on multidimensional recurrent neural networks; examines the evaluation of Arabic script recognition systems, discussing data collection and annotation, benchmarking strategies, and handwriting recognition competitions; describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition.
The first book of its kind, specifically devoted to the emerging field of OCR for Arabic Scripts Presents state-of-the-art research from an international selection of pre-eminent authorities in the field Describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition Includes supplementary material: sn.pub/extras
Autorentext
Volker Märgner is Academic Director of the Institute for Communications Technology (IfN) at Technische Universität Braunschweig, Germany. He has over 30 years research experience in image processing, pattern recognition, and handwriting recognition. He developed the IfN/ENIT-database of Arabic handwritten names which is the reference for Arabic handwritten word recognition systems and organized competitions both together with Haikal El Abed.
Haikal El Abedis a Senior Research Engineer at the Institute for Communications Technology (IfN) at Technische Universität Braunschweig, Germany. He has more than 10 years research experience in pattern recognition and Arabic text recognition, on-line and off-line. He organizes competitions and works on the collection of databases.
Klappentext
Optical Character Recognition (OCR) is a key technology enabling access to digital text data. This technique is especially valuable for Arabic scripts, for which there has been very little digital access.
Arabic script is widely used today. It is estimated that approximately 200 million people use Arabic as a first language, and the Arabic script is shared by an additional 13 languages, making it the second most widespread script in the world. However, Arabic scripts pose unique challenges for OCR systems that cannot be simply adapted from existing Latin character-based processing techniques.
This comprehensive Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Presenting state-of-the-art research from an international selection of pre-eminent authorities, the book reviews techniques and algorithms for the recognition of both handwritten and printed Arabic scripts. Many of these techniques can also be applied to other scripts, serving as an inspiration to all groups working in the area of OCR.
Topics and features:
- Contains contributions from the leading researchers in the field
- With a Foreword by Professor Bente Maegaard of the University of Copenhagen
- Presents a detailed overview of Arabic character recognition technology, covering a range of different aspects of pre-processing and feature extraction
- Reviews a broad selection of varying approaches, including HMM-based methods and a recognition system based on multidimensional recurrent neural networks
- Examines the evaluation of Arabic script recognition systems, discussing data collection and annotation, benchmarking strategies, and handwriting recognition competitions
Describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition This authoritative work is an essential reference for all researchers and graduate students interested in OCR technology and methodology in general, and in Arabic scripts in particular.
Inhalt
Part I: Pre-Processing.- An Assessment of Arabic Handwriting Recognition Technology.- Layout Analysis of Arabic Script Documents.- A Multi-Stage Approach to Arabic Document Analysis.- Pre-Processing Issues in Arabic OCR.- Segmentation of Ancient Arabic Documents.- Features for HMM-Based Arabic Handwritten Word Recognition Systems.- Part II: Recognition.- Printed Arabic Text Recognition.- Handwritten Arabic Word Recognition Using the IFN/ENIT-Database.- RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts.- Arabic Handwriting Recognition using Bernoulli HMMs.- Handwritten Farsi Words Recognition Using Hidden Markov Models.- Offline Arabic Handwriting Recognition with Multidimensional Recurrent Neural Networks.- Application of Fractal Theory in Farsi/Arabic Document Analysis.- Multi-Stream Markov Models for Arabic Handwriting Recognition.- Towards Distributed Cursive Writing OCR Systems based on the Combination of Complementary Approaches.- Part III: Evaluation.- Data Collection and Annotation for Arabic Document Analysis.- Arabic Handwriting Recognition Competitions.- Benchmarking Strategy for Arabic Screen Rendered Word Recognition.- Part IV: Applications.- A Robust Word Spotting System for Historical Arabic Manuscripts.- Arabic Text recognition using a Script-Independent Methodology: A Unified HMM-based Approach for Machine-print and Handwritten Text.- Arabic Handwriting Recognition Using VDHMM and Over-Segmentation.- Online Arabic Databases and Applications.- Online Arabic Handwritten Words Recognition Based on HMM and Combination of Online and Offline Features.- Part I: Pre-Processing.- An Assessment of Arabic Handwriting Recognition Technology.- Layout Analysis of Arabic Script Documents.- A Multi-Stage Approach to Arabic Document Analysis.- Pre-Processing Issues in Arabic OCR.- Segmentation of Ancient Arabic Documents.- Features for HMM-Based Arabic Handwritten Word Recognition Systems.- Part II: Recognition.- Printed Arabic Text Recognition.- Handwritten Arabic Word Recognition Using the IFN/ENIT-Database.- RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts.- Arabic Handwriting Recognition using Bernoulli HMMs.- Handwritten Farsi Words Recognition Using Hidden Markov Models.- Offline Arabic Handwriting Recognition with Multidimensional Recurrent Neural Networks.- Application of Fractal Theory in Farsi/Arabic Document Analysis.- Multi-Stream Markov Models for Arabic Handwriting Recognition.- Towards Distributed Cursive Writing OCR Systems based on the Combination of Complementary Approaches.- Part III: Evaluation.- Data Collection and Annotation for Arabic Document Analysis.- Arabic Handwriting Recognition Competitions.- Benchmarking Strategy for Arabic Screen Rendered Word Recognition.- Part IV: Applications.- A Robust Word Spotting System for Historical Arabic Manuscripts.- Arabic Text recognition using a Script-Independent Methodology: A Unified HMM-based Approach for Machine-print and Handwritten Text.- Arabic Handwriting Recognition Using VDHMM and Over-Segmentation.- Online Arabic Databases and Applications.- Online Arabic Handwritten Words Recognition Based on HMM and Combination of Online and Offline Features.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09781447159766
- Auflage 2012
- Editor Haikal El Abed, Volker Märgner
- Sprache Englisch
- Genre Anwendungs-Software
- Größe H235mm x B155mm x T33mm
- Jahr 2014
- EAN 9781447159766
- Format Kartonierter Einband
- ISBN 1447159764
- Veröffentlichung 09.08.2014
- Titel Guide to OCR for Arabic Scripts
- Gewicht 914g
- Herausgeber Springer London
- Anzahl Seiten 612
- Lesemotiv Verstehen