Navigation überspringen.
Startseite

Guide to OCR for Indic Scripts

Govindaraju, Venu [u.a.] (Hrsg.):
Guide to OCR for Indic Scripts : Document Recognition and Retrieval / Venu Govindaraju, Srirangaraj Setlur, editors. - London ; Dordrecht ; Heidelberg [u.a.] : Springer, 2009 (eBook), 2010 (Print-Ausg.). - XXI, 325 S. - (Advances in Pattern Recognition)
ISBN 978-1-84800-329-3 (Print-Ausg.)
ISBN 978-1-84800-330-9 (eBook)
EUR 106,95
DOI: 10.1007/978-1-84800-330-9
DDC: 006.4240954

Beschreibung
Optical Character Recognition (OCR) is a key enabling technology critical to creating indexed, digital library content, and it is especially valuable for Indic scripts, for which there has been very little digital access.
   Indic scripts, the ancient Brahmi scripts prevalent in the Indian subcontinent, present some challenges for OCR that are different from those faced with Latin and Oriental scripts. But properly utilized, OCR will help to make Indic digital archives practically accessible to researchers and lay users alike by creating searchable indexes and machine-readable text repositories.
   This unique guide/reference is the very first comprehensive book on the subject of OCR for Indic scripts, providing an overview of the state-of-the-art research in this field as well as other issues related to facilitating query and retrieval of Indic documents from digital libraries. All major research groups working in this area are represented in this book, which is divided into sections on recognition of Indic scripts and retrieval of Indic documents. [Verlagsinformation]

Inhalt
PART I: RECOGNITION OF INDIC SCRIPTS
1. C. V. Jawahar, Anand Kumar, A. Phaneendra and K.J. Jinesh: Building Data Sets for Indian Language OCR Research. 3
2. B. B. Chaudhari: On OCR of major Indian scripts: Bangla and Devanagari. 27
3. Gurpreet Singh Lehal: A Complete Machine Printed Gurmukhi OCR System. 43
4. Jignesh Dholakia, Atul Negi and S. Rama Mohan: Progress in Gujarati Document Processing and Character Recognition. 73
5. R. S. Umesh , P. B. Pati and A. G. Ramakrishnan: Design of a bilingual Kannada-English OCR. 97
6. N. V. Neeba , Anoop Namboodiri, C. V. Jawahar and P. J. Narayanan: Recognition of Malayalam Documents. 125
7. K. H. Aparna and V. S. Chakravarthy: A Complete OCR System for Tamil Magazine Documents. 147
8. Omar Mukhtar, Srirangaraj Setlur and Venu Govindaraju: Experiments on Urdu Text Recognition. 163
9. Prem Natarajan, Ehry MacRostie, and Michael Decerbo: The BBN Byblos Hindi OCR System. 173
10. Mudit Agrawal, Huanfeng Ma and David Doermann: Generalization of Hindi OCR using Adaptive Segmentation and Font Files. 181
11. A. Bharath and Sriganesh Madhvanath: Online Handwriting Recognition for Indic Scripts. 209
PART II: RETRIEVAL OF INDIC DOCUMENTS
12. Peter M. Scharf and Malcolm Hyman: Enhancing Access to Primary Cultural Heritage Materials of India. 237
13. Zhixin Shi, Srirangaraj Setlur and Venu Govindaraju: Digital Image Enhancement of Indic Historical Manuscripts. 249
14. Gaurav Harit, Shantanu Chaudhary and Ritu Garg: GFG based Compression and Retrieval of Document Images in Indian Scripts. 269
15. Anurag Bhardwaj, Srirangaraj Setlur, Venu Govindaraju: Word spotting for Indic documents to facilitate retrieval. 285
16. Prasenjit Majumder and Mandar Mitra: Indian Language Information Retrieval. 301
Colour plates. 315
Index. 321

Herausgeber
VENU GOVINDARAJU, Distinguished Professor of Computer Science and Engineering at the University at Buffalo (SUNY Buffalo). Profile page
SRIRANGARAJ (RANGA) SETLUR, Principal Research Scientist of and directs projects at CEDAR (Center of Excellence for Document Analysis and Recognition) sponsored by the United States Postal Service for the evaluation of recognition systems. Profile page.

Quellen: Springer Verlag; Amazon; Google Books; WorldCat