AI-BASED OCR SYSTEM FOR DIGITIZING HANDWRITTEN HISTORICAL DOCUMENTS IN REGIONAL LANGUAGES

  • Unique Paper ID: 174285
  • Volume: 11
  • Issue: 10
  • PageNo: 3435-3441
  • Abstract: Historical documents handwritten in regional languages are at risk of physical deterioration and are accessible to few readers. This project proposes an AI-driven Optical Character Recognition (OCR) system that uses Convolutional Neural Networks (CNNs) in the MATLAB environment to digitize such handwritten texts despite varying handwriting styles and language-specific characters. An image preprocessing pipeline comprising noise removal, binarization, and segmentation improves document quality and isolates text regions. The recognized characters are converted into machine-readable text and then translated into modern regional languages, making the material more accessible to researchers and historians. The system thereby supports the preservation of cultural heritage by providing a tool for accessing and studying historical information that might otherwise be lost.
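The abstract names three preprocessing stages (noise removal, binarization, segmentation) without specifying the algorithms used. The following is a minimal sketch of how such a pipeline could look, not the authors' MATLAB implementation. It assumes a 3x3 median filter for noise removal, Otsu's method for binarization, and a vertical projection profile for segmentation. All function names and the demo page are hypothetical, and Python is used here only for illustration.

```python
def median_filter(img):
    """3x3 median filter (assumed denoiser); borders use the neighbours that exist."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = sorted(img[j][i]
                            for j in range(max(0, y - 1), min(h, y + 2))
                            for i in range(max(0, x - 1), min(w, x + 2)))
            out[y][x] = window[len(window) // 2]  # upper median for even-sized windows
    return out


def otsu_threshold(img):
    """Gray level t (0-255) that maximizes between-class variance (Otsu's method)."""
    hist = [0] * 256
    for row in img:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(256):
        w0 += hist[t]            # pixels at or below t
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0          # pixels above t
        if w1 == 0:
            break
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def segment_columns(binary):
    """Split ink into (start, end) column spans wherever the column profile drops to zero."""
    profile = [sum(col) for col in zip(*binary)]
    spans, start = [], None
    for x, count in enumerate(profile):
        if count and start is None:
            start = x
        elif not count and start is not None:
            spans.append((start, x))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def preprocess(img):
    """Denoise, binarize (dark ink -> 1, light paper -> 0), then segment."""
    denoised = median_filter(img)
    t = otsu_threshold(denoised)
    binary = [[1 if v <= t else 0 for v in row] for row in denoised]
    return binary, segment_columns(binary)


def make_demo_page():
    """Hypothetical 7x12 page: light paper (200), two dark strokes (30), one speckle."""
    page = [[200] * 12 for _ in range(7)]
    for y in range(1, 6):
        for x in (2, 3, 7, 8, 9):
            page[y][x] = 30
    page[0][11] = 30  # isolated noise pixel the median filter should remove
    return page


binary, spans = preprocess(make_demo_page())
print(spans)  # one (start, end) column span per candidate character region
```

In a full system along the lines the abstract describes, each segmented region would then be cropped and passed to the CNN classifier. A projection profile only separates characters that do not touch, so cursive or connected scripts would likely need a different segmentation strategy.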

  • ISSN: 2349-6002
