Deep Learning Based Content Retrieval for Recognition and Classification in Historical Document
Author(s):
Shruthi K.R, Abhishek, Bharath S, Pavan Kumar S, Madhu R
Keywords:
Deep Learning Convolution neural network Historical Document Text Retrieval Optical Character Recognition Deep Neural Network
Abstract
This is quintessential because the variety of digitized historic files has expanded rapidly in latest decades. It affords environment friendly statistics retrieval and information extraction techniques to enable get entry to data. Such a technique to transform document images into written representations, it uses optical character recognition (OCR). At the moment, OCR methods frequently do not fit into the historical realm. In addition, they normally require a massive amount Annotated document. Therefore, this report will exhibit you some methods to enable OCR on historical data. Add some authentic, manually labelled coaching information to the photograph. Full featured OCR The device performs two main tasks: OCR and page layout analysis, which includes text block and line segmentation. Our segmentation method uses a recurrent neural network, while the OCR method is based on a fully convolutional network. Both strategies are cutting edge in the relevant field. built a new kind of Protonium Portal genuine dataset for OCR. All recommended techniques will be assessed in light of this data, which is freely available for research on this corpus. We display it with the aid of some real samples of annotated records, both segmentation and OCR jobs can be completed. The experiment goals to do this If your dataset is small, determine the satisfactory way to do it properly. We also show that the rating carried out is equal to or better than the scores of some contemporary systems. In conclusion, this study shows how to develop an effective OCR system for historical archives even in the absence of much training data.
Article Details
Unique Paper ID: 156035

Publication Volume & Issue: Volume 9, Issue 2

Page(s): 571 - 577
Article Preview & Download


Share This Article

Join our RMS

Conference Alert

NCSEM 2024

National Conference on Sustainable Engineering and Management - 2024

Last Date: 15th March 2024

Call For Paper

Volume 11 Issue 1

Last Date for paper submitting for Latest Issue is 25 June 2024

About Us

IJIRT.org enables door in research by providing high quality research articles in open access market.

Send us any query related to your research on editor@ijirt.org

Social Media

Google Verified Reviews