Deep Learning Based Content Retrieval for Recognition and Classification in Historical Document
Shruthi K.R, Abhishek, Bharath S, Pavan Kumar S, Madhu R
Deep Learning Convolution neural network Historical Document Text Retrieval Optical Character Recognition Deep Neural Network
This is quintessential because the variety of digitized historic files has expanded rapidly in latest decades. It affords environment friendly statistics retrieval and information extraction techniques to enable get entry to data. Such a technique to transform document images into written representations, it uses optical character recognition (OCR). At the moment, OCR methods frequently do not fit into the historical realm. In addition, they normally require a massive amount Annotated document. Therefore, this report will exhibit you some methods to enable OCR on historical data. Add some authentic, manually labelled coaching information to the photograph. Full featured OCR The device performs two main tasks: OCR and page layout analysis, which includes text block and line segmentation. Our segmentation method uses a recurrent neural network, while the OCR method is based on a fully convolutional network. Both strategies are cutting edge in the relevant field. built a new kind of Protonium Portal genuine dataset for OCR. All recommended techniques will be assessed in light of this data, which is freely available for research on this corpus. We display it with the aid of some real samples of annotated records, both segmentation and OCR jobs can be completed. The experiment goals to do this If your dataset is small, determine the satisfactory way to do it properly. We also show that the rating carried out is equal to or better than the scores of some contemporary systems. In conclusion, this study shows how to develop an effective OCR system for historical archives even in the absence of much training data.
Article Details
Unique Paper ID: 156035

Publication Volume & Issue: Volume 9, Issue 2

Page(s): 571 - 577
Article Preview & Download

Share This Article

Join our RMS

Conference Alert


AICTE Sponsored National Conference on Smart Systems and Technologies

Last Date: 25th November 2023

SWEC- Management


Last Date: 7th November 2023

Call For Paper

Volume 10 Issue 10

Last Date for paper submitting for March Issue is 25 June 2024

About Us enables door in research by providing high quality research articles in open access market.

Send us any query related to your research on

Social Media

Google Verified Reviews