Copyright © 2025. The authors retain the copyright of this article. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
@article{158887,
  author   = {Karwan Vishweshwar and Kosanam Srinivas and C.A. Daphine Desona Clemency},
  title    = {Machine-Generated Captions for Images using Deep Learning},
  journal  = {International Journal of Innovative Research in Technology},
  year     = {},
  volume   = {9},
  number   = {10},
  pages    = {807--812},
  issn     = {2349-6002},
  url      = {https://ijirt.org/article?manuscript=158887},
  abstract = {The primary objective of an image caption generator is to automatically produce a suitable English caption for a given picture. This study presents an image caption generator that identifies the contents of an input picture and uses beam search and greedy search to produce an English phrase. A pretrained deep learning CNN architecture, the Xception model, is used to learn image features, while an LSTM model is used to learn textual features; the results of both are then integrated to produce a caption. The LSTM model is used to produce words, phrases, or captions for the provided photos. The model combines a Convolutional Neural Network with Long Short-Term Memory to form a caption generator for images. Features are extracted from the picture using a pre-trained version of VGG16, and the LSTM acts as a decoder that creates descriptive text for the pictures. The model has been trained to produce descriptive captions or words based on an input picture. The effectiveness of the model is measured by means of BLEU scores. The Keras library, NumPy, and Jupyter notebooks are discussed as tools for developing this project. We also discuss the image classification task, how CNNs are employed, and the Flickr dataset.},
  keywords = {Deep Learning, LSTM, Caption, Description, Memory, Neural Network, VGG16, Image, CNN},
  month    = {},
}