NARRATIVE AI: VISUAL STORYCRAFT

  • Unique Paper ID: 171576
  • PageNo: 609-612
  • Abstract:
  • This project leverages artificial intelligence to create interactive and dynamic storytelling from images. By combining image captioning, language models, and text-to-speech technologies, the system generates engaging narratives based on the content of uploaded images. Using the BLIP image captioning model, the project converts visual content into descriptive text. This description is then processed by the Falcon-7B-Instruct language model to generate a short story. The final output is a text-to-speech conversion of the story, offering a fully automated, multimedia storytelling experience. The system is built using Python, Streamlit, and Hugging Face APIs, providing users with an easy-to-use web interface to upload images, generate stories, and listen to the narratives.
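
The three stages named in the abstract (caption, story, speech) can be sketched as a simple chain of functions. This is a minimal illustration, not the authors' implementation: the names `build_prompt`, `generate_story`, and `StoryResult`, the prompt wording, and the 100-word limit are all assumptions. The stages are passed in as plain callables so the sketch runs without downloading any model; in the described system they would presumably wrap BLIP, Falcon-7B-Instruct, and a text-to-speech engine.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures; the source names the models but shows no code.
Captioner = Callable[[bytes], str]    # image bytes -> caption (e.g. a BLIP wrapper)
StoryWriter = Callable[[str], str]    # prompt -> story (e.g. a Falcon-7B-Instruct wrapper)
Speaker = Callable[[str], bytes]      # story text -> audio bytes (any TTS engine)


@dataclass
class StoryResult:
    caption: str
    prompt: str
    story: str
    audio: bytes


def build_prompt(caption: str, max_words: int = 100) -> str:
    """Wrap the caption in an instruction for the language model (assumed wording)."""
    return (
        f"Write a short story of at most {max_words} words "
        f"based on this scene: {caption.strip()}"
    )


def generate_story(
    image: bytes, captioner: Captioner, writer: StoryWriter, speaker: Speaker
) -> StoryResult:
    """Run the caption -> story -> speech chain and keep every intermediate result."""
    caption = captioner(image)
    prompt = build_prompt(caption)
    story = writer(prompt).strip()
    audio = speaker(story)
    return StoryResult(caption=caption, prompt=prompt, story=story, audio=audio)


if __name__ == "__main__":
    # Stub stages stand in for the real models so the sketch runs offline.
    result = generate_story(
        b"fake-image-bytes",
        captioner=lambda img: "a dog running on a beach",
        writer=lambda prompt: "Once upon a time, a dog raced the waves.",
        speaker=lambda text: text.encode("utf-8"),
    )
    print(result.caption)
    print(result.story)
```

Keeping each stage behind a callable means any one model can be swapped or mocked without touching the rest of the chain, which also suits a Streamlit front end that only needs to call `generate_story` once per uploaded image.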

Copyright & License

Copyright © 2026. The authors retain the copyright of this article. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

BibTeX

@article{171576,
        author = {Prajwal D and Hemanth B and Sharath M D and Mohammad Wafeeq and Aneesh Kumar A},
        title = {NARRATIVE AI: VISUAL STORYCRAFT},
        journal = {International Journal of Innovative Research in Technology},
        year = {2025},
        volume = {11},
        number = {8},
        pages = {609--612},
        issn = {2349-6002},
        url = {https://ijirt.org/article?manuscript=171576},
        abstract = {This project leverages artificial intelligence to create interactive and dynamic storytelling from images. By combining image captioning, language models, and text-to-speech technologies, the system generates engaging narratives based on the content of uploaded images. Using the BLIP image captioning model, the project converts visual content into descriptive text. This description is then processed by the Falcon-7B-Instruct language model to generate a short story. The final output is a text-to-speech conversion of the story, offering a fully automated, multimedia storytelling experience. The system is built using Python, Streamlit, and Hugging Face APIs, providing users with an easy-to-use web interface to upload images, generate stories, and listen to the narratives.},
        keywords = {Artificial intelligence, interactive storytelling, dynamic storytelling, image captioning, language models, text-to-speech (TTS), BLIP model, Falcon-7B-Instruct model, descriptive text generation, short story creation, multimedia experience, Python, Streamlit, Hugging Face APIs, web interface, narrative generation},
        month = {January},
        }

Cite This Article

D, P., B, H., D, S. M., Wafeeq, M., & A, A. K. (2025). NARRATIVE AI: VISUAL STORYCRAFT. International Journal of Innovative Research in Technology (IJIRT), 11(8), 609–612.
