About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I

Research Article

Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation

Download11 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.4108/eai.28-4-2025.2357931,
        author={Roopa.  R and Thammisetty  Swetha and Thejaswini.  P and Sravani.  K and Sambasiva.  G and Vamsi.  A},
        title={Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation},
        proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I},
        publisher={EAI},
        proceedings_a={ICITSM PART I},
        year={2025},
        month={10},
        keywords={smart image chatbot computer vision deep learning natural language processing (nlp) conversational ai speech synthesis text-to-speech (tts) image recognition ai-generated images generative ai multimodal interaction human-ai interaction automated image analysis accessibility technology content creation},
        doi={10.4108/eai.28-4-2025.2357931}
    }
    
  • Roopa. R
    Thammisetty Swetha
    Thejaswini. P
    Sravani. K
    Sambasiva. G
    Vamsi. A
    Year: 2025
    Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation
    ICITSM PART I
    EAI
    DOI: 10.4108/eai.28-4-2025.2357931
Roopa. R1,*, Thammisetty Swetha1, Thejaswini. P1, Sravani. K1, Sambasiva. G1, Vamsi. A1
  • 1: Madanapalle Institute of Technology & Science
*Contact email: roopa509@gmail.com

Abstract

In the past couple of years, there has been an increasing focus on turning images into word descriptions. In this case, images are subject to analysis, described in words, and conveyed in voice. It helps those with disabilities gain access to media, give users a more immersive experience, and makes online content interesting. Smart Image Chatbot has specialized in making human engagement with images much better. Users can upload an image and ask a question about it, and the system will respond either verbally or in writing. This type of program can serve the blind users of this system because it helps them perceive visual information with the help of hearing. Another delightful component is its generation of new images coming from upload by the user. This function allows individuals to directly and manually edit or create images, no matter learning, content creation, online assistance: software gives life to image editing; it opens its implementation to everyone. This article includes all about the development of Smart Image Chatbot, the development methods used in it, and the problems faced therein. It will also explore possible extensions and how the technology helps make interaction over the Internet more engaging and accessible to all. The paper discusses the architecture, approaches, and deployment of the Smart Image Chatbot, including its technical infrastructure, limitations, and potential for future research.

Keywords
smart image chatbot, computer vision, deep learning, natural language processing (nlp), conversational ai, speech synthesis, text-to-speech (tts), image recognition, ai-generated images, generative ai, multimodal interaction, human-ai interaction, automated image analysis, accessibility technology, content creation
Published
2025-10-13
Publisher
EAI
http://dx.doi.org/10.4108/eai.28-4-2025.2357931
Copyright © 2025–2025 EAI
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL