Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation

Roopa. R; Thammisetty Swetha; Thejaswini. P; Sravani. K; Sambasiva. G; Vamsi. A

Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I

Research Article

Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation

Download375 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.4108/eai.28-4-2025.2357931,
    author={Roopa.  R and Thammisetty  Swetha and Thejaswini.  P and Sravani.  K and Sambasiva.  G and Vamsi.  A},
    title={Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation},
    proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I},
    publisher={EAI},
    proceedings_a={ICITSM PART I},
    year={2025},
    month={10},
    keywords={smart image chatbot computer vision deep learning natural language processing (nlp) conversational ai speech synthesis text-to-speech (tts) image recognition ai-generated images generative ai multimodal interaction human-ai interaction automated image analysis accessibility technology content creation},
    doi={10.4108/eai.28-4-2025.2357931}
}

Roopa. R
Thammisetty Swetha
Thejaswini. P
Sravani. K
Sambasiva. G
Vamsi. A
Year: 2025
Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation
ICITSM PART I
EAI
DOI: 10.4108/eai.28-4-2025.2357931

Roopa. R¹^,*, Thammisetty Swetha¹, Thejaswini. P¹, Sravani. K¹, Sambasiva. G¹, Vamsi. A¹

1: Madanapalle Institute of Technology & Science

*Contact email: roopa509@gmail.com

Abstract

In the past couple of years, there has been an increasing focus on turning images into word descriptions. In this case, images are subject to analysis, described in words, and conveyed in voice. It helps those with disabilities gain access to media, give users a more immersive experience, and makes online content interesting. Smart Image Chatbot has specialized in making human engagement with images much better. Users can upload an image and ask a question about it, and the system will respond either verbally or in writing. This type of program can serve the blind users of this system because it helps them perceive visual information with the help of hearing. Another delightful component is its generation of new images coming from upload by the user. This function allows individuals to directly and manually edit or create images, no matter learning, content creation, online assistance: software gives life to image editing; it opens its implementation to everyone. This article includes all about the development of Smart Image Chatbot, the development methods used in it, and the problems faced therein. It will also explore possible extensions and how the technology helps make interaction over the Internet more engaging and accessible to all. The paper discusses the architecture, approaches, and deployment of the Smart Image Chatbot, including its technical infrastructure, limitations, and potential for future research.

Keywords: smart image chatbot, computer vision, deep learning, natural language processing (nlp), conversational ai, speech synthesis, text-to-speech (tts), image recognition, ai-generated images, generative ai, multimodal interaction, human-ai interaction, automated image analysis, accessibility technology, content creation

Published: 2025-10-13
Publisher: EAI

: http://dx.doi.org/10.4108/eai.28-4-2025.2357931

Smart Image Interpretation Chatbot With Speech Synthesis And Image Generation

Abstract

About EAI

Community

Publish with EAI