Open Access Open Access  Restricted Access Subscription Access

VisuBot: A Real-Time Object Describer for Visually Impaired Users Using YOLOv8

Jay Kiran Patil, Subodh Shivaji Wadekar, Rajkumar Lakshman Kale, Uday Rakesh Godari, Mr. D. B. Ghorpade

Abstract


Visual impairment significantly restricts independent mobility, environmental awareness, and social participation, particularly in developing countries such as India where access to assistive technologies remains limited. Traditional navigation aids like white canes provide only basic obstacle detection and lack contextual intelligence. Recent advances in artificial intelligence, deep learning, and computer vision have enabled the development of intelligent assistive systems capable of interpreting visual scenes in real time.

This paper presents VisuBot, a real-time object describer designed specifically for visually impaired users. The proposed system employs the YOLOv8 deep learning model trained on the COCO dataset to detect navigation-relevant objects in real time. Detected objects are analyzed to estimate direction using a clock-based spatial mapping technique and distance using bounding box area approximation. The processed information is converted into natural language audio feedback using an offline text-to- speech engine.


Full Text:

PDF

References


J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv preprint arXiv:1804.02767, 2018.

A. Sharma and P. Kumar, "Vision Aid for Blind People Using YOLOv8 and Text-to-Speech Conversion," ResearchGate, 2023.

R. Gupta and S. Singh, "YOLOInsight: AI-Powered Assistive Device for Visually Impaired Individuals," Cureus Journal of Medical Science, vol. 15, no. 8, 2023.

J. Lee and H. Park, "Performance Evaluation of YOLOv8 for Real-Time Object Detection in Assistive Navigation," Procedia Computer Science, vol. 220, pp. 1234-1241, 2025.

V. Kumar and M. Joshi, "Smart Assistive Navigation Stick using YOLOv8 Object Detection," ResearchGate, 2023.

T. Ahmed and F. Rahman, "Spatial Awareness Techniques for Visually Impaired Navigation Systems," Zenodo, 2023.

T.-Y. Lin et al., "Microsoft COCO: Common Objects in Context," in Proc. European Conference on Computer Vision (ECCV), 2014, pp. 740-755.

World Health Organization, "Blindness and Vision Impairment," WHO Fact Sheet, 2023.

G. Jocher et al., "Ultralytics YOLOv8," GitHub repository, https://github.com/ultralytics/ultralytics, 2023.

S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137-1149, 2017.


Refbacks

  • There are currently no refbacks.