SmartShield: A Hybrid Deep Learning-Based Chrome Extension for Real-Time Detection of Spam and Toxic Comments on YouTube
Abstract
The rapid proliferation of spam and toxic comments on video-sharing platforms like YouTube has become a significant threat to user safety and digital well-being. This paper proposes ”SmartShield,” a robust hybrid deep learning-based Chrome extension designed for real-time content moderation. The system integrates state-of-the-art architectures, including RoBERTa for deep contextual analysis and a hybrid CNN-LSTM network to capture sequential patterns and linguistic obfuscations. To ensure high reliability, an ensemble layer utilizing SVM and XGBoost is employed for final classification. Beyond mere detection, the framework incorporates Explainable AI (XAI) through SHAP and LIME, providing users with transparent insights into why specific comments were flagged. The extension operates seam- lessly at the browser level, utilizing a Flask-based backend and the YouTube Data API v3 for immediate intervention. Experimental evaluations demonstrate that this hybrid approach significantly outperforms standalone models, achieving superior accuracy and a reduced false-positive rate. This research provides a scalable solution to online harassment, empowering users with a proactive defense mechanism against evolving cyber threats The proposed hybrid model achieved an overall accuracy of 94.9%, with a precision of 95.2%, recall of 94.6%, and F1-score of 94.8%. Experimental evaluation demonstrates that the ensemble approach outperforms individual baseline models such as SVM, CNN-LSTM, and RoBERTa when used independently.
References
H. Valpadasu, P. Chakri, P. Harshitha, and P. Tarun, “Machine Learning Based Spam Comments Detection on YouTube,” in Proc. IEEE 7th Int. Conf. Computer Communication and Informatics (ICCCI), 2023, pp. 1– 6.
A. S. Xiao and Q. Liang, “Spam detection for YouTube video comments using machine learning approaches,” Machine Learning with Applica- tions, vol. 16, 2024, Art. no. 100550.
S. Douzi, F. A. AlShahwan, M. Lemoudden, and B. El Ouahidi, “Hybrid email spam detection model using artificial intelligence,” International Journal of Machine Learning and Computing, vol. 10, no. 2, pp. 245– 252, 2020.
P. Nagaraj, K. M. Sudar, P. Thrived, P. G. K. Reddy, S. B. Babu, and
P. S. R. Krishna, “YouTube Comment Spam Detection,” in Proc. IEEE Int. Conf. Computer Communication and Informatics (ICCCI), 2023, pp. 1–6.
V. Niranjani, Y. Agalya, K. Charunandhini, K. Gayathri, and R. Gayathri, “Spam detection for social media networks using machine learning,” in Proc. IEEE 8th Int. Conf. Advanced Computing and Communication Systems (ICACCS), 2022, pp. 2082–2088.
R. Abinaya and P. Naveen, “Spam detection on social media platforms,” in Proc. IEEE 7th Int. Conf. Smart Structures and Systems (ICSSS), 2020, pp. 1–3.
K. Danilchenko, M. Segal, and D. Vilenchik, “Opinion spam detection: A new approach using machine learning and network-based algorithms,” in Proc. Int. AAAI Conf. Web and Social Media, vol. 16, pp. 125–134, 2022.
M. Mashaleh et al., “Detecting Spam Email with Machine Learning Optimized with Harris Hawks Optimizer (HHO),” Procedia Computer Science, vol. 201, pp. 659–664, 2022.
U. K. Sah and N. Parmar, “An approach for malicious email spam de- tection using comparison of different classifiers,” International Research Journal of Engineering and Technology (IRJET), vol. 4, no. 8, pp. 2238– 2242, 2017.
Madhur, “5000 YouTube Spam/Not Spam Dataset,” Kaggle, 2022. [Online]. Available: https://www.kaggle.com/datasets/madhuragl/5000- youtube-spamnot-spam-dataset. Accessed: Jan. 2026.
Refbacks
- There are currently no refbacks.