Exploring Generative AI for Voice-Driven Interfaces: A Literature Review
Abstract
Voice-driven interfaces are evolving rapidly due to continuous advancements in generative artificial intelligence. The paper examines that how generative AI technologies work together with voice user interfaces and studies the relationship between them. It discusses the existing research along with the underlying technical frameworks while addressing usability aspects, ethical concerns, and possible future developments. Then the review together brings insights from recent studies to emphasize the key challenges as well as the new opportunities that can guide designers and researchers working in this area.
References
C. Zhang, J. He, M. Wang, and Y. Zhang, “A Survey on Audio Diffusion Models: Text-to-Speech Synthesis and Enhancement in Generative AI,” arXiv preprint arXiv:2307.04670, 2023.
J. Bieniek, M. Rahouti, and D. C. Verma, “Generative AI in Multimodal User Interfaces: Trends, Challenges, and Cross-Platform Adaptability,” arXiv preprint arXiv:2401.02043, 2024.
H. Papneja and N. Yadav, “Self-Disclosure to Conversational AI: A Literature Review, Emergent Framework, and Directions for Future Research,” Computers in Human Behavior Reports, vol. 12, pp. 101–119, 2024.
M. Nayak, J. Kangas, and R. Raisamo, “A Study of NLP-Based Speech Interfaces in Medical Virtual Reality,” Multimodal Technologies and Interaction, vol. 9, no. 1, pp. 18–29, 2025.
G. G. Genelza, “A Systematic Literature Review on AI Voice Cloning Generator: A Game-Changer or a Threat?,” Journal of Emerging Technologies, vol. 14, no. 2, pp. 53–70, 2024.
Y. Zhang, H. Lin, and L. Kang, “AI Versus Human-Generated Voices and Avatars: Rethinking User Engagement and Cognitive Load,” Education and Information Technologies, vol. 30, pp. 223–242, 2025.
C. A. C. Urzúa, A. Martinez, and D. Perez, “Effects of AI-Assisted Feedback via Generative Chat on Academic Writing: A Systematic Review,” Education Sciences, vol. 15, no. 4, pp. 1–21, 2025.
R. Alabduljabbar, “User-Centric AI: Evaluating the Usability of Generative AI Applications Through User Reviews on App Stores,” PeerJ Computer Science, vol. 11, pp. 1–16, 2024.
A. dos Santos, R. V. Souza, and M. P. Silva, “An Automated Literature Survey of Data-Driven Speech Enhancement Methods,” Acta Acustica, vol. 110, no. 1, pp. 88–102, 2024.
R. Mundra, J. Kohli, and P. Bhatt, “Value to User’s Voice: A Generative AI Framework for Actionable Insights from Customer Reviews,” Proceedings of ICON 2024, pp. 351–359, 2024.
Refbacks
- There are currently no refbacks.