Open Access Open Access  Restricted Access Subscription Access

AUTOMATIC CAPTION GENERATION FOR CHEST X-RAY USING CNN ALGORITHM

Simaran Singh, Pallavi Pandey, Dr. Atul Kumar, Dr. Vibha Srivastava

Abstract


The automatic caption generation of chest X-ray report is a hot research topic at present. Image captioning aims to automatically describe the relationship of an image with a sentence, and this work has attracted research from both computer vision and natural language processing research communities. This research paper proposes a novel approach to automatically generating captions for medical images using Convolutional Neural Network (CNN) algorithm. The system was trained on a large dataset of medical images and their corresponding captions, and was evaluated using a variety of metrics including BLEU score and human evaluation. The results indicate that the proposed approach outperforms existing captioning systems in terms of caption accuracy and fluency. The proposed system has potential applications in the healthcare domain, where accurate and timely description of medical images can be critical for diagnosis and treatment.


Full Text:

PDF

References


O. Vinyals, A. Toshev, S. Bengio, and D. Erhan, “Show and tell: A neural image caption generator,” in IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, 2015. [Online].Available: https://doi.org/10.1109/CVPR.2015.7298935

K. Xu, J. Ba, R. Kiros, K. Cho, A. C. Courville, R. Salakhutdinov, R. S. Zemel, and Y. Bengio, “Show, attend and tell: Neural image caption generation with visual attention,” in Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, 2015. [Online]. Available: http://jmlr.org/proceedings/papers/v37/xuc15.html

A. Aker and R. Gaizauskas. Generating image descriptions using dependency relational patterns. In ACL, 2010.

A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every pic-ture tells a story: Generating sentences from images. In ECCV, 2010

Changchang Yin∗, Buyue Qian†, Jishang Wei‡, Xiaoyu Li∗, Xianli Zhang∗, Yang Li∗, Qinghua Zheng.” Automatic Generation of Medical Imaging Diagnostic Report with Hierarchical Recurrent Neural Network” in IEEE Conference on Data Mining(ICDM), 2019.

Hoo-Chang Shin1, Kirk Roberts2, Le Lu1, Dina Demner-Fushman 2, Jianhua Yao 1, Ronald M Summers 1,” Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation” in IEEE Conference on Computer Vision and Pattern Recognition, 2016.

Xin Huang, Biao Zhong, Yuanlong Cao, Yugen Yi “Chest X-Ray Lung Chinese Discription Generation based on Semantic Labels and Hierarchical LSTM” in IEEE International Conference on Bioinformatics and Biomedicine(BIBM), 2020.

U. Avni, H. Greenspan, E. Konen, M. Sharon, and J. Goldberger. X-ray categorization and retrieval on the organ and pathology level, using patch-based visual words. Medical Imaging, IEEE Transactions on, 2011.

S. Bird, E. Klein, and E. Loper. Natural language processing with Python. ” O’Reilly Media, Inc.”, 2009

C. Eickhoff, I. Schwall, A. G. S. de Herrera, and H. Muller, ¨ “Overview of imageclefcaption 2017 - image caption prediction and concept detection for biomedical images,” in Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum, Dublin, Ireland, September 11-14, 2017., 2017. [Online]. Available: http://ceur-ws.org/Vol-1866/invited paper 7.pdf


Refbacks

  • There are currently no refbacks.