Research Article

MedXGen: LLM leveraged Framework for Automated Clinical Coherent Medical Report Generation

by  A.N. Ramya Shree, Nithya N., Lavanya Kamaraju, Sahana M.B., Hrithwika, Supriya
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Issue 43
Published: September 2025
Authors: A.N. Ramya Shree, Nithya N., Lavanya Kamaraju, Sahana M.B., Hrithwika, Supriya
10.5120/ijca2025925746
PDF

A.N. Ramya Shree, Nithya N., Lavanya Kamaraju, Sahana M.B., Hrithwika, Supriya . MedXGen: LLM leveraged Framework for Automated Clinical Coherent Medical Report Generation. International Journal of Computer Applications. 187, 43 (September 2025), 23-28. DOI=10.5120/ijca2025925746

                        @article{ 10.5120/ijca2025925746,
                        author  = { A.N. Ramya Shree,Nithya N.,Lavanya Kamaraju,Sahana M.B.,Hrithwika,Supriya },
                        title   = { MedXGen:  LLM leveraged Framework for Automated Clinical Coherent Medical Report Generation },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 43 },
                        pages   = { 23-28 },
                        doi     = { 10.5120/ijca2025925746 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2025
                        %A A.N. Ramya Shree
                        %A Nithya N.
                        %A Lavanya Kamaraju
                        %A Sahana M.B.
                        %A Hrithwika
                        %A Supriya
                        %T MedXGen:  LLM leveraged Framework for Automated Clinical Coherent Medical Report Generation%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 43
                        %P 23-28
                        %R 10.5120/ijca2025925746
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Artificial intelligence-based automatic medical report creation has accelerated significantly since the introduction of cross-modal learning, sophisticated transformer structures, and knowledge-enhanced pretraining methods. The detector attention modules, adapter tuned vision language models, and graph-guided hybrid strategies are integrated to propose framework for automated medical report generation. Utilizing topic wise separable retrieval, hierarchical cross-modal alignment, and phrase-level augmentation, the proposed MedXGen confront semantic inconsistency, hallucination and redundancy. Memory-guided transformers and semi- supervised learning is used to enhance interpretability and adaptability. The suggested framework offers a practical implementation of clinical diagnostic support systems and it supports medical language creation and visual perception.

References
  • Y. Gu, J. Sun, X. Wang, R. Li and Z. Zhou, "Automatic Medical Report Generation Based on Detector Attention Module and GPT-Based Word LSTM," 2024 6th International Conference on Data-driven Optimization of Complex Systems (DOCS), Hangzhou, China, 2024,pp. 7-15, doi: 10.1109/DOCS63458.2024.10704337.
  • P. Kumar, F. Y. Alghayadh, V. P. Tamta, D. M. Anand, M. Soni and J.K. Chaudhary, "Artificial Intelligence based Healthcare Knowledge Structure for Automatic Medical Report Generation," 2024 7th International Conference on Contemporary Computing and Informatics (IC3I), Greater Noida, India, 2024, pp. 1569-1575, doi: 10.1109/IC3I61595.2024.10828853.
  • M. Makram and A. Mohammcd, "AI Applications in Medical Reporting and Diagnosis," 2024 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC), Cairo, Egypt, 2024, pp. 185-192, doi: 10.1109/MIUCC62295.2024.10783552.
  • X. Mei, L. Yang, D. Gao, X. Cai, J. Han and T. Liu, "Adaptive Medical Topic Learning for Enhanced Fine-Grained Cross-Modal Alignment in Medical Report Generation," in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2025.3543101.
  • Wu, Bang Yang, Zhiyu Ye, Haoqian Wang, Hairong Zheng, and Tong Zhang, "MAKEN: Improving Medical Report eration with Adapter Tuning and Knowledge Enhancement in Vision Language Foundation Models," in Proc. IEEE International Symposium on Biomedical Imaging (ISBI), 2024, pp. 1–5. DOI: 10.1109/ISBI56570.2024.10635421.
  • Zhizhe Liu, Zhenfeng Zhu, Shuai Zheng, Yawei Zhao, Kunlun He, and Yao Zhao, "From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation," IEEE Transactions on Multimedia, vol. 26, pp. 5987–5995, 2024. DOI:10.1109/TMM.2023.3342691
  • T. Yu, W. Lu, Y. Yang, W. Han, Q.Huang, J. Yu, and K. Zhang, "Adapter-Enhanced Hierarchical Cross Modal Pre-training for Lightweight Medical Report Generation," IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 2, pp. 2206-2220, Feb. 2025, doi: 10.1109/TNNLS.2023.3343391.
  • J. Zhao, Y. Zhou, Z. Chen, H. Fu, and L. Wan, "Topicwise Separable Sentence Retrieval for Medical Report Generation," IEEE Transactions on Medical Imaging, vol. 44, no. 3, pp. 1505–1514, Mar. 2025. DOI: 10.1109/TMI.2024.3507076.
  • L. Xu, Q. Tang, B. Zheng, J. Lv, W. Li and X. Zeng, "CGFTrans: Cross-Modal Global Feature Fusion Transformer for Medical Report Generation," in IEEE Journal of Biomedical and Health Informatics, vol. 28, no. 9, pp. 5600-5612, Sept. 2024, doi: 10.1109/JBHI.2024.3414413.
  • F. Alqahtani, M. M. Mohsan, K. Alshamrani, J. Zeb, S. Alhamami and D. Alqarni, "CNX-B2: A Novel CNN-Transformer Approach For Chest X-Ray Medical Report Generation," in IEEE Access, vol. 12, pp. 26626-26635, 2024, doi: 10.1109/ACCESS.2024.3367360.
  • Y. Tang, Y. Yuan, F. Tao and M. Tang, "Cross-Modal Augmented Transformer for Automated Medical Report Generation," in IEEE Journal of Translational Engineering in Health and Medicine, vol. 13,pp. 33-48, 2025, doi: 10.1109/JTEHM.2025.3536441.
  • Y. Wang, Y. Du and S. Tang, "Enhancement of medical report generation by multi-scale image deblurring strategy," 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-8,doi: 10.1109/IJCNN60899.2024.10650006.
  • J. Zhang, M. Cheng, X. Li, X. Shen, Y. Liu and Y. Wan, "Generating Medical Report via Joint Probability Graph Reasoning," in Tsinghua Science and Technology, vol. 30, no4, pp. 1685-1699, August 2025, doi: 10.26599/TST.2024.9010058.
  • Z. Xu, W. Xu, R. Wang, J. Chen, C. Qi and T. Lukasiewicz, "Hybrid Reinforced Medical Report Generation With M-Linear Attention and Repetition Penalty," in IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 2, pp. 2206-2220, Feb. 2025, doi: 10.1109/TNNLS.2023.3343391.
  • P. Divya, Y. Sravani, C. Vishnu, C. K. Mohan and Y. W. Chen, "Memory Guided Transformer With Spatio-Semantic Visual Extractor for Medical Report Generation," in IEEE Journal of Biomedical and Health Informatics, vol. 28, no. 5, pp. 3079-3089, May 2024, doi: 10.1109/JBHI.2024.3371894.
  • X. Mei, L. Yang, D. Gao, X. Cai, J. Han and T. Liu, "PhraseAug: An Augmented Medical Report Generation Model With Phrasebook," in IEEE Transactions on Medical Imaging, vol. 43, no. 12, pp. 4211-4223, Dec. 2024, doi: 10.1109/TMI.2024.3416190.
  • K. Zhang et al., "Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency," in IEEE Transactions on Multimedia, vol. 26, pp. 904-915, 2024, doi: 10.1109/TMM.2023.3273390.
  • Y. Jiang et al., "CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation," ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2025, pp. 1-5, doi: 10.1109/ICASSP49660.2025.10887699.
  • R. Li, Z. Wang and L. Zhang, "Image Caption and Medical Report Generation Based on Deep Learning: a Review and Algorithm Analysis," 2021 International Conference on Computer Information Science and Artificial Intelligence (CISAI), Kunming, China, 2021, pp. 373-379, doi: 10.1109/CISAI54367.2021.00078.
  • Y. Zhou, L. Huang, T. Zhou, H. Fu and L. Shao, "Visual-Textual Attentive Semantic Consistency for Medical Report Generation," 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 2021, pp. 3965-3974, doi: 10.1109/ICCV48922.2021.00395.
  • A. N. R. Shree and P. Kiran, "Sensitivity Context Aware Privacy Preserving Text Document Summarization," 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 2020, pp. 1517-1523, doi: 10.1109/ICECA49313.2020.9297415.
  • A. N. R. Shree and K. P, “Privacy Preserving Text Document Summarization,” Journal of Engineering Research and Sciences, vol. 1, no. 7, pp. 7–14, Jul. 2022, doi: 10.55708/js0107002.
  • Wang, Mengqian, Manhua Wang, Fei Yu, Yue Yang, Jennifer Walker, and Javed Mostafa. "A systematic review of automatic text summarization for biomedical literature and EHRs." Journal of the American Medical Informatics Association 28, no. 10, pp 2287-2297,2021.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Medical Report Generation Vision-Language Models Cross-Modal Learning Transformer Deep Learning Clinical Decision Support AI in Healthcare.

Powered by PhDFocusTM