Explainability of Tuberculosis Diagnosis based on Chest X-Ray Images with Vision Transformer

Authors

  • Ramkumar Thirunavukarasu
  • Evans Kotei, Kumasi Technical University
  • Thillainayagam

DOI:

https://doi.org/10.56042/jsir.v85i1.5865

Keywords:

Deep learning, Machine learning, Medical image analysis, Self-attention, Transformer network

Abstract

Chest X-ray radiography is a relatively inexpensive and widely available diagnostic technology that can aid in identifying illnesses such as tuberculosis (TB), pneumonia, and COVID-19. The need for skilled personnel to evaluate X-ray radiographs is a challenge in many health facilities across the globe, particularly in underdeveloped regions. Machine learning (ML) algorithms have enabled the automated diagnosis of TB from X-ray images. Alongside deep convolutional neural networks (DCNNs) for vision applications, the Vision Transformer (ViT) network has also produced outstanding results in image classification. Motivated by the robustness of the transformer network on image processing tasks, the study proposes a transformer-based framework for early screening of TB. Three vision transformer variants, ViT-Base16 (ViT-B16), ViT-Base32 (ViT-B32), and ViT-Large32 (ViT-L32), were tested to see how well they identify tuberculosis. When the transformer models' outcomes were compared with those of CNNs, the ViT-B32 model performed strongly in the diagnostic procedure. On TB classification, the ViT-B32 model attained an accuracy, sensitivity, specificity, precision, F1 score, and AUC of 96.96%, 96.89%, 97.01%, 96.72%, 96.80%, and 0.97, respectively. The ViT-B32 model demonstrated superiority and generalizability. Because of its low cost and ease of use, the ViT-B32 model may provide an accurate diagnostic system for early screening of all TB patients.
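
For readers less familiar with the metrics reported above, the following is a minimal sketch of how accuracy, sensitivity, specificity, precision, and F1 score are conventionally derived from a binary confusion matrix. It is not the authors' evaluation code. The function name `binary_metrics` and the counts in the usage example are hypothetical and are not taken from the study. AUC is omitted because it requires the model's continuous output scores rather than confusion-matrix counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BinaryMetrics:
    accuracy: float
    sensitivity: float  # also called recall or true-positive rate
    specificity: float  # true-negative rate
    precision: float    # positive predictive value
    f1: float           # harmonic mean of precision and sensitivity


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> BinaryMetrics:
    """Compute standard binary classification metrics from raw counts.

    tp: TB cases predicted as TB        fp: normal cases predicted as TB
    tn: normal cases predicted normal   fn: TB cases predicted as normal
    """
    if min(tp, fp, tn, fn) < 0:
        raise ValueError("counts must be non-negative")
    if tp + fn == 0 or tn + fp == 0 or tp + fp == 0:
        raise ValueError("metrics undefined: a required denominator is zero")

    accuracy = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return BinaryMetrics(accuracy, sensitivity, specificity, precision, f1)


if __name__ == "__main__":
    # Hypothetical counts for illustration only (not from the paper).
    m = binary_metrics(tp=90, fp=20, tn=80, fn=10)
    print(f"accuracy={m.accuracy:.4f} sensitivity={m.sensitivity:.4f} "
          f"specificity={m.specificity:.4f} precision={m.precision:.4f} "
          f"f1={m.f1:.4f}")
```

In a screening setting, sensitivity is usually the metric watched most closely, since a false negative corresponds to a missed TB case.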

 

Author Biography

  • Ramkumar Thirunavukarasu

    Vellore Institute of Technology, Vellore, Tamil Nadu, India

Published

14-05-2026

Section

Computer Sciences, Communication and Information Technology

How to Cite

Explainability of Tuberculosis Diagnosis based on Chest X-Ray Images with Vision Transformer. (2026). Journal of Scientific & Industrial Research (JSIR), 85(1), 36-43. https://doi.org/10.56042/jsir.v85i1.5865
