Skip to content

MedViTV2: Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention

License

Notifications You must be signed in to change notification settings

Omid-Nejati/MedViTV2

Repository files navigation

Paper Open In Colab

MedViT

MedViT-V2: Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention

figure4

Train & Test --- Prepare data

To train or evaluate MedViT models on 17 medical datasets, follow this "Evaluation".

⚠️ Important: This code also supports training all TIMM models.

Visual Examples

You can find a tutorial for visualizing the Grad-CAM heatmap of MedViT in this repository "visualize".

📊 Performance Overview

Below is the performance summary of MedViT on various medical imaging datasets.
🔹 Model weights will be available soon.

Dataset Task Overall Accuracy (%)
PAD-UFES-20 Multi-Class (6) 63.6
ISIC2018 Multi-Class (7) 77.1
Fetal-Planes-DB Multi-Class (6) 95.3
CPN X-ray Multi-Class (3) 98.2
Kvasir Multi-Class (8) 82.8
ChestMNIST Multi-Class (14) 96.3
PathMNIST Multi-Class (9) 95.9
DermaMNIST Multi-Class (7) 78.1
OCTMNIST Multi-Class (4) 92.7
PneumoniaMNIST Multi-Class (2) 95.1
RetinaMNIST Multi-Class (5) 54.7
BreastMNIST Multi-Class (2) 88.2
BloodMNIST Multi-Class (8) 97.9
TissueMNIST Multi-Class (8) 69.9
OrganAMNIST Multi-Class (11) 95.8
OrganCMNIST Multi-Class (11) 93.5
OrganSMNIST Multi-Class (11) 82.4

License

MedViT is released under the MIT License.

💖🌸 If you find my GitHub repository useful, please consider giving it a star!🌟

References

Citation

@article{manzari2025medical,
  title={Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention},
  author={Manzari, Omid Nejati and Asgariandehkordi, Hojat and Koleilat, Taha and Xiao, Yiming and Rivaz, Hassan},
  journal={arXiv preprint arXiv:2502.13693},
  year={2025}
}

@article{manzari2023medvit,
  title={MedViT: a robust vision transformer for generalized medical image classification},
  author={Manzari, Omid Nejati and Ahmadabadi, Hamid and Kashiani, Hossein and Shokouhi, Shahriar B and Ayatollahi, Ahmad},
  journal={Computers in Biology and Medicine},
  volume={157},
  pages={106791},
  year={2023},
  publisher={Elsevier}
}

About

MedViTV2: Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published