Bahmane, Karima and Aksasse, Hamid and Alkhalil Chaouki, Brahim (2025) A Case Study of DenseNet, ResNet and Vision Transformers for Thyroid Nodule Analysis in Medical Imaging. International Journal of Innovative Science and Research Technology, 10 (3): 25mar1818. pp. 3197-3205. ISSN 2456-2165

[thumbnail of IJISRT25MAR1818.pdf] Text
IJISRT25MAR1818.pdf - Published Version

Download (882kB)

Abstract

In order to classify thyroid nodules using ultrasound imaging [1], this study assesses the effectiveness of three deep learning models: Vision Transformer (ViT), DenseNet, and ResNet. Seven thousand thyroid ultrasound pictures from Morocco's Hassan II Hospital (2005–2022) were utilized as the dataset. Accuracy, F1-score, sensitivity, and specificity were important parameters. DenseNet did somewhat better with 89.3% accuracy and F1-score than ResNet, which had 87.7% accuracy and an 87.8% F1-score. ViT outperformed both, achieving 91.5% accuracy and a 91.4% F1-score, demonstrating superior global context capture. ResNet excels in gradient flow optimization, DenseNet in feature propagation for smaller datasets, and ViT in versatility but requires larger datasets. The study highlights trade-offs between transformer-based and CNN-based architectures, emphasizing the importance of dataset characteristics and task requirements for optimal diagnostic outcomes in medical imaging.

Item Type: Article
Subjects: R Medicine > R Medicine (General)
Divisions: Faculty of Medicine, Health and Life Sciences > School of Medicine
Depositing User: Editor IJISRT Publication
Date Deposited: 22 Apr 2025 11:41
Last Modified: 22 Apr 2025 11:41
URI: https://eprint.ijisrt.org/id/eprint/536

Actions (login required)

View Item
View Item