Bahmane, Karima and Aksasse, Hamid and Alkhalil Chaouki, Brahim (2025) A Case Study of DenseNet, ResNet and Vision Transformers for Thyroid Nodule Analysis in Medical Imaging. International Journal of Innovative Science and Research Technology, 10 (3): 25mar1818. pp. 3197-3205. ISSN 2456-2165
![IJISRT25MAR1818.pdf [thumbnail of IJISRT25MAR1818.pdf]](https://eprint.ijisrt.org/style/images/fileicons/text.png)
IJISRT25MAR1818.pdf - Published Version
Download (882kB)
Abstract
In order to classify thyroid nodules using ultrasound imaging [1], this study assesses the effectiveness of three deep learning models: Vision Transformer (ViT), DenseNet, and ResNet. Seven thousand thyroid ultrasound pictures from Morocco's Hassan II Hospital (2005–2022) were utilized as the dataset. Accuracy, F1-score, sensitivity, and specificity were important parameters. DenseNet did somewhat better with 89.3% accuracy and F1-score than ResNet, which had 87.7% accuracy and an 87.8% F1-score. ViT outperformed both, achieving 91.5% accuracy and a 91.4% F1-score, demonstrating superior global context capture. ResNet excels in gradient flow optimization, DenseNet in feature propagation for smaller datasets, and ViT in versatility but requires larger datasets. The study highlights trade-offs between transformer-based and CNN-based architectures, emphasizing the importance of dataset characteristics and task requirements for optimal diagnostic outcomes in medical imaging.
Item Type: | Article |
---|---|
Subjects: | R Medicine > R Medicine (General) |
Divisions: | Faculty of Medicine, Health and Life Sciences > School of Medicine |
Depositing User: | Editor IJISRT Publication |
Date Deposited: | 22 Apr 2025 11:41 |
Last Modified: | 22 Apr 2025 11:41 |
URI: | https://eprint.ijisrt.org/id/eprint/536 |