A novel Deeplabv3+ and vision-based transformer model for segmentation and classification of skin lesions

Iqra Ahmad,Javaria Amin, Muhammad IkramUllah Lali,Farhat Abbas,Muhammad Imran Sharif

Biomedical Signal Processing and Control(2024)

引用 0|浏览0
暂无评分
摘要
Skin cancer (SC) is a common disease caused due to ultraviolet radiation. Accurate SC detection is degraded due to some artifacts such as lesion variations in shape, size, color, texture, hairs, poor contrast, brightness, and irregular lesion boundaries. To solve these limitations, a deep learning-based technique is proposed that consists of segmentation and classification of SC. The DeepLabv3+ segmentation model is designed that consist of 9 convolutional neural network blocks. Each block comprises 19 convolution, 18 rectified linear units, and 18 batch normalization layers. The model is evaluated on ISIC-16, 17, 18, and PH2 datasets that provide accuracy of 98.90 %, 98.38 %, 99.45 %, and 100 %, respectively. Another Vision Transformer (ViT) model is developed for the classification of skin lesions (SL). The ViT model performs better than CNN because ViT works as a token while CNN works pixel to pixel. The ViT model consists of eight blocks, each with 17 normalization, 8 multi-head attention, 19 dense, and 19 dropout layers with a 7x7 patch size. The model is evaluated on PH2, ISIC-19, ISIC-20, and HAM10000 datasets that provided an accuracy of 100 %, 96.97 %, 97.73 %, and 100 % respectively. The results are better than existing methods.
更多
查看译文
关键词
Vision transformer,Patch,Deeplabv3+,Skin Lesions,Dermoscopy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要