A Comparative Analysis of Vision Transformers and Convolutional Neural Networks in Cardiac Image Segmentation.

Sebástion Granizo,Maria G. Baldeon Calisto, Milena Iñiguez,Danny Navarrete,Daniel Riofrío, Noel Pérez-Pérez,Diego S. Benítez, Ricardo Flores Moyano

International Symposium on Digital Forensics and Security（2024）

引用 0|浏览0

暂无评分

摘要

In recent years, Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have emerged as dominant automated cardiac image segmentation methods. CNNs are efficient architectures that capture local spatial patterns, whereas ViTs can model long-range global dependencies. Each network has been shown to provide better performance on certain types of tasks and datasets. In this work, we conducted a comparative analysis between ViTs and CNNs in the context of cardiac image segmentation. We statistically evaluated the performance of five CNNs and ViTs architectures using the publicly available Automated Cardiac Diagnosis Challenge (ACDC) MRI dataset. Employing a one-way ANOVA and Tukey is test, our analysis indicates that CNNs exhibit superior performance compared to Transformers in segmenting the right ventricle cavity, the left ventricle cavity, and the left ventricle myocardium. Furthermore, CNN architectures tend to be smaller and easier to train. Among all the networks considered, LinkN et achieves the highest performance with a mean dice of 0.8965 and a mean ASSD of 0.2960.

查看译文

关键词

Cardiac MRI Segmentation,Convolutional Neural Networks (CNNs),Image Segmentation,Transformers,Vision Transformers (ViT)

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要