
A Multi-Scale Two-Branch Fusion Network for Simultaneous Segmentation in Electronic Laryngoscope Images

Digital Signal Processing (2023)

Three issues reduce the performance of networks for the simultaneous segmentation of organs and lesions in electronic laryngoscope images. First, the moving endoscope causes noticeable variations in the shape and angle of lesions and organs. Second, the lesions, mainly polyps, and the major organs differ considerably in size. Third, the boundaries between lesions or organs and their backgrounds are often indistinct because their color and texture are very close to those of the mucosal tissue. To improve simultaneous segmentation accuracy, we propose a multi-scale two-branch fusion network (MsFusionNet), which adopts an asymmetric two-branch structure to fuse the fine-grained feature maps extracted by a convolutional neural network with the global context feature maps extracted by a Vision Transformer at different scales. In addition, a Multi-scale Dark Part Feature Enhancement module (MsDFE) is designed to enhance the non-salient details of organs before feature fusion in the two-branch network. To evaluate the universality and effectiveness of the proposed method, we used a mixed dataset collected from three institutions, comprising 2425 electronic laryngoscope images with major organs in the pharynx and larynx. The results show that the proposed method outperforms nine existing segmentation networks on the experimental dataset, indicating good potential for clinical practice.
Keywords: Electronic laryngoscope, Simultaneous segmentation, Two-branch network, Multi-scale features fusion, Dark part feature enhancement
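
The abstract describes fusing fine-grained CNN feature maps with global context features from a Vision Transformer at several scales, but it does not specify the fusion operator. As a rough illustration only, the following NumPy sketch assumes one common choice: reshape the transformer tokens into a grid, resize that grid to the CNN map's resolution, concatenate along the channel axis, and mix channels with a 1x1 projection. The function names (`tokens_to_map`, `upsample_nearest`, `fuse_scale`), the tensor shapes, and the random projection weights are all hypothetical and are not taken from MsFusionNet; the MsDFE module is not modeled here.

```python
import numpy as np


def tokens_to_map(tokens, grid_hw):
    """Reshape (N, D) transformer tokens into a (D, h, w) feature map.

    Assumes tokens are stored in row-major patch order (hypothetical layout).
    """
    h, w = grid_hw
    n, d = tokens.shape
    if n != h * w:
        raise ValueError("token count must equal h * w")
    return tokens.T.reshape(d, h, w)


def upsample_nearest(fmap, out_hw):
    """Nearest-neighbour resize of a (C, h, w) map to (C, H, W)."""
    _, h, w = fmap.shape
    out_h, out_w = out_hw
    rows = np.arange(out_h) * h // out_h  # source row for each output row
    cols = np.arange(out_w) * w // out_w  # source column for each output column
    return fmap[:, rows[:, None], cols[None, :]]


def fuse_scale(cnn_map, tokens, grid_hw, proj):
    """Fuse local (CNN) and global (transformer) features at one scale.

    Concatenates along channels, then applies a 1x1 projection `proj`
    of shape (C_out, C_cnn + D). This is an assumed fusion rule.
    """
    global_map = upsample_nearest(tokens_to_map(tokens, grid_hw), cnn_map.shape[1:])
    stacked = np.concatenate([cnn_map, global_map], axis=0)  # (C_cnn + D, H, W)
    return np.einsum("oc,chw->ohw", proj, stacked)


# Toy multi-scale run with random stand-in features (all sizes are made up).
rng = np.random.default_rng(0)
c_cnn, d_tok, c_out = 8, 12, 10
grid_hw = (8, 8)
tokens = rng.standard_normal((grid_hw[0] * grid_hw[1], d_tok))
proj = rng.standard_normal((c_out, c_cnn + d_tok))

fused_maps = []
for size in (64, 32, 16):
    cnn_map = rng.standard_normal((c_cnn, size, size))
    fused_maps.append(fuse_scale(cnn_map, tokens, grid_hw, proj))
```

Each entry of `fused_maps` keeps the spatial resolution of the corresponding CNN map, so a decoder could consume the fused features scale by scale. A learned, per-scale projection would replace the single random `proj` in any real model.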