Progressive Downsampling Transformer With Convolution-Based Decoder and Its Application in Gear Pitting Measurement.

IEEE Trans. Instrum. Meas.(2023)

引用 2|浏览5
暂无评分
摘要
In transformer for semantic segmentation, patch embedding usually has only one convolutional layer with a large stride, leading to the decrease of feature extraction capability. In addition, the complex decoder results in high computation cost. To address the abovementioned two issues, we put forward a progressive downsampling transformer with convolution-based decoder (PDCDT), which is a simple, efficient yet powerful framework. Specifically, progressive downsampling layers for patch embedding are designed to refine the extracted features and reduce information loss at each stage of the hierarchical transformer encoder. Meanwhile, a simple decoder based on a convolution (conv) module is proposed for aggregating the characteristic information from multiscale (MS) output layers of the encoder, and it can realize dimensional transformation and information interaction with fewer parameters than the decoders used in the existing transformers. Extensive experiments show that PDCDT achieves competitive results on ADE20K mean intersection over union (47.9% mIoU) and cityscapes (82.6% mIoU). Finally, PDCDT is applied to gear pitting measurement in the gear contact fatigue test, and the comparative results indicate that PDCDT can improve the accuracy of pitting detection.
更多
查看译文
关键词
Features refined,neural network,pitting measurement,semantic segmentation transformer
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要