
Dual Wavelet Attention Networks for Image Classification.

IEEE Transactions on Circuits and Systems for Video Technology (2023)

Abstract
Global average pooling (GAP) plays an important role in traditional channel attention. However, the GAP result alone carries too little information to serve as the channel scalar. Meanwhile, existing spatial attention models locate regions of interest using average pooling or convolutional networks, which loses feature information and neglects structural features. In this paper, dual wavelet attention is proposed, which effectively alleviates these problems and enhances the representation ability of CNNs. First, the equivalence between the sum of the low-frequency subband coefficients of the 2D DWT (Haar) and GAP is proved. On this basis, the statistical characteristics of the low-frequency and high-frequency subbands are combined to obtain the channel scalars, which better measure the importance of each channel. In addition, the 2D DWT effectively captures both approximate and detailed structural features. Thus, wavelet spatial attention is proposed, which focuses on the key spatial structural features. Unlike traditional spatial attention, it can better capture structural and spatial attention for different channels. Experiments on four natural image data sets and three remote sensing scene classification data sets demonstrate the effectiveness and versatility of the proposed methods. The code of this paper will be available at https://github.com/yutinyang/DWAN.
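The claimed link between the Haar low-frequency subband and GAP can be checked numerically. The sketch below is not taken from the paper's code. It assumes a single-level, orthonormal 2D Haar transform (each 2x2 block is combined with a factor of 1/2) on a feature map with even height and width. The function name haar_dwt2 and the subband labels are illustrative, and labeling conventions for LH and HL differ between libraries. Under this normalization the sum of the low-frequency coefficients equals GAP multiplied by the constant H*W/2, so the two quantities are equivalent up to a fixed scale.

```python
import numpy as np

def haar_dwt2(x):
    """Single-level 2D Haar DWT of a 2D array with even height and width.

    Assumes the orthonormal normalization: every subband coefficient is a
    signed sum of one 2x2 block divided by 2.
    """
    a = x[0::2, 0::2]  # top-left sample of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency (approximation) subband
    lh = (a + b - c - d) / 2.0  # difference between rows of the block
    hl = (a - b + c - d) / 2.0  # difference between columns of the block
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))  # stand-in for one channel of a feature map
ll, lh, hl, hh = haar_dwt2(x)

h, w = x.shape
gap = x.mean()                 # global average pooling of this channel
scaled_gap = gap * h * w / 2.0

# The low-frequency sum matches GAP up to the constant factor H*W/2.
assert np.isclose(ll.sum(), scaled_gap)
```

The high-frequency subbands lh, hl, and hh carry the detail that GAP discards, which is the information the abstract says is combined with the low-frequency statistics to form the channel scalars. How the paper combines them is not stated in the abstract, so that step is left out here.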
Keywords
Wavelet transforms, Discrete wavelet transforms, Feature extraction, Discrete cosine transforms, Wavelet domain, Image coding, Visualization, Attention mechanism, 2D DWT, dual wavelet attention, wavelet channel attention, wavelet spatial attention