How many non-linear computations are required for CNNs to account for the response properties of the primary visual cortex (V1)?

Journal of Vision(2022)

引用 0|浏览0
暂无评分
摘要
While the primary visual cortex (V1) is arguably the best understood visual area, we still don’t fully understand its computational mechanisms. Traditional models propose that V1 neurons behave like Gabor filters, exhibiting selectivity for orientation and spatial frequency, followed by additional non-linear processes such as half-wave rectification, suppression, and/or divisive normalization. Convolutional neural networks (CNNs), which can simulate complex tuning functions, provide an alternate way to fit V1 data without relying on hand-designed filters. A recent study by Cadena et al. (2019) used the layer-wise activity of VGG-19 to predict V1 neuronal responses of monkeys that viewed thousands of natural and synthesized images. Surprisingly, the best V1 predictions were not obtained in the lowest layers, but rather, after multiple convolutional and max-pooling operations, leading the authors to conclude that V1 relies on far more non-linear computations than previously thought. However, a potential concern is that the lower layers of VGG-19 have small convolutional filters, whereas the images used to evaluate VGG-19 performance were comparatively large. Thus, we suspected that the poor performance of the lower layers of VGG-19 may have been driven by input size. To address this issue, we evaluated the performance of AlexNet, which has much larger receptive fields in its lower layers. In contrast to VGG-19, we found that the first convolutional layer of AlexNet best predicted V1 responses. A control analysis revealed that the best-performing layer of VGG-19 shifted systematically to lower layers after the input images were rescaled to a smaller size. We further showed that a modified version of AlexNet could match the predictive performance of VGG-19 after just a few non-linear computations. Overall, our findings demonstrate that the response properties of V1 neurons can be well explained by relatively few non-linear computations while confirming that CNNs outperform traditional V1 Gabor filter models.
更多
查看译文
关键词
primary visual cortex,cnns,response properties,non-linear
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要