Small object detection network for monitoring mass wasting activity in the Martian North polar region

Oleksii Martynchuk,Lida Fanara,Juergen Oberst

crossref(2022)

引用 0|浏览2
暂无评分
摘要
<p>The north polar region of Mars is one of the most active places of the planet with avalanches<sup>[1] </sup>and ice block falls<sup>[2]</sup> being observed every year on High Resolution Imaging Science Experiment (HiRISE) data. Both phenomena originate at the steep icy scarps surrounding the north polar plateau, Planum Boreum. The exposed layers of ice and dust contain important information about the climate cycles of the planet<sup>[3]</sup>. We are interested in monitoring how the current scarp erosion rate (quantified through analyzing blockfalls) is linked to these periodic changes in temperature. The large scale of the region of interest, combined with a growing amount of available satellite data makes automation key for this project. Our goal is to robustly monitor the whole north polar region for ice block falls throughout the entire Mars ReconnaissanceOrbiter (MRO) mission.&#160;</p> <p>&#160;</p> <p>Deep Learning networks are widely used for object detection problems, the YOLO family being particularly popular among researchers and commercial users. Although the baseline performance of its most recent iteration (v5) is satisfactory for most generic applications, HiRISE images of NPLD (North Polar Layered Deposits) present a greater challenge. Large boulders are usually easy to identify due to their well differentiated visual features, e.g. longer shadows and edges, greater surface area and less crowded backgrounds. As the blocks get smaller features&#160; become less pronounced and margins between different object instances shrink beyond the default resolution of&#160; YOLOv5 thus noticeably hindering the network&#8217;s overall efficiency. To boost the detection rate we propose the following modifications to the currently available build<em> (fig1)</em>:</p> <p>&#160;</p> <ul> <li aria-level="1"><strong>Super Resolution module</strong><sup>[4]</sup> - before the images are fed into the network backbone they are stretched out to 640x640p using simple bicubic interpolation. An alternative approach utilizes DASR (domain-distance aware super-resolution)&#160; as a preprocessing step. This super-resolution model can be trained on analogous Earth data (e.g. Svalbard) and then deployed to intelligently upscale the HiRISE dataset, adding an extra layer of textures and sharpening the images.&#160;</li> </ul> <p>&#160;</p> <ul> <li aria-level="1"><strong>Better anchor grid</strong><sup>[5]</sup> - anchor grid parameters should be based on the size of the objects we expect to observe in our dataset. The closer the initial guess, the less adjustments are subsequently&#160; performed by the network. We suggest that automatically deriving the anchors prior to training is extremely beneficial to the overall performance.&#160;</li> </ul> <p>&#160;</p> <ul> <li aria-level="1"><strong>Additional Fusion Layer and Residual Connections</strong><sup>[6]</sup> - baseline version of&#160; YOLOv5 starts implementing residual connections only at P3 level, ignoring the higher resolution P2 features. To reduce the loss of the feature information of small objects we incorporate the P2 layer into the rest of the network by stacking it on top of the basis formed by the original three layers, i.e. {P3, P4, P5}.&#160;</li> </ul> <p>&#160;</p> <ul> <li aria-level="1"><strong>Attention Module</strong><sup>[6] </sup>- this enhancement helps the network learn the spatial correlation between features much like the human visual mechanism, which enables the eye to focus on more important feature regions and suppresses irrelevant regions. There are different options available, yet we use a combination of ECA-Net and CBAM contributing channel attention and spatial attention mechanisms respectively.</li> </ul> <p>&#160;</p> <ul> <li aria-level="1"><strong>Position of attention modules within the network</strong><sup>[6]</sup><strong> </strong>- the insertion point of the aforementioned modules is not location invariant. We have chosen to incorporate it into the fusion layers right after every CSP (i.e.C3) block.&#160;</li> </ul> <p><img src="" alt="" /></p> <p><em>Figure 1.</em> A<em> diagram of the improved YOLOv5 architechture. The input tesnors are already upscaled by DASR and presprocessed to derive the best posible anchor grid parameters. C3 (CSP) -&#160; cross-stage partial network; SPPF - spatial pyramid pooling fast; ECBAM - ECANet + CBAM attention modules.&#160;</em></p> <p>The boost in performance is most noticeable in ice boulder dense areas, where detection is highly dependent on resolving separate compactly stacked objects. The exact figure is yet to be confirmed experimentally but preliminary testing shows at least a 3% improvement. We hypothesize that integrating the DASR module with the main training routine instead of performing upscaling as a separate preprocessing step might boost the performance even further. The NPLD remains our primary area of interest due to its high levels of activity and good satellite image density, yet we also plan to apply our pipeline to different surface changes and Martian regions as well as other celestial objects.</p> <p><br /><br /></p> <ul> <li aria-level="1">Russell P., Thomas, N. Byrne, S. Herkenhoff, K.Fishbaugh, K. Bridges, N. Okubo, C. Milazzo, M. Daubar, I. Hansen. Seasonally active frost-dust avalanches on a north polar scarp of Mars captured by HiRISE, Geophys. Res. Lett. 35, L23204 (2008).</li> <li aria-level="1">L. Fanara, K. Gwinner, E. Hauber, J. Oberst. Present-day erosion rate of north polar scarps on Mars due to active mass wasting, Icarus, Volume 342, (2020),113434, ISSN 0019-1035.</li> <li aria-level="1">Smith I.B., Putzig N.E., Holt J.W. and Phillips R.J. An ice age recorded in the polar deposits of Mars, Science, 352, 1075&#8211;1078 (2016).</li> <li aria-level="1">Yunxuan Wei, Shuhang Gu, Yawei Li, Longcun Jin. Unsupervised Real-world Image Super Resolution via Domain-distance Aware Training (2021).</li> <li aria-level="1">Zhan, W., Sun, C., Wang, M. <em>et al.</em> An improved Yolov5 real-time detection method for small objects captured by UAV. <em>Soft Comput</em> 26, 361&#8211;373 (2022)</li> <li aria-level="1">Linlin Zhu, Xun Geng, Zheng Li and Chun Liu. Improving YOLO v5 with Attention Mechanism for Detecting Boulders from Planetary Images. <em>Remote Sens. </em>, <em>13</em>(18), 3776 (2021).</li> </ul>
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要