Multi-granularity context network for SAR ship detection
Vol. 30, No. 1, 2025, pages 297-308
Print publication date: 2025-01-16
DOI: 10.11834/jig.230838
Ying Li, Zhang Zhifei, Miao Duoqian, Zhao Cairong. Multi-granularity context network for SAR ship detection[J]. Journal of Image and Graphics, 2025, 30(1): 297-308.
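Synthetic aperture radar (SAR) is an active microwave sensor that can acquire high-resolution remote sensing images and is crucial for maritime ship detection. However, SAR ship detection faces two main challenges: complex backgrounds and the diversity of ship target sizes. To address them, a multi-granularity context network for SAR ship detection is proposed.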
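First, a multi-granularity channel attention (MCA) module is designed to weight global and local contextual information at different granularities, strengthening the focus on important ship-target information and reducing the interference of complex backgrounds with the detection results. Then, a multi-granularity atrous adaptive spatial feature fusion (MAASFF) module is designed. It uses adaptive spatial feature fusion to merge the feature maps extracted by atrous convolutions with three dilation rates (1, 2, and 3), reducing the loss of contextual information during feature map generation and enhancing the representation capability of the feature pyramid, thereby improving the detection of ships of different sizes.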
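The channel-attention idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract only states that global and local channel context are weighted and that pointwise convolution aggregates them. The two-branch bottleneck layout, the sigmoid gate, the function names, and all weight values below are assumptions chosen for illustration. Feature maps are stored as C x N lists (channels by flattened positions) so that a pointwise (1x1) convolution reduces to a channel-mixing matrix product.

```python
import math

def pointwise_conv(weights, feats):
    """1x1 convolution: mix channels independently at every position.
    weights is C_out x C_in, feats is C_in x N (N = flattened H*W positions)."""
    n = len(feats[0])
    return [[sum(w * feats[c][p] for c, w in enumerate(row)) for p in range(n)]
            for row in weights]

def relu(feats):
    return [[max(0.0, v) for v in ch] for ch in feats]

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def channel_attention(feats, w_local, w_global):
    """Reweight feats (C x N) using local plus global channel context.
    w_local and w_global are hypothetical (reduce, expand) weight pairs."""
    n = len(feats[0])
    # Global granularity: average each channel over all positions -> C x 1.
    pooled = [[sum(ch) / n] for ch in feats]
    g = pointwise_conv(w_global[1], relu(pointwise_conv(w_global[0], pooled)))
    # Local granularity: the same kind of bottleneck applied per position -> C x N.
    l = pointwise_conv(w_local[1], relu(pointwise_conv(w_local[0], feats)))
    # Broadcast the global term over positions, squash to (0, 1), rescale input.
    return [[feats[c][p] * sigmoid(l[c][p] + g[c][0]) for p in range(n)]
            for c in range(len(feats))]

# Toy input: 2 channels, 3 positions. Weights are arbitrary illustrative values.
feats = [[1.0, 2.0, 3.0], [0.5, 0.0, -1.0]]
reduce_w = [[0.5, 0.5]]      # 2 channels -> 1 channel (bottleneck)
expand_w = [[1.0], [-1.0]]   # 1 channel -> 2 channels
out = channel_attention(feats, (reduce_w, expand_w), (reduce_w, expand_w))
```

Because the gate lies strictly between 0 and 1, each output value is a damped copy of its input: channels and positions that the two context branches score as informative are preserved, while the rest are suppressed.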
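Experimental results show that, compared with nine other methods on two datasets, SAR-Ship-Dataset and SSDD (SAR ship detection dataset), the proposed method achieves the best detection results, with average precision of 96.1% and 97.0%, respectively, further verifying the network's strong performance on the SAR ship detection task.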
Synthetic aperture radar (SAR) is an active microwave sensor that can acquire high-resolution remote sensing images, which makes it crucial for marine ship detection. Nevertheless, ship detection in SAR images faces two primary challenges. First, SAR images frequently contain complex backgrounds, including turbulent sea waves, islands, and various forms of clutter, which can significantly hinder the accurate identification of ship targets. Second, ship targets in SAR images vary widely in size, and traditional detection methods struggle to adapt to the broad range of ship sizes encountered in real-world scenarios. In recent years, with the extensive application of deep learning models and attention mechanisms, researchers have improved the performance of SAR ship detection methods and made progress on detecting ship targets of different sizes in complex backgrounds. However, these methods either have limitations in detection accuracy or require extensive computing resources. To address these challenges, this paper proposes a multi-granularity context network for SAR ship detection.
First, a multi-granularity channel attention (MCA) module is designed to weight global and local contextual information at different granularities. Its primary function is to focus on the important characteristics of ship targets and minimize the interference of complex backgrounds with the detection results. To keep the module lightweight, pointwise convolution replaces traditional convolution as the aggregator of global and local channel context information, which reduces computational overhead. The MCA module is integrated into the first layer of the backbone feature extraction network of the YOLOv5s framework. Together, the pointwise convolutions and this placement in the backbone support accurate and efficient SAR ship detection. Then, a multi-granularity atrous adaptive spatial feature fusion (MAASFF) module is designed to reduce the loss of contextual information during feature map generation and enhance the representation capability of the feature pyramid, thereby improving the detection of ships of different sizes. When fusing features of different granularities, the MAASFF module accounts for the differences between ship target features of different sizes and avoids unnecessary computational overhead. It employs an adaptive spatial feature fusion method to merge the feature maps extracted by atrous convolutions with three atrous rates (1, 2, and 3). This design captures features at different spatial granularities, enhancing the feature representation for ships of different sizes.
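The fusion step described above can be sketched in a simplified form. This is an illustrative 1D analogue, not the authors' MAASFF module: it applies the same 3-tap kernel at atrous rates 1, 2, and 3, then fuses the three outputs with per-position softmax weights, in the spirit of adaptive spatial feature fusion. The function names, the toy signal, and `logit_scales` (a stand-in for the learned layers that would produce the weight logits in a trained network) are all assumptions.

```python
import math

def dilated_conv1d(signal, kernel, rate):
    """Zero-padded 'same' 1D convolution (deep-learning convention, no kernel
    flip) whose taps are `rate` samples apart."""
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i + (k - half) * rate
            if 0 <= j < len(signal):   # taps outside the signal read as zero
                acc += w * signal[j]
        out.append(acc)
    return out

def effective_kernel_size(kernel_size, rate):
    """Span covered by a dilated kernel: k + (k - 1) * (rate - 1)."""
    return kernel_size + (kernel_size - 1) * (rate - 1)

def adaptive_fuse(branches, logit_scales):
    """Fuse same-sized branches with per-position softmax weights.
    logit_scales is a hypothetical stand-in for learned weight predictors."""
    fused, weights = [], []
    for i in range(len(branches[0])):
        logits = [s * b[i] for s, b in zip(logit_scales, branches)]
        m = max(logits)                          # subtract max for stability
        exps = [math.exp(z - m) for z in logits]
        total = sum(exps)
        w = [e / total for e in exps]            # weights sum to 1 per position
        weights.append(w)
        fused.append(sum(wi * b[i] for wi, b in zip(w, branches)))
    return fused, weights

signal = [0.0, 0.0, 1.0, 4.0, 1.0, 0.0, 0.0, 2.0, 0.0]
kernel = [1.0, 1.0, 1.0]
branches = [dilated_conv1d(signal, kernel, r) for r in (1, 2, 3)]
fused, weights = adaptive_fuse(branches, logit_scales=[0.5, 0.5, 0.5])
spans = [effective_kernel_size(3, r) for r in (1, 2, 3)]  # 3, 5, and 7 samples
```

With a 3-tap kernel, rates 1, 2, and 3 cover spans of 3, 5, and 7 samples without adding parameters, which is why stacking them gives context at several granularities. Because the fusion weights are normalized per position, each fused value is a convex combination of the three branch responses at that position.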
Compared with nine other methods on two datasets, SAR-Ship-Dataset and SSDD, our method achieves the best detection results. On the SAR-Ship-Dataset, it improves detection accuracy by approximately 1.9% to 6.0% over the two-stage methods Faster R-CNN, DAPN, CR2A-Net, KCPNet, and BL-Net. Compared with common one-stage methods such as YOLOv4, CenterNet++, CRDet, and YOLOv5s, it improves performance by 2.9%, 1.2%, 0.9%, and 1.8%, respectively. Our method thus achieves the best performance on the SAR-Ship-Dataset, reaching 96.1% mAP and outperforming all compared methods. On the SSDD dataset, our method improves performance by approximately 8.7% (Faster R-CNN), 6.9% (DAPN), 7.2% (CR2A-Net), 4.5% (KCPNet), 1.8% (BL-Net), 0.9% (YOLOv4), 4.3% (CenterNet++), 0.6% (CRDet), and 1.6% (YOLOv5s) while maintaining a speed similar to that of the baseline YOLOv5s. These results show that our method generalizes well and achieves the best performance with 97.0% mAP, further verifying its effectiveness in SAR ship detection tasks.
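As a reading aid, the reported gains can be turned into implied baseline scores. The sketch below assumes the stated improvements are absolute differences in mAP percentage points, which the abstract does not say explicitly. The resulting values are therefore derived under that assumption and are not figures reported in the abstract. The two-stage methods on the SAR-Ship-Dataset are omitted because only a range (1.9% to 6.0%) is given for them.

```python
# mAP of the proposed method, as stated in the abstract.
OURS = {"SAR-Ship-Dataset": 96.1, "SSDD": 97.0}

# Reported gains over each compared method, as stated in the abstract.
GAINS = {
    "SAR-Ship-Dataset": {
        "YOLOv4": 2.9, "CenterNet++": 1.2, "CRDet": 0.9, "YOLOv5s": 1.8,
    },
    "SSDD": {
        "Faster R-CNN": 8.7, "DAPN": 6.9, "CR2A-Net": 7.2, "KCPNet": 4.5,
        "BL-Net": 1.8, "YOLOv4": 0.9, "CenterNet++": 4.3, "CRDet": 0.6,
        "YOLOv5s": 1.6,
    },
}

def implied_baselines(ours, gains):
    """Baseline mAP implied by an absolute percentage-point gain (assumption)."""
    return {method: round(ours - gain, 1) for method, gain in gains.items()}

implied = {name: implied_baselines(OURS[name], GAINS[name]) for name in OURS}

for dataset, scores in implied.items():
    for method, score in sorted(scores.items(), key=lambda kv: kv[1]):
        print(f"{dataset:18s} {method:14s} {score:.1f}")
```

Under this reading, the YOLOv5s baseline would sit at 94.3 on the SAR-Ship-Dataset and 95.4 on SSDD. If the gains were instead relative improvements, the implied baselines would differ slightly.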
This paper proposes a multi-granularity context network, which aims to suppress complex background interference and enhance the ability to extract features of multi-sized ships, effectively improving the accuracy of SAR ship detection.
synthetic aperture radar (SAR) image; ship detection; multi-granularity; channel attention; feature fusion