Journal of System Simulation ›› 2023, Vol. 35 ›› Issue (11): 2333-2344. doi: 10.16182/j.issn1004731x.joss.22-0690
Zhao Weiping1,2, Chen Yu2, Xiang Song1, Liu Yuanqiang1, Wang Chaoyue1
Received: 2022-06-17
Revised: 2022-08-16
Online: 2023-11-25
Published: 2023-11-24
Corresponding author: Chen Yu
E-mail: 3370477370@qq.com; 1009857106@qq.com
About the first author: Zhao Weiping (born 1968), male, associate professor, Ph.D.; research interests include aircraft design and image processing. E-mail: 3370477370@qq.com
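Abstract:
Mainstream image semantic segmentation networks often suffer from mis-segmentation, discontinuous segmentation, and high model complexity, so they cannot be deployed flexibly and efficiently in real-world scenarios. To address this, an image semantic segmentation network that optimizes the DeepLabv3+ model is designed by jointly considering the network's parameter count, prediction time, and accuracy. The backbone is replaced with the lightweight EfficientNetv2 network for feature extraction, which improves parameter utilization. In the atrous spatial pyramid pooling (ASPP) module, a mixed strip pooling module replaces global average pooling and depthwise separable dilated convolution is introduced, which reduces the parameter count and improves the ability to learn multi-scale information. An attention mechanism strengthens the model's representational power, and multiple shallow features are extracted from the backbone to enrich the geometric detail of the image. Experiments show that the proposed algorithm reaches an mIoU of 81.19% with 55.51×10^6 parameters, effectively improving both segmentation accuracy and model complexity while also improving the model's generalization.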
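The abstract rests on two quantities: the parameter savings from depthwise separable convolution and the mIoU metric. The following is a minimal sketch of how each is commonly computed, not the authors' implementation. The function names and the 256-channel, 3×3 layer in the usage note are hypothetical, since the abstract gives no layer dimensions. The dilation rate does not appear in the parameter formulas because it widens the receptive field without adding weights.

```python
def conv_params(c_in, c_out, k, bias=False):
    """Weight count of a standard k x k convolution."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def separable_conv_params(c_in, c_out, k, bias=False):
    """Weight count of a depthwise k x k convolution plus a 1 x 1 pointwise convolution."""
    depthwise = c_in * k * k + (c_in if bias else 0)   # one k x k filter per input channel
    pointwise = c_in * c_out + (c_out if bias else 0)  # 1 x 1 convolution mixes channels
    return depthwise + pointwise


def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes.

    pred and target are flat sequences of integer class labels of equal length.
    Classes absent from both pred and target are skipped so they do not
    distort the average.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    intersection = [0] * num_classes
    pred_count = [0] * num_classes
    target_count = [0] * num_classes
    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1
    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:
            ious.append(intersection[c] / union)
    return sum(ious) / len(ious) if ious else 0.0
```

For the hypothetical 256-to-256-channel 3×3 layer, `conv_params(256, 256, 3)` gives 589824 weights and `separable_conv_params(256, 256, 3)` gives 67840, roughly 8.7 times fewer. For labels `[0, 0, 1, 1]` against ground truth `[0, 1, 1, 1]`, the per-class IoUs are 1/2 and 2/3, so `mean_iou` returns 7/12.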
Zhao Weiping, Chen Yu, Xiang Song, et al. Image Semantic Segmentation Algorithm Based on Improved DeepLabv3+[J]. Journal of System Simulation, 2023, 35(11): 2333-2344.