系统仿真学报 ›› 2023, Vol. 35 ›› Issue (12): 2680-2691.doi: 10.16182/j.issn1004731x.joss.23-FZ0843E

• • 上一篇    下一篇

基于跨模块注意力的3D目标检测方法研究

许仁杰1(), 张小明1, 王晨2, 吴鹏2   

  1. 1.陆军装甲兵学院,北京 100072
    2.浙江理工大学,浙江 杭州 310018
  • 出版日期:2023-12-15 发布日期:2023-12-12

Research on 3D Object Detection Method with Cross-module Attention

Xu Renjie1(), Zhang Xiaoming1, Wang Chen2, Wu Peng2   

  1. 1.Academy of Army Armored Force, Beijing 100072, China
    2.Zhejiang Sci-Tech University, Hangzhou 310018, China
  • Online:2023-12-15 Published:2023-12-12
  • About author:Xu Renjie(1974-), male, associate professor, doctor, research area: VR and unmanned system simulation. E-mail: 1728217581@qq.com

摘要:

针对三维目标检测任务中利用点云数据在提取和传输目标特征过程中发生的特征丢失问题,提出一种跨模块注意力目标检测 方法 。该方法结合通道注意力模块和空间注意力模块来增强关键特征信息。通过特征转换,将注意力模块不同阶段的特征连接起来,以减轻提取和传输过程中特征的损失。针对目标检测网络中不同尺度目标检测性能不足的问题,提出了一种跨尺度特征提取和融合方法。该方法通过采用多尺度特征提取和融合技术增强了网络获取多级特征的能力。实验结果表明:所提方法在保持33 Hz实时推理速度的同时获得了先进的性能。

关键词: 3D目标检测, 体素网络, 注意力模块, 多尺度特征

Abstract:

To address the issue of feature loss that occurs during the extraction and transmission of target features in 3D object detection tasks using point cloud data, this study proposes an object detection method based on cross-module attention. This method incorporates a channel attention module and a spatial attention module to enhance the crucial feature information. Through feature transformationthe features from different stages of the attention module are connected to mitigate the loss of features during the extraction and transmission process. To tackle the problem of inadequate detection performance in target detection networks for objects of different scales, a cross-scale feature extraction and fusion method is introduced. This method enhances the network’s ability to acquire multilevel features by employing multi-scale feature extraction and fusion techniques. Experimental results demonstrate that the proposed method achieves state-of-the-art performance while maintaining a real-time inference speed of 33 Hz.

Key words: 3D object detection, voxel-based network, attention module, multi-scale feature

中图分类号: