Journal of System Simulation ›› 2024, Vol. 36 ›› Issue (4): 1028-1042.doi: 10.16182/j.issn1004731x.joss.22-1332

• Papers • Previous Articles    

Research on Dynamic Scene SLAM Based on Improved Object Detection

Shi Lanxi1(), Yan Wenxu1(), Ni Hongyu2, Zhao Feng2   

  1. 1.School of Internet of Things Engineering, Jiangnan University, Wuxi 214100, China
    2.State Grid Shaoxing Power Supply Company, Shaoxing 312000, China
  • Received:2022-11-09 Revised:2023-01-06 Online:2024-04-15 Published:2024-04-18
  • Contact: Yan Wenxu E-mail:2066760176@qq.com;ywx01@jiangnan.edu.cn

Abstract:

Aiming at the epipolar constraint matching problem of monocular SLAM in dynamic scenes adynamic feature point selection method based on object detection is proposed, in which the dynamic feature points in the front-end image frame of SLAM system is eliminated during feature extraction to improve the localization accuracy of SLAM. An improved target detection network is proposed to construct a loss function to describe the bounding box by using the overlap area, distance similarity and cosine similarity, which can achieve the accurate localization of target objects and obtain the range of object feature points in the current image frame. The object category is judged in SLAM, and the dynamic feature points in the front-end image frame are rejected according to the target detection result for the objects marked as dynamic. Based on the static feature point results, the epipolar geometry is used for the feature matching between two frames to estimate pose the to carry out the tracking, map building and closed-loop detection of monocular camera motion. The speed of the inference process is improved by the structural reparameterization of the backbone of target detection network to ensure the real-time operation of the overall system. Experimental results on KITTI dataset show that the improved system improves the localization accuracy by 23.4% over ORB-SLAM3 system, and the frame rate can reach more than 30fps. The algorithm can effectively improve the localization accuracy of monocular SLAM system in dynamic scenes under the condition of ensuring the real-time operation.

Key words: visual SLAM, epipolar geometry, feature matching, object detection, IoU loss function, structural reparameterization

CLC Number: