Journal of System Simulation ›› 2025, Vol. 37 ›› Issue (11): 2701-2713.doi: 10.16182/j.issn1004731x.joss.24-0564

• Papers •    

Improvement of SLAM Localization Accuracy in AR by Enhancing YOLOv8

Liu Jia1,2,3, Zhang Zengwei1,2, Chen Dapeng1,2,3, Huang Nanxuan1,2, Wang Bin1,2, Song Hong1,2   

  1. 1.Tianchang Research Institute, Nanjing University of Information Science and Technology, Chuzhou 239356, China
    2.School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China
    3.Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology (CICAEET), Nanjing 210044, China
  • Received:2024-05-24 Revised:2024-07-20 Online:2025-11-18 Published:2025-11-27
  • Contact: Chen Dapeng

Abstract:

In the presence of dynamic interference in the environment, traditional simultaneous localization and mapping (SLAM) methods often experience reduced precision and stability in the registration of virtual objects during three-dimensional registration in augmented reality (AR). To address these issues, an improved method for dynamic scenes based on semantic segmentation and optical flow tracking was proposed. The convolutional block attention module (CBAM) attention mechanism was incorporated into YOLOv8 to enhance its focus on dynamic objects in the environment, thereby improving detection performance and accuracy. The semantic segmentation functionality of the improved YOLOv8 was integrated into the front-end of ORB-SLAM3 to segment dynamic objects in the scene and remove dynamic feature points that affect map construction. The optical flow method was further used to track moving objects, thereby improving the positioning accuracy of the camera. Validation was conducted on the TUM dataset and in real-world scenarios. The results indicate that, compared to traditional ORB-SLAM3, the proposed method improves positioning accuracy in dynamic scenes, significantly enhancing the stability of 3D registration in AR.

Key words: augmented reality(AR), three-dimensional registration, YOLOv8, ORB-SLAM3, semantic segmentation, dynamic detection

CLC Number: