系统仿真学报 ›› 2026, Vol. 38 ›› Issue (3): 818-828.doi: 10.16182/j.issn1004731x.joss.25-0031

• 论文 • 上一篇    

面向城市复杂环境视觉地点识别算法研究

刘沛津1, 张闽心1, 何林2, 孙艺阁1, 苏庭琪1   

  1. 1.西安建筑科技大学 机电工程学院,陕西 西安 710055
    2.西安建筑科技大学 理学院,陕西 西安 710055
  • 收稿日期:2025-01-08 修回日期:2025-04-08 出版日期:2026-03-18 发布日期:2026-03-27
  • 通讯作者: 何林
  • 第一作者简介:刘沛津(1971-),女,教授,博士,研究方向为计算机视觉、分布式机电液系统智能测控等。
  • 基金资助:
    陕西省教育厅科研计划项目(24JC048)

Research on Visual Place Recognition Algorithms for Complex Urban Environments

Liu Peijin1, Zhang Minxin1, He Lin2, Sun Yige1, Su Tingqi1   

  1. 1.School of Mechatronic Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China
    2.School of Science, Xi'an University of Architecture and Technology, Xi'an 710055, China
  • Received:2025-01-08 Revised:2025-04-08 Online:2026-03-18 Published:2026-03-27
  • Contact: He Lin

摘要:

针对复杂城市环境中交通流量、人群密度等动态因素降低视觉地点识别(visual place recognition,VPR)算法精度的问题,提出一种融合语义引导的视觉地点识别(semantic-guided VPR,SG-VPR)算法。设计语义引导特征抑制模块,通过构建语义引导模块与特征抑制层弱化动态物体干扰,精准提取关键静态特征改进传统三元损失函数,提出自适应三元损失函数(adaptive triplet marginal loss,ATML),依据样本分布自适应调整边距,解决固定边距策略导致的次优解收敛问题,提升特征区分能力。实验结果表明:SG-VPR算法在Pittsburgh250k和Tokyo24/7城市复杂环境公开数据集上的表现优于现有方法,在动态干扰场景下的视觉地点识别性能得到提升。

关键词: 城市复杂环境, 视觉地点识别, 深度学习, 语义引导, 特征抑制, 自适应三元损失函数

Abstract:

Dynamic factors such as traffic flow and crowd density in complex urban environments reduce the accuracy of visual place recognition (VPR) algorithms. To solve these problems, a semantic-guided visual place recognition (SG-VPR) algorithm was proposed. A semantic-guided feature suppression module was designed. A semantic-guided module and feature suppression layer were constructed to reduce the dynamic object interference and more accurately extract the key static features. An adaptive triplet margin loss function (ATML) was proposed by improving the traditional triplet margin loss. The margins were adaptively adjusted according to the sample distribution, solving the problem of suboptimal solution convergence caused by a fixed margin strategy and improving the feature differentiation ability. Experimental results show that the SG-VPR outperforms existing methods on the Pittsburgh250k and Tokyo24/7 public datasets with complex urban environments, especially in coping with dynamic interference scenarios, which significantly improves the performance of the VPR algorithm.

Key words: complex urban environments, visual place recognition, deep learning, semantic guidance, feature suppression, adaptive triplet marginal loss(ATML)

中图分类号: