系统仿真学报 ›› 2024, Vol. 36 ›› Issue (1): 110-119.doi: 10.16182/j.issn1004731x.joss.23-0112

• 论文 • 上一篇    

基于自适应空间特征增强的多视图深度估计

魏东(), 刘欢(), 张潇瀚, 李昌恺, 孙天翼, 张子优   

  1. 沈阳工业大学信息科学与工程学院,辽宁 沈阳 110870
  • 收稿日期:2023-02-09 修回日期:2023-04-10 出版日期:2024-01-20 发布日期:2024-01-19
  • 通讯作者: 刘欢 E-mail:dongweisut@sut.edu.cn;1121339441@qq.com
  • 第一作者简介:魏东(1968-),男,副教授,硕士,研究方向为计算机图形学、计算机视觉、虚拟现实等。E-mail:dongweisut@sut.edu.cn
  • 基金资助:
    辽宁省教育厅项目(LJGD2020006)

Multi-view Depth Estimation Based on Adaptive Space Feature Enhancement

Wei Dong(), Liu Huan(), Zhang Xiaohan, Li Changkai, Sun Tianyi, Zhang Ziyou   

  1. School of Information Science and Engineering, Shenyang University of Technology, Shenyang 110870, China
  • Received:2023-02-09 Revised:2023-04-10 Online:2024-01-20 Published:2024-01-19
  • Contact: Liu Huan E-mail:dongweisut@sut.edu.cn;1121339441@qq.com

摘要:

为了提高多视图深度估计结果精度,提出一种基于自适应空间特征增强的多视图深度估计算法。设计了由改进后的特征金字塔网络( feature pyramid network, FPN)和自适应空间特征增强(adaptive space feature enhancement, ASFE)组成的多尺度特征提取模块,获取到具有全局上下文信息和位置信息的多尺度特征图像。通过残差学习网络对深度图进行优化,防止多次卷积操作出现重建边缘模糊的问题。通过分类的思想构建focal loss函数增强网络模型的判断能力。由实验结果可知,该算法在DTU(technical university of denmark)数据集上和CasMVSNet(Cascade MVSNet)算法相比,在整体精度误差、运行时间、显存资源占用上分别降低了14.08%、72.15%、4.62%。在Tanks and Temples数据集整体评价指标Mean上该模型优于其他算法,证明提出的基于自适应空间特征增强的多视图深度估计算法的有效性。

关键词: 多视图深度估计, 自适应空间特征增强, 残差学习网络, 卷积操作, focal loss函数

Abstract:

A multi-view depth estimation algorithm based on adaptive space feature enhancement (ASFE) is presented to improve the multi-view depth estimation accuracy.A multi-scale feature extraction module composed of an improved feature pyramid network (FPN) and ASFE is designed. This module obtains multi-scale feature maps withglobal context-aware information and coordinate information. The residual learning network is used to optimize the depth map to prevent the problem of blurred reconstructed edges in multiple convolution operations. The proposed algorithm constructs a focal loss function through the idea of classification to enhance the prediction ability of the network model. The experimental results show that on the technical university of denmark (DTU) dataset, compared with the cascade MVSNet (CasMVSNet) method, the proposed method reduces overall accuracy error, running time, and video memory resource occupation by 14.08%, 72.15%, and 4.62%, respectively. The Mean of the model on the Tanks and Temples dataset is superior to other algorithms, which proves the effectiveness of the proposed multi-view depth estimation algorithm based on ASFE.

Key words: multi-view depth estimation, adaptive space feature enhancement, residual learning network, convolution operation, focal loss function

中图分类号: