系统仿真学报 ›› 2026, Vol. 38 ›› Issue (3): 572-583.doi: 10.16182/j.issn1004731x.joss.25-0568
• 专栏 • 上一篇
曹明伟, 王凤娜, 王子龙, 赵海峰
收稿日期:2025-06-17
修回日期:2025-09-19
出版日期:2026-03-18
发布日期:2026-03-27
通讯作者:
赵海峰
第一作者简介:曹明伟(1986-),男,副教授,硕士生导师,博士,主要研究方向为三维视觉。
基金资助:Cao Mingwei, Wang Fengna, Wang Zilong, Zhao Haifeng
Received:2025-06-17
Revised:2025-09-19
Online:2026-03-18
Published:2026-03-27
Contact:
Zhao Haifeng
摘要:
针对神经辐射场(neural radiance fields,NeRF)在稀疏视图输入及复杂场景下新视图合成易出现伪影和纹理模糊的问题,提出了一种基于显式特征匹配和缩放点积注意力的神经辐射场方法(NeRF based on explicit feature matching and scaled dot-product attention,EMD-NeRF)。使用多尺度特征提取网络从输入的稀疏视图中提取多尺度特征信息。利用融合点积模块计算视图交互信息,作为共享分支。采用余弦相似度作为匹配线索,进行相似性嵌入体渲染。使用正则化损失函数增强场景颜色密度场的质量,提高所渲染的新视图的真实性。在多个开源数据集上的实验结果均证明了所提方法的有效性。
中图分类号:
曹明伟,王凤娜,王子龙等 . 基于显式特征匹配和缩放点积注意力的神经辐射场[J]. 系统仿真学报, 2026, 38(3): 572-583.
Cao Mingwei,Wang Fengna,Wang Zilong,et al . Neural Radiance Fields Based on Explicit Feature Matching and Scaled Dot-product Attention[J]. Journal of System Simulation, 2026, 38(3): 572-583.
表1
EMD-NeRF在DTU数据集上的定量评估结果
| 输入 | 方法 | 类别 | PSNR | SSIM | LPIPS |
|---|---|---|---|---|---|
| 3视图 | SRF[ | 预训练 | 15.68 | 0.698 | 0.281 |
| PixelNeRF[ | 18.95 | 0.710 | 0.269 | ||
| MVSNeRF[ | 26.63 | 0.931 | 0.168 | ||
| GeoNeRF[ | 24.01 | 0.928 | 0.162 | ||
| IBRNet[ | 26.04 | 0.917 | 0.190 | ||
| EMD-NeRF | 27.23 | 0.936 | 0.159 | ||
| FreeNeRF[ | 18.02 | 0.680 | |||
| DietNeRF[ | 11.85 | 0.633 | 0.314 | ||
| 3视图 | RegNeRF[ | 正则化 | 18.89 | 0.745 | 0.190 |
| MixNeRF[ | 18.95 | 0.744 | 0.203 | ||
| SparseNeRF[ | 19.55 | 0.769 | 0.201 |
表2
EMD-NeRF在LLFF数据集上的定量评估结果
| 输入 | 方法 | 类别 | PSNR | SSIM | LPIPS |
|---|---|---|---|---|---|
| 3视图 | SRF[ | 预训练 | 17.07 | 0.436 | 0.529 |
| PixelNeRF[ | 16.17 | 0.438 | 0.512 | ||
| MVSNeRF[ | 21.93 | 0.795 | 0.252 | ||
| GeoNeRF[ | 21.10 | 0.827 | 0.293 | ||
| IBRNet[ | 21.79 | 0.786 | 0.279 | ||
| EMD-NeRF | 22.27 | 0.802 | 0.250 | ||
| FreeNeRF | 19.63 | 0.612 | 0.308 | ||
| DietNeRF[ | 14.94 | 0.370 | 0.496 | ||
| RegNeRF[ | 19.08 | 0.587 | 0.336 | ||
| 3视图 | MixNeRF[ | 正则化 | 19.27 | 0.629 | 0.236 |
| SparseNeRF[ | 19.86 | 0.624 | 0.328 | ||
| FSGS[ | 20.31 | 0.652 | 0.288 |
| [1] | 王自力, 高鋆添, 杨德真, 等. 智能系统可靠性仿真测试与验证技术: 前沿进展与挑战[J]. 系统仿真学报, 2025, 37(7): 1583-1606. |
| Wang Zili, Gao Juntian, Yang Dezhen, et al. Reliability Simulation Testing and Verification Technologies for Intelligent Systems: Frontiers, Progress, and Challenges[J]. Journal of System Simulation, 2025, 37(7): 1583-1606. | |
| [2] | Deng Kangle, Liu A, Zhu Junyan, et al. Depth-supervised NeRF: Fewer Views and Faster Training for Free[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 12872-12881. |
| [3] | Roessle Barbara, Barron J T, Mildenhall B, et al. Dense Depth Priors for Neural Radiance Fields from Sparse Input Views[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 12882-12891. |
| [4] | Mescheder Lars, Oechsle Michael, Niemeyer Michael, et al. Occupancy Networks: Learning 3D Reconstruction in Function Space[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2019: 4455-4465. |
| [5] | Liu Lingjie, Gu Jiatao, Zaw Lin Kyaw, et al. Neural Sparse Voxel Fields[C]//Proceedings of the 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 15651-15663. |
| [6] | Kellnhofer P, Jebe L C, Jones A, et al. Neural Lumigraph Rendering[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 4285-4295. |
| [7] | Park J J, Florence P, Straub J, et al. DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2019: 165-174. |
| [8] | Michalkiewicz Mateusz, Jhony Kaesemodel Pontes, Jack Dominic, et al. Implicit Surface Representations as Layers in Neural Networks[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2019: 4742-4751. |
| [9] | Chen Zhiqin, Zhang Hao. Learning Implicit Fields for Generative Shape Modeling[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2019: 5932-5941. |
| [10] | Mildenhall B, Srinivasan P P, Tancik M, et al. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis[J]. Communications of the ACM, 2022, 65(1): 99-106. |
| [11] | Xu Dejia, Jiang Yifan, Wang Peihao, et al. SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image[C]//Computer Vision – ECCV 2022. Cham: Springer Nature Switzerland, 2022: 736-753. |
| [12] | Kim Mijeong, Seo Seonguk, Han Bohyung. InfoNeRF: Ray Entropy Minimization for Few-shot Neural Volume Rendering[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 12902-12911. |
| [13] | Truong P, Rakotosaona M J, Manhardt F, et al. SPARF: Neural Radiance Fields from Sparse and Noisy Poses[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2023: 4190-4200. |
| [14] | Chen Di, Liu Yu, Huang Lianghua, et al. GeoAug: Data Augmentation for Few-shot NeRF with Geometry Constraints[C]//Computer Vision – ECCV 2022. Cham: Springer Nature Switzerland, 2022: 322-337. |
| [15] | Niemeyer Michael, Barron J T, Mildenhall B, et al. RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 5470-5480. |
| [16] | Fu Hongyu, Yu Xin, Li Lincheng, et al. CBARF: Cascaded Bundle-adjusting Neural Radiance Fields from Imperfect Camera Poses[J]. IEEE Transactions on Multimedia, 2024, 26: 9304-9315. |
| [17] | Tang Jiaxiang, Chen Xiaokang, Wang Jingbo, et al. Compressible-composable NeRF Via Rank-residual Decomposition[C]//Proceedings of the 36th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2022: 14798-14809. |
| [18] | Yu A, Li Ruilong, Tancik M, et al. PlenOctrees for Real-time Rendering of Neural Radiance Fields[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 5732-5741. |
| [19] | Mohammad Mahdi Johari, Lepoittevin Yann, Fleuret François. GeoNeRF: Generalizing NeRF with Geometry Priors[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 18344-18347. |
| [20] | Chibane Julian, Bansal A, Lazova Verica, et al. Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 7907-7916. |
| [21] | Wang Qianqian, Wang Zhicheng, Genova K, et al. IBRNet: Learning Multi-view Image-based Rendering[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 4688-4697. |
| [22] | Chen Anpei, Xu Zexiang, Zhao Fuqiang, et al. MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-view Stereo[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 14104-14113. |
| [23] | Rematas K, Martin-Brualla R, Ferrari V. ShaRF: Shape-conditioned Radiance Fields from a Single View[EB/OL]. (2021-06-23)[2025-04-12]. . |
| [24] | Yu A, Ye V, Tancik M, et al. pixelNeRF: Neural Radiance Fields from One or Few Images[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 4576-4585. |
| [25] | Yu Yingchen, Wu Rongliang, Yifang Men, et al. MorphNeRF: Text-guided 3D-aware Editing via Morphing Generative Neural Radiance Fields[J]. IEEE Transactions on Multimedia, 2024, 26: 8516-8528. |
| [26] | Long Lee Jie, Li Chen, Hee Lee Gim. DiSR-NeRF: Diffusion-guided View-consistent Super-resolution NeRF[C]//2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2024: 20561-20570. |
| [27] | Wang Chuandong, Cai Meng, Li Jianxun. DKD-NeRF: Depth Knowledge-distillation NeRF for Sparse Input Views[C]//Proceedings of the 2024 4th International Joint Conference on Robotics and Artificial Intelligence. New York: ACM, 2025: 119-123. |
| [28] | Niemeyer Michael, Mescheder Lars, Oechsle Michael, et al. Differentiable Volumetric Rendering: Learning Implicit 3D Representations Without 3D Supervision[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2020: 3501-3512. |
| [29] | Niemeyer Michael, Mescheder Lars, Oechsle Michael, et al. Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2019: 5378-5388. |
| [30] | Kerbl Bernhard, Kopanas Georgios, Leimkuehler Thomas, et al. 3D Gaussian Splatting for Real-time Radiance Field Rendering[J]. ACM Transactions on Graphics, 2023, 42(4): 139. |
| [31] | Diels Laurens, Vlaminck Michiel, Philips Wilfried, et al. Fast 3D Gaussian Splatting Rendering via Easily Integrable Improvements[J]. IEEE Signal Processing Letters, 2025, 32: 381-385. |
| [32] | Guo Shuai, Wang Qiuwen, Gao Yijie, et al. Depth-guided Robust Point Cloud Fusion NeRF for Sparse Input Views[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(9): 8093-8106. |
| [33] | Jain A, Tancik M, Abbeel P. Putting NeRF on a Diet: Semantically Consistent Few-shot View Synthesis[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 5865-5874. |
| [34] | Jang W, Agapito L. CodeNeRF: Disentangled Neural Radiance Fields for Object Categories[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 12929-12938. |
| [35] | Li Jiaxin, Feng Zijian, She Qi, et al. MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 12558-12568. |
| [36] | Trevithick A, Yang Bo. GRF: Learning a General Radiance Field for 3D Representation and Rendering[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 15162-15172. |
| [37] | Liu Yuan, Peng Sida, Liu Lingjie, et al. Neural Rays for Occlusion-aware Image-based Rendering[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 7814-7823. |
| [38] | Cao Mingwei, Wang Fengna, Sun Dengdi, et al. BCS-NeRF: Bundle Cross-sensing Neural Radiance Fields[C]//Proceedings of the 6th ACM International Conference on Multimedia in Asia. New York: ACM, 2024: 37. |
| [39] | Xu Haofei, Zhang Jing, Cai Jianfei, et al. GMFlow: Learning Optical Flow via Global Matching[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 8111-8120. |
| [40] | Jensen Rasmus, Dahl Anders, Vogiatzis George, et al. Large Scale Multi-view Stereopsis Evaluation[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2014: 406-413. |
| [41] | Mildenhall B, Srinivasan P P, Ortiz-Cayon R, et al. Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines[J]. ACM Transactions on Graphics, 2019, 38(4): 29. |
| [42] | Wang Zhou, Bovik A C, Sheikh H R, et al. Image Quality Assessment: from Error Visibility to Structural Similarity[J]. IEEE Transactions on Image Processing, 2004, 13(4): 600-612. |
| [43] | Zhang R, Isola P, Efros A A, et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2018: 586-595. |
| [44] | Yang Jiawei, Pavone M, Wang Yue. FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2023: 8254-8263. |
| [45] | Seo Seunghyeon, Han Donghoon, Chang Yeonjin, et al. MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2023: 20659-20668. |
| [46] | Wang Guangcong, Chen Zhaoxi, Chen Change Loy, et al. SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis[C]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2023: 9031-9042. |
| [47] | Zhu Zehao, Fan Zhiwen, Jiang Yifan, et al. FSGS: Real-time Few-shot View Synthesis Using Gaussian Splatting[C]//Computer Vision-ECCV 2024. Cham: Springer Nature Switzerland, 2025: 145-163. |
| [1] | 陆后军, 朱益飞, 荣延平, 张汪荟. 基于激光雷达的散货堆场数字孪生建模方法[J]. 系统仿真学报, 2025, 37(9): 2269-2286. |
| [2] | 姚万业, 庞泽伟, 孙沛杰, 王祝. 基于窗口化匹配估计的ORB-SLAM算法研究[J]. 系统仿真学报, 2024, 36(9): 2032-2042. |
| [3] | 史蓝兮, 颜文旭, 倪宏宇, 赵峰. 基于改进目标检测的动态场景SLAM研究[J]. 系统仿真学报, 2024, 36(4): 1028-1042. |
| [4] | 杨敬辉, 刘德康, 杜万和, 邢立宁. 基于图像特征的双目测距系统的研究[J]. 系统仿真学报, 2022, 34(3): 624-632. |
| [5] | 蔡鹏, 沈朝萍, 李红燕. 基于标定区域特征点组合匹配的位姿估计方法[J]. 系统仿真学报, 2021, 33(7): 1638-1646. |
| [6] | 陈立家, 王凯, 李世刚, 田延飞. 基于航空影像的航海模拟器视景快速建模方法[J]. 系统仿真学报, 2021, 33(7): 1565-1573. |
| [7] | 吴宇豪, 曹雪峰, 彭锦超, 徐连瑞. 基于运动结构图的无人机序列影像三维重建[J]. 系统仿真学报, 2020, 32(6): 1094-1102. |
| [8] | 康来, 魏迎梅, 蒋杰, 谢毓湘. 融合视惯传感数据的非接触式物体尺寸获取方法[J]. 系统仿真学报, 2020, 32(5): 892-900. |
| [9] | 张友鹏, 王淳, 刘艳丽. 移动视点下在线视频的动态阴影检测与跟踪[J]. 系统仿真学报, 2019, 31(7): 1439-1447. |
| [10] | 徐连瑞, 张锦明. 基于Kinect的室内环境建模[J]. 系统仿真学报, 2019, 31(12): 2643-2651. |
| [11] | 张航, 陈彬, 薛含章, 朱正秋, 王戎骁. 基于无人机和LIDAR的三维场景建模研究[J]. 系统仿真学报, 2017, 29(9): 1914-1920. |
| [12] | 林丽萍, 张亚萍. 基于错配剔除的三维重建研究[J]. 系统仿真学报, 2017, 29(11): 2644-2648. |
| [13] | 孙晶晶, 于佳骏, 李贝, 谢志峰, 丁友东. 基于人像字典集的卡通自动生成方法[J]. 系统仿真学报, 2015, 27(4): 682-688. |
| [14] | 孟悦, 周明全, 税午阳, 武仲科. 基于单幅图像的三维自由曲面浮雕生成[J]. 系统仿真学报, 2015, 27(12): 3012-3017. |
| [15] | 陈国军, 韦鑫. 基于深度图的可视外壳凹面优化[J]. 系统仿真学报, 2015, 27(10): 2508-2513. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||