系统仿真学报 ›› 2026, Vol. 38 ›› Issue (5): 1290-1302.doi: 10.16182/j.issn1004731x.joss.25-0502

• • 上一篇    

融合物理与几何先验的无抓取标注6-DoF抓取检测方法

石敏1, 郭诗盛1, 王素琴1, 李兆歆2,3, 朱登明4,5   

  1. 1.华北电力大学 控制与计算机工程学院,北京 102206
    2.中国农业科学院 农业信息研究所,北京 100081
    3.农业农村部 农业大数据重点实验室,北京 100081
    4.中国科学院 计算技术研究所,北京 100190
    5.太仓中科信息技术研究院,江苏 太仓 215400
  • 收稿日期:2025-06-03 修回日期:2025-08-11 出版日期:2026-05-21 发布日期:2026-05-29
  • 通讯作者: 王素琴
  • 第一作者简介:石敏(1975-),女,副教授,博士,研究方向为具身智能、三维场景感知与重建。
  • 基金资助:
    国家自然科学基金(61972379);国家自然科学基金(62172392);山东省新旧动能转换重大产业攻关项目(2021-55);苏州市科技计划前沿技术研究(SYG202327);苏州市科技计划(SYC2022048)

Annotation-free 6-DoF Grasp Detection Method Integrating Physical and Geometric Priors

Shi Min1, Guo Shisheng1, Wang Suqin1, Li Zhaoxin2,3, Zhu Dengming4,5   

  1. 1.School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
    2.Agricultural Information Technology Research Laboratory, Chinese Academy of Agricultural Sciences, Beijing 100081, China
    3.Key Laboratory of Agricultural Big Data, Ministry of Agriculture and Rural Affairs, Beijing 100081, China
    4.Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    5.Taicang-CAS Institute of Information and Technology, Taicang 215400, China
  • Received:2025-06-03 Revised:2025-08-11 Online:2026-05-21 Published:2026-05-29
  • Contact: Wang Suqin

摘要:

为提升复杂堆叠场景中抓取姿态估计的稳定性与跨类别泛化能力,提出一种融合物理规则与几何结构先验的无标注6-DoF抓取检测方法。在离线阶段,基于多物理约束,构建可行抓取姿态模板库,无需依赖人工抓取标注。在网络设计中,引入物体结构对称性与空间重叠关系建模,设计具备遮挡感知与暴露度建模能力的几何引导机制,并结合关键点回归实现目标物体的稳健姿态对齐。构建多类型堆叠数据集IPA-Stack++,并在单类与混合类工业零部件抓取任务中进行测试。实验结果表明:所提方法在单类型和多类型场景中的抓取成功率明显优于现有其他同类方法,展现出良好的抓取稳定性与通用性。

关键词: 六自由度抓取检测, 无标注学习, 物理规则约束, 几何结构先验, 多任务点云感知

Abstract:

To improve the stability and cross-category generalization capability of grasp pose estimation in complex stacked scenes, an annotation-free 6-DoF grasp detection method integrating physical rules and geometric structure priors was proposed. In the offline stage, a template library of feasible grasp poses was constructed based on multi-physical constraints, without relying on manual grasp annotations. In the network design, the modeling of structural symmetry of objects and spatial overlap relationships was introduced; a geometric guidance mechanism with occlusion perception and exposure modeling capabilities was designed, and robust pose alignment of target objects was achieved by combining keypoint regression. A multi-type stacked dataset IPA-Stack++ was constructed, and tests were conducted in grasping tasks of single-category and mixed-category industrial parts. Experimental results show that the grasp success rate of the proposed method in single-type and multi-type scenes is significantly superior to other existing methods of the same type, demonstrating good grasp stability and versatility.

Key words: 6-DoF grasp detection, annotation-free learning, physical rule constraint, geometric structure prior, multi-task point cloud perception

中图分类号: