从几何解析到语义推理:机器人抓取感知范式的演进
DOI:
作者:
作者单位:

1.国防科技大学计算机学院;2.国防科技大学 计算机学院

作者简介:

通讯作者:

中图分类号:

TP242.6

基金项目:

国家自然科学基金资助项目(62522219,62372457,62132021,62572477)


From Geometric Analysis to Semantic Reasoning: The Evolution of Robotic Grasping Perception Paradigms
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献()
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    机器人抓取感知是实现机器人自主操作与具身智能的重要基础,其技术范式正经历从依赖显式几何建模的解析法,向以数据驱动学习与语义推理增强为核心的智能感知体系的深刻变革。围绕机器人抓取感知范式的演进脉络,对相关研究进行了系统综述,阐述了从解析几何模型驱动、视觉数据驱动以及语义理解与推理增强三个递进阶段的演变过程,并剖析了各阶段的代表性算法与关键技术路线。通过对不同范式在输入模态、数据需求、泛化能力与任务适应性等方面的对比分析,总结了各类方法在非结构化环境下的优势与局限。此外,系统梳理了抓取数据集从平面基准到大规模综合数据的演进历程,并剖析了由任务可靠性与提议准确度构成的量化评价体系。进一步总结了当前机器人抓取感知在仿真到现实迁移、推理效率、跨模态信息融合以及复杂任务扩展等方面面临的共性挑战,并展望了结合具身基础模型与灵巧操作的发展趋势,旨在为构建高泛化、强理解能力的通用机器人抓取系统提供参考借鉴。

    Abstract:

    Robotic grasping perception is a fundamental prerequisite for autonomous manipulation and embodied intelligence. The technical paradigm is undergoing a profound shift from analytical methods based on explicit geometric modeling to intelligent perception frameworks driven by data-driven learning and enhanced semantic reasoning. Research on robotic grasping perception was systematically reviewed along the lines of paradigm evolution. The evolutionary process was described through three progressive stages: analytical geometry-driven methods, visual data-driven methods, and semantic understanding and reasoning enhancement. Representative algorithms and key technical pathways for each stage were examined and analyzed. Through a comparative analysis of input modalities, data requirements, generalization ability, and task adaptability across different paradigms, the advantages and limitations of various methods in unstructured environments were summarized. Furthermore, the evolution of grasping datasets from planar benchmarks to large-scale comprehensive data was systematically traced, and the quantitative evaluation system composed of task reliability and proposal accuracy was analyzed. Prevailing challenges—including sim-to-real transfer, inference efficiency, cross-modal information fusion, and the extension to complex tasks—are identified. Future development trends that integrate embodied foundation models with dexterous manipulation are discussed to provide references for building general-purpose robotic grasping systems with high generalization performance and robust task comprehension.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-02-02
  • 最后修改日期:2026-03-18
  • 录用日期:2026-03-19
  • 在线发布日期:
  • 出版日期:
文章二维码