Intelligent robotic sonographer: Mutual information-based disentangled reward learning from few demonstrations

被引：6

作者：

Jiang, Zhongliang ^{[1
]}

Bi, Yuan ^{[1
]}

Zhou, Mingchuan ^{[2
]}

Hu, Ying ^{[3
]}

Burke, Michael ^{[4
]}

Navab, Nassir ^{[1
,5
]}

机构：

[1] Tech Univ Munich, Chair Comp Aided Med Procedures & Augmented Real, Munich, Germany

[2] Zhejiang Univ, Coll Biosyst Engn & Food Sci, 866 Yuhangtang Rd, Hangzhou 310027, Peoples R China

[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave, Shenzhen 518055, Peoples R China

[4] Monash Univ, Dept Elect & Comp Syst Engn, Clayton, Vic, Australia

[5] Johns Hopkins Univ, Lab Comp Aided Med Procedures, Baltimore, MD USA

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2024年 / 43卷 / 07期

关键词：

Robotic ultrasound; medical robotics; learning from demonstration; latent feature disentanglement; FETAL ULTRASOUND; LOCALIZATION; GUIDANCE; SYSTEM;

D O I：

10.1177/02783649231223547

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Ultrasound (US) imaging is widely used for biometric measurement and diagnosis of internal organs due to the advantages of being real-time and radiation-free. However, due to inter-operator variations, resulting images highly depend on the experience of sonographers. This work proposes an intelligent robotic sonographer to autonomously "explore" target anatomies and navigate a US probe to standard planes by learning from the expert. The underlying high-level physiological knowledge from experts is inferred by a neural reward function, using a ranked pairwise image comparison approach in a self-supervised fashion. This process can be referred to as understanding the "language of sonography." Considering the generalization capability to overcome inter-patient variations, mutual information is estimated by a network to explicitly disentangle the task-related and domain features in latent space. The robotic localization is carried out in coarse-to-fine mode based on the predicted reward associated with B-mode images. To validate the effectiveness of the proposed reward inference network, representative experiments were performed on vascular phantoms ("line" target), two types of ex vivo animal organ phantoms (chicken heart and lamb kidney representing "point" target), and in vivo human carotids. To further validate the performance of the autonomous acquisition framework, physical robotic acquisitions were performed on three phantoms (vascular, chicken heart, and lamb kidney). The results demonstrated that the proposed advanced framework can robustly work on a variety of seen and unseen phantoms as well as in vivo human carotid data. Code: https://github.com/yuan-12138/MI-GPSR. Video: https://youtu.be/u4ThAA9onE0.

引用

页码：981 / 1002

页数：22