Partial Policy-Based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images

被引：38

作者：

Al, Walid Abdullah ^{[1
]}

Yun, Il Dong ^{[1
]}

机构：

[1] Hankuk Univ Foreign Studies, Dept Comp & Elect Syst Engn, Yongin 449791, South Korea

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2020年 / 39卷 / 04期

基金：

新加坡国家研究基金会;

关键词：

Actor-critic; landmark localization; medical image; partial policy; reinforcement learning; RECOGNITION; CT;

D O I：

10.1109/TMI.2019.2946345

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Utilizing the idea of long-term cumulative return, reinforcement learning (RL) has shown remarkable performance in various fields. We follow the formulation of landmark localization in 3D medical images as an RL problem. Whereas value-based methods have been widely used to solve RL-based localization problems, we adopt an actor-critic based direct policy search method framed in a temporal difference learning approach. In RL problems with large state and/or action spaces, learning the optimal behavior is challenging and requires many trials. To improve the learning, we introduce a partial policy-based reinforcement learning to enable solving the large problem of localization by learning the optimal policy on smaller partial domains. Independent actors efficiently learn the corresponding partial policies, each utilizing their own independent critic. The proposed policy reconstruction from the partial policies ensures a robust and efficient localization, where the sub-agents uniformly contribute to the state-transitions based on their simple partial policies mapping to binary actions. Experiments with three different localization problems in 3D CT and MR images showed that the proposed reinforcement learning requires a significantly smaller number of trials to learn the optimal behavior compared to the original behavior learning scheme in RL. It also ensures a satisfactory performance when trained on fewer images.

引用

页码：1245 / 1255

页数：11

共 43 条

[1]

Abdullah A, 2018, IEEE INT C BIOINFORM, P609, DOI 10.1109/BIBM.2018.8621575

[2] Automatic aortic valve landmark localization in coronary CT angiography using colonial walk [J].

Al, Walid Abdullah ;

Jung, Ho Yub ;

Yun, Il Dong ;

Jang, Yeonggul ;

Park, Hyung-Bok ;

Chang, Hyuk-Jae .

PLOS ONE, 2018, 13 (07)

[3] Evaluating reinforcement learning agents for anatomical landmark detection [J].

Alansary, Amir ;

Oktay, Ozan ;

Li, Yuanwei ;

Le Folgoc, Loic ;

Hou, Benjamin ;

Vaillant, Ghislain ;

Kamnitsas, Konstantinos ;

Vlontzos, Athanasios ;

Glocker, Ben ;

Kainz, Bernhard ;

Rueckert, Daniel .

MEDICAL IMAGE ANALYSIS, 2019, 53 :156-164

[4] Automatic View Planning with Multi-scale Deep Reinforcement Learning Agents [J].

Alansary, Amir ;

Le Folgoc, Loic ;

Vaillant, Ghislain ;

Oktay, Ozan ;

Li, Yuanwei ;

Bai, Wenjia ;

Passerat-Palmbach, Jonathan ;

Guerrero, Ricardo ;

Kamnitsas, Konstantinos ;

Hou, Benjamin ;

McDonagh, Steven ;

Glocker, Ben ;

Kainz, Bernhard ;

Rueckert, Daniel .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT I, 2018, 11070 :277-285

[5] Lung Nodule Detection via Deep Reinforcement Learning [J].

Ali, Issa ;

Hart, Gregory R. ;

Gunabushanam, Gowthaman ;

Liang, Ying ;

Muhammad, Wazir ;

Nartowt, Bradley ;

Kane, Michael ;

Ma, Xiaomei ;

Deng, Jun .

FRONTIERS IN ONCOLOGY, 2018, 8

[6]

[Anonymous], TECH REP

[7]

[Anonymous], 2016, Advances in Neural Information Processing Systems

[8]

[Anonymous], 2016, MED IMAGE COMPUTING, DOI 10.1007/978-3-319-46726-927

[9]

Barto A.G, 2004, HDB LEARNING APPROXI, P45, DOI [10.1109/9780470544785.ch2, DOI 10.1109/9780470544785.CH2]

[10] Multi-Modality Vertebra Recognition in Arbitrary Views Using 3D Deformable Hierarchical Model [J].

Cai, Yunliang ;

Osman, Said ;

Sharma, Manas ;

Landis, Mark ;

Li, Shuo .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (08) :1676-1693

← 1 2 3 4 5 →