End-to-end Active Object Tracking via Reinforcement Learning

Cited by: 0
Authors
Luo, Wenhan [1]
Sun, Peng [1]
Zhong, Fangwei [2]
Liu, Wei [1]
Zhang, Tong [1]
Wang, Yizhou [2]
Affiliations
[1] Tencent AI Lab, Bellevue, WA 98004 USA
[2] Peking Univ, Beijing, Peoples R China
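Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80 | 2018, Vol. 80
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract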
We study active object tracking, where a tracker takes visual observations (i.e., a frame sequence) as input and produces camera control signals (e.g., move forward, turn left). Conventional methods tackle tracking and camera control separately, which makes the two components challenging to tune jointly. They also require substantial human effort for labeling and many expensive trial-and-error runs in the real world. To address these issues, we propose an end-to-end solution via deep reinforcement learning, in which a ConvNet-LSTM function approximator is adopted for direct frame-to-action prediction. We further propose an environment augmentation technique and a customized reward function, both of which are crucial for successful training. The tracker, trained in simulators (ViZDoom, Unreal Engine), generalizes well to unseen object moving paths, unseen object appearances, unseen backgrounds, and distracting objects. It can also restore tracking after occasionally losing the target. In experiments on the VOT dataset, we also find that the tracking ability, obtained solely from simulators, can potentially transfer to real-world scenarios.
Pages: 10