Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking

被引:4
|
作者
Wang, Xucheng [1 ]
Yang, Xiangyang [1 ]
Ye, Hengzhou [1 ]
Li, Shuiwang [1 ]
机构
[1] Guilin Univ Technol, Guilin, Peoples R China
关键词
UAV tracking; Disentangled representation; Mutual information;
D O I
10.1109/ICME55011.2023.00231
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficiency has been a critical problem in UAV tracking due to limitations in computation resources, battery capacity, and unmanned aerial vehicle maximum load. Although discriminative correlation filters (DCF)-based trackers prevail in this field for their favorable efficiency, some recently proposed lightweight deep learning (DL)-based trackers using model compression demonstrated quite remarkable CPU efficiency as well as precision. Unfortunately, the model compression methods utilized by these works, though simple, are still unable to achieve satisfying tracking precision with higher compression rates. This paper aims to exploit disentangled representation learning with mutual information maximization (DR-MIM) to further improve DL-based trackers' precision and efficiency for UAV tracking. The proposed disentangled representation separates the feature into an identity-related and an identity-unrelated features. Only the latter is used, which enhances the effectiveness of the feature representation for subsequent classification and regression tasks. Extensive experiments on four UAV benchmarks, including UAV123@10fps, DTB70, UAVDT and VisDrone2018, show that our DR-MIM tracker significantly outperforms state-of-the-art UAV tracking methods.
引用
收藏
页码:1331 / 1336
页数:6
相关论文
共 50 条
  • [1] Real-Time UAV Tracking Through Disentangled Representation With Mutual Information Maximization
    Ye, Hengzhou
    Yang, Xiangyang
    Wu, You
    Wang, Xucheng
    Li, Yongxin
    Li, Shuiwang
    IEEE ACCESS, 2024, 12 : 135325 - 135337
  • [2] Disentangled Speaker Representation Learning via Mutual Information Minimization
    Mun, Sung Hwan
    Han, Min Hyun
    Kim, Minchan
    Lee, Dongjune
    Kim, Nam Soo
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 89 - 96
  • [3] Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
    Zhao, Long
    Wang, Yuxiao
    Zhao, Jiaping
    Yuan, Liangzhe
    Sun, Jennifer J.
    Schroff, Florian
    Adam, Hartwig
    Peng, Xi
    Metaxas, Dimitris
    Liu, Ting
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12788 - 12797
  • [4] Unsupervised Deep Representation Learning for Real-Time Tracking
    Wang, Ning
    Zhou, Wengang
    Song, Yibing
    Ma, Chao
    Liu, Wei
    Li, Houqiang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 400 - 418
  • [5] Unsupervised Deep Representation Learning for Real-Time Tracking
    Ning Wang
    Wengang Zhou
    Yibing Song
    Chao Ma
    Wei Liu
    Houqiang Li
    International Journal of Computer Vision, 2021, 129 : 400 - 418
  • [6] Graph Representation Learning via Graphical Mutual Information Maximization
    Peng, Zhen
    Huang, Wenbing
    Luo, Minnan
    Zheng, Qinghua
    Rong, Yu
    Xu, Tingyang
    Huang, Junzhou
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 259 - 270
  • [7] Multimodal Representation Learning via Maximization of Local Mutual Information
    Liao, Ruizhi
    Moyer, Daniel
    Cha, Miriam
    Quigley, Keegan
    Berkowitz, Seth
    Horng, Steven
    Golland, Polina
    Wells, William M.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 273 - 283
  • [8] Real-time eye tracking using representation learning and regression
    Dharbaneshwer, S. J.
    Sowmya, Gayathri G.
    Chauhan, Sumit Singh
    Shekhawat, Bharat Singh
    Kumar, Lava
    Ghosh, Soumitra
    PROCEEDINGS OF 7TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA, CODS-COMAD 2024, 2024, : 298 - 306
  • [9] Mutual Information Maximization on Disentangled Representations for Differential Morph Detection
    Soleymani, Sobhan
    Dabouei, Ali
    Taherkhani, Fariborz
    Dawson, Jeremy
    Nasrabadi, Nasser M.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1730 - 1740
  • [10] Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking
    Huang, Ziyuan
    Fu, Changhong
    Li, Yiming
    Lin, Fuling
    Lu, Peng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2891 - 2900