Vision-based drone control for autonomous UAV cinematography

被引:4
作者
Mademlis, Ioannis [1 ]
Symeonidis, Charalampos [1 ]
Tefas, Anastasios [1 ]
Pitas, Ioannis [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki, Greece
关键词
UAV cinematography; Intelligent shooting; Autonomous drones; PID control; Deep neural networks; Vision-based control; Mobile robotics; Object detection; CLASSIFICATION; SHOTS;
D O I
10.1007/s11042-023-15336-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the most important aesthetic concepts in autonomous Unmanned Aerial Vehicle (UAV) cinematography is the UAV/Camera Motion Type (CMT), describing the desired UAV trajectory relative to a (still or moving) physical target/subject being filmed. Usually, for the drone to autonomously execute such a CMT and capture the desired shot in footage, the 3D states (positions/poses within the world) of both the UAV/camera and the target are required as input. However, the target's 3D state is not typically known in non-staged settings. This paper proposes a novel framework for reformulating each desired CMT as a set of requirements that interrelate 2D visual information, UAV trajectory and camera orientation. Then, a set of CMT-specific vision-driven Proportional-Integral-Derivative (PID) UAV controllers can be implemented, by exploiting the above requirements to form suitable error signals. Such signals drive continuous adjustments to instant UAV motion parameters, separately at each captured video frame/time instance. The only inputs required for computing each error value are the current 2D pixel coordinates of the target's on-frame bounding box, detectable by an independent, off-the-shelf, real-time, deep neural 2D object detector/tracker vision subsystem. Importantly, neither UAV nor target 3D states are required ever to be known or estimated, while no depth maps, target 3D models or camera intrinsic parameters are necessary. The method was implemented and successfully evaluated in a robotics simulator, by properly reformulating a set of standard, formalized UAV CMTs.
引用
收藏
页码:25055 / 25083
页数:29
相关论文
共 60 条
[1]   Optimal trajectory planning for cinematography with multiple Unmanned Aerial Vehicles [J].
Alcantara, Alfonso ;
Capitan, Jesus ;
Cunha, Rita ;
Ollero, Anibal .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 140
[2]   Autonomous Execution of Cinematographic Shots With Multiple Drones [J].
Alcantara, Alfonso ;
Capitan, Jesus ;
Torres-Gonzalez, Arturo ;
Cunha, Rita ;
Ollero, Anibal .
IEEE ACCESS, 2020, 8 :201300-201316
[3]   Adaptive PID Controllers: State of the Art and Development Prospects [J].
Alexandrov, A. G. ;
Palenov, M. V. .
AUTOMATION AND REMOTE CONTROL, 2014, 75 (02) :188-199
[4]  
Astrom K.J., 2006, Advanced PID control
[5]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[6]   Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition [J].
Bhattacharya, Subhabrata ;
Mehran, Ramin ;
Sukthankar, Rahul ;
Shah, Mubarak .
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (03) :686-696
[7]  
Bonatti R, 2019, Arxiv, DOI arXiv:1904.02319
[8]  
Bradski G., 2005, Intel Technology Journal, V9, P119, DOI DOI 10.1535/ITJ.0902.03
[9]   SiamAPN plus plus : Siamese Attentional Aggregation Network for Real-Time UAV Tracking [J].
Cao, Ziang ;
Fu, Changhong ;
Ye, Junjie ;
Li, Bowen ;
Li, Yiming .
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, :3086-3092
[10]  
Caraballo LE, 2020, P INT C INT ROB SYST