Inverse Submodular Maximization with Application to Human-in-the-Loop Multi-Robot Multi-Objective Coverage Control

被引:0
作者
Shi, Guangyao [1 ]
Sukhatme, Gaurav S. [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
来源
2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024) | 2024年
关键词
OPTIMIZATION;
D O I
10.1109/IROS58592.2024.10801908
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a new type of inverse combinatorial optimization, Inverse Submodular Maximization (ISM), for human-in-the-loop multi-robot coordination. Forward combinatorial optimization - solving a combinatorial problem given the reward (cost)-related parameters - is widely used in multi-robot coordination. In the standard pipeline, the reward (cost)related parameters are designed offline by domain experts. These parameters are utilized for coordinating robots online. What if non-expert human supervisors desire to change these parameters during task execution to adapt to some new requirements? We are interested in the case where human supervisors can suggest what actions to take, and the robots need to change these internal parameters accordingly. We study such problems from the perspective of inverse combinatorial optimization, i.e., the process of finding parameters given solutions to the problem. Specifically, we propose a new formulation for ISM, in which we aim to find a new set of parameters that minimally deviate from the current parameters while causing a greedy algorithm to output actions which are the same as those desired by the human supervisors. We show that such problems can be formulated as a Mixed Integer Quadratic Program (MIQP) which is intractable for existing solvers when the problem size is large. We propose a new Branch & Bound algorithm to solve such problems. In numerical simulations, we demonstrate how to use ISM in multi-robot multi-objective coverage control, and we show that the proposed algorithm provides significant advantages in running time and peak memory usage compared to directly using an existing solver.
引用
收藏
页码:8921 / 8928
页数:8
相关论文
共 34 条
[1]  
Biyik E, 2019, PR MACH LEARN RES, V100
[2]   An inverse optimization approach for a capacitated vehicle routing problem [J].
Chen, Lu ;
Chen, Yuyi ;
Langevin, Andre .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 295 (03) :1087-1098
[3]  
Finn C, 2016, PR MACH LEARN RES, V48
[4]   Inverse combinatorial optimization: A survey on problems, methods, and results [J].
Heuberger, C .
JOURNAL OF COMBINATORIAL OPTIMIZATION, 2004, 8 (03) :329-361
[5]   Sampling-based robotic information gathering algorithms [J].
Hollinger, Geoffrey A. ;
Sukhatme, Gaurav S. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (09) :1271-1287
[6]   Inverse Problem of Power System Reliability Evaluation: Analytical Model and Solution Method [J].
Hu, Bo ;
Xie, Kaigui ;
Tai, Heng-Ming .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (06) :6569-6578
[7]   Inverse optimal control from incomplete trajectory observations [J].
Jin, Wanxin ;
Kulic, Dana ;
Mou, Shaoshuai ;
Hirche, Sandra .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2021, 40 (6-7) :848-865
[8]  
Krause A, 2008, J MACH LEARN RES, V9, P235
[9]  
Li W, 2005, IEEE DECIS CONTR P, P2542
[10]   From human to humanoid locomotion-an inverse optimal control approach [J].
Mombaur, Katja ;
Truong, Anh ;
Laumond, Jean-Paul .
AUTONOMOUS ROBOTS, 2010, 28 (03) :369-383