基于数据驱动的多智能体规划识别与博弈围捕

王勇; 李尚昊; 徐伯辰; 方浩

doi:10.15918/j.tbit1001-0645.2024.167

基于数据驱动的多智能体规划识别与博弈围捕

Multi-Agent Planning Recognition and Game Encirclement Based on Data Driving

摘要

摘要: 在进行实时对抗的任务中，对于敌方的动作识别较为困难，需要根据对方的移动轨迹或行为来分析对方的意图，预测其未来目标，构建规划策略库. 针对此问题，提出基于数据驱动的多智能体识别算法，该算法首先采用基于自动机的特征提取方法，获得规划需要的位置和任务信息；然后将规划识别问题转换为多分类问题，并从单智能体角度切入，给出了一种基于极端梯度提升（extreme gradient boosting，XGBoost）的多分类模型；之后，对于多智能体之间可能存在的合作行为，使用无监督学习的一种基于密度对噪声鲁棒的空间聚类算法（density-based spatial clustering of applications with noise，DBSCAN）对多智能体进行分簇，以促进协同合作. 对于同簇智能体，构建了一种针对多智能体的多分类模型，完成对多智能体的目标预测. 在获悉敌方目标后，提出基于博弈的围捕逼停算法，构建非合作动态博弈模型，通过求解纳什均衡得到应对敌方的最优策略. 最后，通过仿真验证了所提出算法的有效性.

Abstract: To analyze and predict the opponent’s intentions and future targets scientifically in real-time adversarial tasks, a multi-agent recognition algorithm was proposed based on data driving. Firstly, taking a feature extraction method with automata, the arithmetic was arranged to obtain position and task information from samples for planning. Then, transforming the planning recognition into a multi-classification problem, a multi-classification model was developed based on extreme gradient boosting (XGBoost) to start with a single agent. And then, considering collaboration potential among agents, the density-based spatial clustering of applications with noise (DBSCAN) algorithm was employed for multi-agent clustering and collaborating. And a multi-classification model was then developed for clustered multi-agent systems to predict goals within the same cluster. After identifying the opponent's targets, an encirclement and stop strategy was designed based on game theory. Finally, a non-cooperative dynamic game model was constructed, and the Nash equilibrium solution was used to derive the optimal strategy. Simulation results demonstrate the effectiveness of the proposed algorithm.

HTML全文

参考文献(22)

施引文献

资源附件(0)