基于数据驱动的多智能体规划识别与博弈围捕

Multi-Agent Planning Recognition and Game Encirclement Based on Data Driving

  • 摘要: 在进行实时对抗的任务中,对于敌方的动作识别较为困难,需要根据对方的移动轨迹或行为来分析对方的意图,预测其未来目标,构建规划策略库. 针对此问题,提出基于数据驱动的多智能体识别算法,该算法首先采用基于自动机的特征提取方法,获得规划需要的位置和任务信息;然后将规划识别问题转换为多分类问题,并从单智能体角度切入,给出了一种基于极端梯度提升(extreme gradient boosting,XGBoost) 的多分类模型;之后,对于多智能体之间可能存在的合作行为,使用无监督学习的一种基于密度对噪声鲁棒的空间聚类算法(density-based spatial clustering of applications with noise,DBSCAN)对多智能体进行分簇,以促进协同合作. 对于同簇智能体,构建了一种针对多智能体的多分类模型,完成对多智能体的目标预测. 在获悉敌方目标后,提出基于博弈的围捕逼停算法,构建非合作动态博弈模型,通过求解纳什均衡得到应对敌方的最优策略. 最后,通过仿真验证了所提出算法的有效性.

     

    Abstract: To analyze and predict the opponent’s intentions and future targets scientifically in real-time adversarial tasks, a multi-agent recognition algorithm was proposed based on data driving. Firstly, taking a feature extraction method with automata, the arithmetic was arranged to obtain position and task information from samples for planning. Then, transforming the planning recognition into a multi-classification problem, a multi-classification model was developed based on extreme gradient boosting (XGBoost) to start with a single agent. And then, considering collaboration potential among agents, the density-based spatial clustering of applications with noise (DBSCAN) algorithm was employed for multi-agent clustering and collaborating. And a multi-classification model was then developed for clustered multi-agent systems to predict goals within the same cluster. After identifying the opponent's targets, an encirclement and stop strategy was designed based on game theory. Finally, a non-cooperative dynamic game model was constructed, and the Nash equilibrium solution was used to derive the optimal strategy. Simulation results demonstrate the effectiveness of the proposed algorithm.

     

/

返回文章
返回